How Digital Data Works

Req 3a — Digitizing Text, Sound & Images

3a.
Explain to your counselor how text, sound, and pictures are digitized for storage.

Everything on your phone — every song, every photo, every text message — is stored as nothing more than long strings of ones and zeros. It seems impossible that two simple digits could capture the richness of a symphony or the detail of a photograph, but that is exactly what digitization does. Understanding how it works is one of the most fundamental concepts in all of computing.

What Is Digitization?

Digitization is the process of converting information from the real world (analog) into digital form (binary numbers). The word “digital” comes from “digit” — a number. At its core, every piece of digital data is represented using the binary number system, which uses only two digits: 0 and 1. Each 0 or 1 is called a bit (short for “binary digit”). Eight bits grouped together form a byte.

How Text Is Digitized

Computers store text by assigning a unique number to every character. The earliest system, called ASCII (American Standard Code for Information Interchange), uses 7 bits to represent 128 characters — the English alphabet (uppercase and lowercase), numbers 0–9, punctuation marks, and special control characters.

For example:

ASCII worked fine for English, but it could not represent characters from other languages. Unicode expanded the system to handle over 149,000 characters from virtually every writing system on Earth — including Chinese, Arabic, Hindi, emoji, and even ancient Egyptian hieroglyphics.

How Sound Is Digitized

Sound in the real world is a continuous wave — air molecules vibrating at different frequencies and amplitudes. To digitize sound, a computer takes thousands of samples of the sound wave every second, measuring its amplitude at each point. This process is called sampling.

The two key factors in digital audio quality are:

Think of it like a flipbook: each page shows a single frame, and when you flip through them fast enough, you see smooth motion. Audio sampling works the same way — enough snapshots played back fast enough sound like continuous music.

Diagram showing a smooth analog sound wave being converted to a stepped digital representation with sampling points

How Images Are Digitized

A digital image is made of tiny dots called pixels (short for “picture elements”). Each pixel stores color information as numbers. The more pixels in an image, the more detail it can show — this is why camera specs mention megapixels (millions of pixels).

For color images, each pixel’s color is typically described using three values: the amount of red, green, and blue (RGB) light. Each color channel usually uses 8 bits (values from 0 to 255), giving a total of 24 bits per pixel and 16.7 million possible colors.

For example:

Diagram showing a forest photograph zoomed in to reveal individual colored pixels with RGB values

Putting It All Together

Data TypeHow It Is DigitizedKey Units
TextEach character assigned a binary number (ASCII/Unicode)Bits per character (7 for ASCII, up to 32 for Unicode)
SoundContinuous wave sampled thousands of times per secondSample rate (Hz) and bit depth
ImagesGrid of pixels, each with RGB color valuesResolution (megapixels) and color depth (bits per pixel)

The key idea across all three: digitization converts continuous real-world information into discrete numbers that computers can store, process, and transmit. Some detail is always lost in this conversion — a digital photo is not the actual scene, and a digital recording is not the actual performance — but with enough samples, bits, and pixels, the digital version is virtually indistinguishable from the original.

Code.org — How Computers Work: Binary & Data A playlist of short, engaging videos explaining binary, data representation, and how computers process information.

Now that you know how data gets into a computer, the next question is: how do you make all that data smaller so it does not fill up your storage? That is where compression comes in.