Show understanding of how sound is represented and encoded
1.2 Multimedia – Sound Representation & Encoding
🎧 1.2.1 What is Sound?
Sound is a vibration that travels through a medium (air, water, solids). Think of it like a ripple in a pond – the ripples move outward, carrying energy. When these ripples reach our ears, they are interpreted as music, speech, or noise.
📊 1.2.2 From Waves to Numbers
To store sound on a computer, we need to turn the continuous wave into a sequence of numbers. This process involves two key steps:
- Sampling – taking a snapshot of the wave at regular time intervals.
- Quantisation – assigning each snapshot a finite number of bits.
⏱️ 1.2.3 Sampling
Sampling rate, denoted $f_s$, is how many times per second we measure the wave. The time between samples is $T_s = \frac{1}{f_s}$.
The Nyquist theorem tells us we must sample at least twice the highest frequency we want to capture: $$ f_s \ge 2 f_{\text{max}} $$ For human hearing (up to 20 kHz), a common choice is $f_s = 44.1$ kHz (CD quality).
🔢 1.2.4 Quantisation
Each sample is rounded to the nearest value that can be represented with $N$ bits. The number of possible levels is $2^N$. For example, 16‑bit audio gives $65\,536$ levels.
The quantisation step size (resolution) is: $$ \Delta = \frac{V_{\text{max}} - V_{\text{min}}}{2^N} $$ where $V_{\text{max}}$ and $V_{\text{min}}$ are the maximum and minimum voltage the ADC can read.
💾 1.2.5 Encoding Formats
| Format | Bits per Sample | Channels | Typical Use |
|---|---|---|---|
| -------- | ----------------- | ---------- | ------------- |
| PCM (uncompressed) | 16 | Mono/ Stereo | CDs, WAV files |
| MP3 | Variable | Stereo | Music streaming |
| AAC | Variable | Stereo | YouTube, podcasts |
| FLAC | 16/24 | Mono/ Stereo | Lossless archiving |
📈 1.2.6 Compression & Lossless vs Lossy
Compression reduces file size by removing redundant or inaudible information.
- Lossless (e.g., FLAC): no data is lost; perfect reconstruction.
- Lossy (e.g., MP3, AAC): some data is discarded; smaller size but slight quality loss.
🗂️ 1.2.7 Example: PCM Sample Table
| Sample # | Time (ms) | Amplitude (µV) |
|---|---|---|
| 1 | 0.000 | 0 |
| 2 | 0.023 | +12 µV |
| 3 | 0.046 | -8 µV |
| 4 | 0.069 | +5 µV |
🎓 1.2.8 Key Takeaways
- Sound is a continuous wave that must be sampled and quantised to be stored digitally.
- The sampling rate determines how accurately we capture high‑frequency content.
- More bits per sample mean finer amplitude resolution.
- Encoding formats balance file size against audio quality.
- Compression can be lossless (perfect fidelity) or lossy (small size, slight loss).
Revision
Log in to practice.