Dynamic-Range Control: Finally in the Hands of Listeners
This story is excerpted from the Radio World ebook “Streaming Best Practices.”
Few people realize that xHE-AAC, the latest generation of the AAC codec family, includes an option for dynamic-range control of both live and recorded audio.
As an audiophile, that discovery made me ask: What if listeners could finally adjust how wide or narrow the loudness range feels — tailoring it to their environment and personal taste — instead of accepting a one-size-fits-all mix?
That led to live internet testing and a perceptual listening trial to understand how this technology behaves in practice. What happened may be the start of a revolution: dynamic-range control moving from the content producer to the listener’s fingertips.
A little background

Controlling the dynamic range of audio — the difference between soft and loud passages — has challenged sound engineers since Edison’s day. His early phonographs hissed so badly that he turned to quieter wax cylinders, the first of many compromises between technical limits and artistic intent.
For most of the 20th century, audio engineers fought those limits. Compressors, limiters and automatic gain circuits were used to keep recording and broadcast levels consistent. Now that digital audio has arrived, noise and headroom are no longer the problem.
Today’s delivery chains can preserve the full dynamic range of a symphony hall, yet many people hear that music on phones, smart speakers or in cars, where the equipment or background noise masks quiet detail. The problem flipped: The medium can deliver more than listeners can comfortably use.
From loudness wars to loudness standards

That drive to control audio also triggered “loudness wars” and uneven loudness and the search for measurement standards.
In 2006 the International Telecommunication Union issued ITU-R BS.1770, defining a reliable method of measuring perceived loudness. Suddenly engineers could quantify what once was subjective.
The Audio Engineering Society followed with Technical Document TD1008 in 2021 (now standard AES 77-2023). It recommends loudness “targets” for different media types — speech, music and album vs. track normalization — so that content from different sources play back at consistent levels.
As editor of TD1008, I remember that the committee saw normalization as only an interim, sending stage. An unsolved problem, we wrote, was “content with wide dynamic range.” Classical music, film scores and acoustic jazz simply didn’t fit a target model, and compressing them risks destroying their expressive quality.
The unfinished work: Listener-side control
When the committee drafted TD1008, MPEG DRC was an obscure, file-based feature within xHE-AAC, and real-time DRC capabilities did not exist for streaming services. Because widespread adoption looked remote, the committee understood that loudness normalization still had to be handled before distribution. Consequently, we acknowledged that this document was an “interim” solution. TD1008 serves as a bridge to a time when listeners will finally have the option to hear music exactly as the artists and producers intended.
That meant shifting the dynamic-range control (DRC) from the producer to the listener. Rather than baking compression into audio for all, codecs in future devices would read standardized metadata and decide — at playout — how much to compress or expand. The same recording could sound natural in a quiet living room and intelligible in rush-hour traffic.
Enter xHE-AAC: Codec with built-in loudness and DRC
That codec is here. xHE-AAC (Extended High-Efficiency AAC), developed by Fraunhofer IIS and standardized by ISO/IEC 23003-3, is already in most major streaming services, smartphones, car infotainment systems and even internet browsers. (xHE-AAC is a registered trademark of Fraunhofer in Germany, the United States and other countries.)
xHE-AAC players are compatible with older AAC versions (although its newer encoding and metadata are not playable on older generation decoders).
Inside each xHE-AAC stream are two key data blocks:
- Loudness Info — objective measurements of program loudness using the ITU BS.1770 algorithm.
- DRC Metadata — a compact set of gain controls that a decoder applies dynamically.
Together, they let the playback devices manage loudness across programs and adapt the dynamic range to the listener’s context — all without altering the source content.
Smartphones could automatically tighten dynamics, going from headphones to the tiny speaker. A connected car system can apply a tighter DRC profile. In the quiet at home, a hi-fi system can play the original digital content with full range for critical listening. Each listener — or device — makes their own choice.
How the metadata works
xHE-AAC defines several DRC “presets” ranging from light to heavy control. The encoder stores them inside the audio, and the decoder carries out the loudness management.

Because encoding sends the original audio to everyone, quality is maintained (xHE-AAC is more efficient than HE-AAC). The original waveform remains intact, only the playback gain envelope changes according to the encoder’s direction. For producers, this means one well-mastered file can satisfy all streaming or podcast listeners.
This mechanism fulfills exactly what TD1008 anticipated. The document foresaw “consumer hardware and software upgrades” that would make dynamic-range metadata practical. Four years later, those upgrades are possible.
xHE-AAC’s implementation aligns with CTA-2075 and could support the loudness standards cited by TD1008. This would help loudness matching between content sources, such as different streams and podcasts.
Live real-time testing
I am honored to work with a small group of volunteers from classical public radio stations WQXR/New York Public Radio, WETA-FM in Washington, and Qbit GmbH and Fraunhofer IIS, both based in Germany.
Live digital audio from the stations’ control rooms were sent in full dynamic range via LPCM to Qbit, which encoded a total of four program sources in xHE-AAC audio on a nearly 7,000-mile round trip on the internet.

The streams ran continuously for most of 2025. An Android player app decoded the xHE-AAC and provided controls for selecting DRC modes.
As an audiophile I was thrilled with the sound, both in pristine form and with the coder’s well-designed DRC.
I found that in my old SUV, the DRC allowed me to enjoy every moment of the classical music and no “reaching for the volume control.” An added benefit is that popular music, which seldom needs DRC because it is carefully mixed and processed, is reproduced faithfully and sounds as intended by the artists and producers.
What do listeners think?
On Nov. 10 we completed a pilot test at New York Public Radio with a group of listeners.

The listener sat in front of a pair of Genelec 1030 monitor speakers, surrounded by four of the same speakers (on the floor) playing road noise recorded at 60 mph in a Lincoln Corsair PHEV on I-66 in Virginia and NYPR’s Mercedes-Benz Sprinter van on the New York City parkway along the Hudson River.
Classical music was selected with varying loudness (drops of >15 LU from loud parts). The road noise at times masked the music, depending on depth of the drops and the noise levels.
The results are being processed now, but data indicate that listeners favor DRC over the same music without DRC under certain circumstances.
In both noise environments, participants stated that it was easier to hear notes and more enjoyable, compared to the original recordings. Some discriminating listeners (recording engineers or musicians) commented on the excellent quality of the DRC.
This was only a pilot, and more extensive testing should be done with a large group of listeners to evaluate the benefits of DRC in a range of conditions, such as earphones, smart phones and smart speakers, as well as popular music.
Looking ahead
The AES Technical Council continues to monitor these advances and plans to revisit TD1008/AES 77 in connection with xHE-AAC. Android and iOS systems and Chromium-based browsers already decode xHE-AAC, so bringing out controls for DRC metadata should be a reasonable step.
It took 140 years to move from Edison’s wax cylinders — plagued by noise and limited range — to a codec that can intelligently shape dynamics for each listener in real time. After more than a century of technology controlling dynamics for the listener, the balance has shifted. Dynamic-Range Control is now literally in the listener’s hands — or more precisely, in their devices.
Many, many thanks go to Dr. Ellyn Sheffield for designing and conducting the perceptual testing, Steve Shultis of New York Public Radio and William Harrison of WETA-FM, Jan and Michael Bläsi of Qbit for encoding and live streaming, and members of Fraunhofer for participating and helping with technical assistance.
[Do you receive the Radio World SmartBrief newsletter each weekday morning? We invite you to sign up here.]
The post Dynamic-Range Control: Finally in the Hands of Listeners appeared first on Radio World.