Encoder, decoder and methods for backward compatible multi-resolution spatial-audio-object-coding
Simple SummaryContent extracted from patent full text and abstract with AI.
This invention relates to an audio encoder and decoder system for spatial audio object coding that allows for improved multi-resolution representation of audio objects, while maintaining backward compatibility with existing standards. The core idea is to encode and transmit not only standard side information (used for separating and reconstructing audio objects from a downmixed signal), but also enhanced, higher-frequency-resolution side information. The enhanced system can reconstruct audio with finer details, improving the quality, particularly for tonal sounds, while standard decoders can still operate using the basic information. The system dynamically adapts the resolution depending on the signal content and ensures efficient encoding without excessive data rates.
Use CasesContent extracted from patent full text and abstract with AI.
- Streaming and broadcasting of immersive or interactive audio (such as movies, games, VR or AR environments) where users may want to re-position or emphasize specific sound sources.
- Professional audio mixing, mastering, and post-production, allowing engineers to better isolate and manipulate individual audio objects within a mix.
- Teleconferencing or remote meetings, enabling participants’ voices or sounds to be separately rendered and positioned for clarity and intelligibility.
- Music education and karaoke systems, where individual instrument or vocal tracks can be isolated, muted, or emphasized by learners.
- Consumer media players and devices wishing to offer advanced playback features (e.g., adjusting the volume or position of voices or instruments) with compatibility for both old and new audio formats.
BenefitsContent extracted from patent full text and abstract with AI.
- Backwards compatibility – enhanced streams can be decoded by both standard and advanced decoders without loss of base functionality.
- Significantly improved audio quality and separation for object-based audio, especially for complex or tonal signals, by enabling higher-frequency-resolution processing.
- Efficient transmission and storage by encoding fine resolution information in a compact format, minimizing increases in bitrate.
- Flexible adaptation of processing resolution—dynamically tailored for different audio signal characteristics (e.g., high frequency for tonal, high time resolution for transient sounds).
- Enables advanced audio rendering and user interaction without requiring all users to upgrade equipment or software at once.
Technical Classifications (CPCs)
Main Classifications
Electrical & Electronic Tech
Physics & Measurement
Sub Classifications
Electronic Circuitry
Musical Instruments & Acoustics
CPC Codes
Inventors & Applicants
Applicants
Fraunhofer Ges Forschung
Univ Friedrich Alexander Er
Patent Abstract
A decoder for generating an un-mixed audio signal comprising a plurality of un-mixed audio channels is provided. Moreover, an encoder and an encoded audio signal is provided. The decoder comprises an un-mixing-information determiner for determining un-mixing information by receiving first parametric side information on the at least one audio object signal and second parametric side information on the at least one audio object signal, wherein the frequency resolution of the second parametric side information is higher than the frequency resolution of the first parametric side information. Moreover, the decoder comprises an un-mix module for applying the un-mixing information on a downmix signal, indicating a downmix of at least one audio object signal, to obtain an un-mixed audio signal comprising the plurality of un-mixed audio channels. The un-mixing-information determiner is configured to determine the un-mixing information by modifying the first parametric information and the second parametric information to obtain modified parametric information, such that the modified parametric information has a frequency resolution which is higher than the first frequency resolution.
Key Information
Publication No.
EP2717261A1
Family ID
48325510
Publication Date
2014-04-09
Application No.
EP13167485A
Application Date
2013-05-13
Priority Date
2012-10-05
Granted
Yes (17/34)
Possible Cooperation
For further information please contact the transfer office.