Apparatus and method for realizing a SAOC downmix of 3D audio content
Simple Summary
Content extracted from patent full text and abstract with AI.
This patent describes an apparatus and method for downmixing 3D audio content using Spatial Audio Object Coding (SAOC). Multiple individual audio object signals are mixed into a smaller number of transport channels for transmission or storage. The mixing follows two rules applied in sequence: the first mixes the object signals into premixed channels, and the second mixes the premixed channels into the transport channels. At the receiver, information on these rules, along with associated metadata, allows the reconstruction of multi-channel or object-based 3D audio for flexible playback setups. Because fewer channels than objects are transmitted, the approach reduces data size while preserving spatial integrity and flexibility in output rendering formats.
Use Cases
- Efficient streaming of high-quality 3D or immersive audio content over bandwidth-constrained networks (e.g., online music/video platforms).
- Storage of 3D audio scenes for movies, games, or virtual/augmented reality experiences in reduced file sizes.
- Live broadcasting of object-based audio (such as sports events or concerts) where listeners can customize their audio mix at home.
- Deployment in consumer home theater systems, allowing for flexible loudspeaker setups and immersive playback regardless of speaker count or placement.
- Integration into professional audio production and post-production tools to enable collaborative object-based mixing and distribution.
Benefits
- Significantly reduces the data rate required to deliver or store complex 3D audio scenes without substantial loss of spatial image quality.
- Allows flexibility in decoding, enabling end-users to render audio according to their specific playback setup (from binaural headphones to large multi-speaker arrays).
- Enables object-based audio rendering, which supports personalization (e.g., adjusting the volume/location of individual sound sources).
- Compatible with legacy audio devices for basic playback, while providing advanced rendering on modern systems.
- Facilitates efficient and scalable audio distribution for various applications, including streaming, broadcasting, and archival storage.
Technical Classifications (CPCs)
Main Classifications
Electrical & Electronic Tech
Physics & Measurement
Sub Classifications
Electric Communication Technique
Musical Instruments & Acoustics
CPC Codes
Inventors & Applicants
Inventors
Applicants
Fraunhofer Ges Forschung
Friedrich Alexander Universität Erlangen Nürnberg
Patent Abstract
An apparatus for generating one or more audio output channels is provided. The apparatus comprises a parameter processor (110) for calculating output channel mixing information and a downmix processor (120) for generating the one or more audio output channels. The downmix processor (120) is configured to receive an audio transport signal comprising one or more audio transport channels, wherein two or more audio object signals are mixed within the audio transport signal, and wherein the number of the one or more audio transport channels is smaller than the number of the two or more audio object signals. The audio transport signal depends on a first mixing rule and on a second mixing rule. The first mixing rule indicates how to mix the two or more audio object signals to obtain a plurality of premixed channels. Moreover, the second mixing rule indicates how to mix the plurality of premixed channels to obtain the one or more audio transport channels of the audio transport signal. The parameter processor (110) is configured to receive information on the second mixing rule, wherein the information on the second mixing rule indicates how to mix the plurality of premixed signals such that the one or more audio transport channels are obtained. Moreover, the parameter processor (110) is configured to calculate the output channel mixing information depending on an audio objects number indicating the number of the two or more audio object signals, depending on a premixed channels number indicating the number of the plurality of premixed channels, and depending on the information on the second mixing rule. The downmix processor (120) is configured to generate the one or more audio output channels from the audio transport signal depending on the output channel mixing information.
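The two mixing rules in the abstract can be illustrated with a small sketch. It assumes that each rule can be written as a linear mixing matrix, so that applying the first rule and then the second is equivalent to applying one combined matrix. The names `premix_rule`, `transport_rule`, and `combined_rule`, the gain values, and the sample data are all hypothetical and are not taken from the patent. In this reading, the audio objects number and the premixed channels number correspond to the matrix dimensions.

```python
# Illustrative sketch only: names, shapes, and gain values are assumptions,
# not taken from the patent text.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    rows, inner, cols = len(a), len(b), len(b[0])
    assert all(len(row) == inner for row in a), "inner dimensions must match"
    return [[sum(a[i][k] * b[k][j] for k in range(inner)) for j in range(cols)]
            for i in range(rows)]

# First mixing rule: 3 audio objects -> 2 premixed channels (hypothetical gains).
# Shape: n_premixed x n_objects.
premix_rule = [
    [1.0, 0.5, 0.0],
    [0.0, 0.5, 1.0],
]

# Second mixing rule: 2 premixed channels -> 1 transport channel.
# Shape: n_transport x n_premixed.
transport_rule = [
    [0.5, 0.5],
]

# Combined rule: maps the objects to the transport channels in one step.
combined_rule = matmul(transport_rule, premix_rule)

# Two samples per object, as a matrix of shape n_objects x n_samples.
objects = [
    [1.0, 2.0],
    [4.0, 0.0],
    [3.0, -2.0],
]

# Applying the rules in sequence and applying the combined rule should agree.
two_stage = matmul(transport_rule, matmul(premix_rule, objects))
one_stage = matmul(combined_rule, objects)

print("combined rule:", combined_rule)
print("transport signal:", one_stage)
```

Under this assumption, a receiver that knows the second mixing rule and the two channel counts has what it needs to size the combined matrix, which is one way to read why the parameter processor depends on exactly those three inputs.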
Key Information
Publication No.
EP2830048A1
Family ID
49385153
Publication Date
2015-01-28
Application No.
EP13189281A
Application Date
2013-10-18
Priority Date
2013-07-22
Granted
Yes (34/68)
Possible Cooperation
For further information please contact the transfer office.