Apparatus and method for realizing a SAOC downmix of 3D audio content

Publication: EP2830048A1
Published: 2015-01-28
Family Size: 68
Granted: Yes (34/68)

Simple SummaryContent extracted from patent full text and abstract with AI.

This patent describes an apparatus and method for efficiently downmixing 3D audio content using Spatial Audio Object Coding (SAOC). The invention enables multiple individual audio object signals to be mixed into a smaller number of channels for transmission or storage, using specific mixing rules. At the receiver, these rules—along with associated metadata—allow the reconstruction of high-quality multi-channel or object-based 3D audio suitable for flexible playback setups. The approach achieves a significant reduction in data size while preserving audio spatial integrity and flexibility in output rendering formats.

Use CasesContent extracted from patent full text and abstract with AI.

  • Efficient streaming of high-quality 3D or immersive audio content over bandwidth-constrained networks (e.g., online music/video platforms).
  • Storage of 3D audio scenes for movies, games, or virtual/augmented reality experiences in reduced file sizes.
  • Live broadcasting of object-based audio (such as sports events or concerts) where listeners can customize their audio mix at home.
  • Deployment in consumer home theater systems, allowing for flexible loudspeaker setups and immersive playback regardless of speaker count or placement.
  • Integration into professional audio production and post-production tools to enable collaborative object-based mixing and distribution.

BenefitsContent extracted from patent full text and abstract with AI.

  • Significantly reduces the data rate required to deliver or store complex 3D audio scenes without substantial loss of spatial image quality.
  • Allows flexibility in decoding, enabling end-users to render audio according to their specific playback setup (from binaural headphones to large multi-speaker arrays).
  • Enables object-based audio rendering, which supports personalization (e.g., adjusting the volume/location of individual sound sources).
  • Compatible with legacy audio devices for basic playback, while providing advanced rendering on modern systems.
  • Facilitates efficient and scalable audio distribution for various applications, including streaming, broadcasting, and archival storage.

Technical Classifications (CPCs)

Main Classifications

Electrical & Electronic Tech

Physics & Measurement

Sub Classifications

Electric Communication Technique

Musical Instruments & Acoustics

CPC Codes

G10L19/008H04S3/00H04S3/006H04S3/008H04S3/02H04S7/305

Inventors & Applicants

Applicants

Fraunhofer Ges Forschung

Friedrich Alexander Universität Erlangen Nürnberg

Patent Abstract

An apparatus for generating one or more audio output channels is provided. The apparatus comprises a parameter processor (110) for calculating output channel mixing information and a downmix processor (120) for generating the one or more audio output channels. The downmix processor (120) is configured to receive an audio transport signal comprising one or more audio transport channels, wherein two or more audio object signals are mixed within the audio transport signal, and wherein the number of the one or more audio transport channels is smaller than the number of the two or more audio object signals. The audio transport signal depends on a first mixing rule and on a second mixing rule. The first mixing rule indicates how to mix the two or more audio object signals to obtain a plurality of premixed channels. Moreover, the second mixing rule indicates how to mix the plurality of premixed channels to obtain the one or more audio transport channels of the audio transport signal. The parameter processor (110) is configured to receive information on the second mixing rule, wherein the information on the second mixing rule indicates how to mix the plurality of premixed signals such that the one or more audio transport channels are obtained. Moreover, the parameter processor (110) is configured to calculate the output channel mixing information depending on an audio objects number indicating the number of the two or more audio object signals, depending on a premixed channels number indicating the number of the plurality of premixed channels, and depending on the information on the second mixing rule. The downmix processor (120) is configured to generate the one or more audio output channels from the audio transport signal depending on the output channel mixing information.

Key Information

Publication No.

EP2830048A1

Family ID

49385153

Publication Date

2015-01-28

Application No.

EP13189281A

Application Date

2013-10-18

Priority Date

2013-07-22

Granted

Yes (34/68)

Possible Cooperation

For further information please contact the transfer office.