Concept for audio encoding and decoding for audio channels and audio objects
Simple SummaryContent extracted from patent full text and abstract with AI.
This patent describes an advanced system and method for audio encoding and decoding that combines traditional audio channels (like stereo or surround sound) and individual audio objects (such as voices or sound effects) along with their spatial metadata. The system flexibly allows for either transmitting raw separate channels and objects or mixing them together before encoding, depending on desired quality, flexibility, and bitrate efficiency. This hybrid approach supports the efficient delivery and high-quality rendering of complex audio scenes, including options for headphone (binaural) or multi-speaker playback.
Use CasesContent extracted from patent full text and abstract with AI.
- Streaming high-quality 3D audio for movies, games, or virtual reality content with dynamic sound placement.
- Efficiently delivering immersive soundscapes for live broadcasts or concerts, adaptable for various playback configurations (e.g., surround sound, stereo, binaural/headphones).
- Audio production and post-production, enabling flexible manipulation and efficient distribution of audio objects and channels.
- Implementing advanced sound playback on mobile devices, optimizing power consumption and decoding complexity depending on device capabilities and output needs.
- Integration in automotive infotainment or advanced home theater systems to provide personalized or spatially adaptive audio experiences.
BenefitsContent extracted from patent full text and abstract with AI.
- Flexible encoding modes allow balancing between audio quality, user interactivity (such as personalized 3D positioning), and data rate efficiency.
- Improved audio compression through pre-mixing and object-based encoding, supporting high-quality output even at low bitrates.
- Enhanced user experience by enabling dynamic rendering of sound sources in multiple reproduction layouts (from simple stereo to complex 22.2-channel setups or binaural headphones).
- Efficient handling and transmission of audio metadata for accurate 3D positioning and properties of audio objects.
- Reduced power consumption and complexity on decoding devices, especially important for mobile and embedded systems.
Technical Classifications (CPCs)
Main Classifications
Electrical & Electronic Tech
Physics & Measurement
Sub Classifications
Electric Communication Technique
Musical Instruments & Acoustics
CPC Codes
Inventors & Applicants
Inventors
Applicants
Fraunhofer Ges Forschung
Friedrich Alexander Universität Erlangen Nürnberg
Patent Abstract
Audio encoder for encoding audio input data (101) to obtain audio output data (501) comprises an input interface (100) for receiving a plurality of audio channels, a plurality of audio objects and metadata related to one or more of the plurality of audio objects; a mixer (200) for mixing the plurality of objects and the plurality of channels to obtain a plurality of pre-mixed channels, each pre-mixed channel comprising audio data of a channel and audio data of at least one object; a core encoder (300) for core encoding core encoder input data; and a metadata compressor (400) for compressing the metadata related to the one or more of the plurality of audio objects, wherein the audio encoder is configured to operate in at least one mode of the group of two modes comprising a first mode, in which the core encoder is configured to encode the plurality of audio channels and the plurality of audio objects received by the input interface as core encoder input data, and a second mode, in which the core encoder (300) is configured for receiving, as the core encoder input data, the plurality of pre-mixed channels generated by the mixer (200).
Key Information
Publication No.
EP2830045A1
Family ID
48803456
Publication Date
2015-01-28
Application No.
EP13177378A
Application Date
2013-07-22
Priority Date
2013-07-22
Granted
Yes (21/44)
Possible Cooperation
For further information please contact the transfer office.