Apparatus and Method for Encoding a Plurality of Audio Objects or Apparatus and Method for Decoding Using Two or More Relevant Audio Objects
Simple Summary
Content extracted from patent full text and abstract with AI.
The patent discloses an apparatus and method for efficiently encoding and decoding audio signals made up of multiple audio objects. Encoding every audio object individually requires a high bit rate, so the approach instead identifies the most relevant audio objects in each time-frequency segment (at least two, but fewer than the total number of objects), computes key parameters for them (such as power ratios and direction), and combines this information with a downmixed audio signal. The encoded data allows high-quality reconstruction of spatial audio with fewer bits and lower computational complexity, supporting immersive and interactive audio applications.
Use Cases
- Streaming immersive audio for movies, video games, and virtual/augmented reality platforms, where dozens of audio objects move in 3D space around the listener.
- Teleconferencing systems requiring efficient spatial audio reproduction to create realistic environments and accurately localize speakers.
- Interactive audio applications such as 360-degree videos or live events that demand efficient, high-quality object-based audio rendering.
- Broadcasting technologies (radio, television, internet) that must deliver object-based spatial audio to a wide variety of listener configurations without excessive data rates.
- Storage and playback systems for multi-object soundtracks that need to balance storage requirements and playback fidelity across diverse devices.
Benefits
- Significant bit-rate savings by encoding only the most perceptually relevant audio objects per time-frequency segment, rather than all objects at all times.
- Enables high-quality immersive audio reproduction (spatial localization and realism) at lower bit rates, making it suitable for streaming and bandwidth-constrained scenarios.
- Adaptable to different output playback configurations (stereo, 5.1, 7.1, headphones, etc.), since object directions and power information are preserved.
- Reduces computational complexity in both encoding and decoding compared to previous methods, via optimized covariance synthesis and parameter grouping.
- Supports scalability to a large number of audio objects without a linear increase in data size, enabling richer content in modern multimedia applications.
Technical Classifications (CPCs)
Main Classifications
Physics & Measurement
Sub Classifications
Musical Instruments & Acoustics
CPC Codes
Inventors & Applicants
Inventors
Applicants
Fraunhofer Ges Forschung
Univ Friedrich Alexander Er
Patent Abstract
Apparatus for encoding a plurality of audio objects, comprising: an object parameter calculator (100) configured for calculating, for one or more frequency bins of a plurality of frequency bins related to a time frame, parameter data for at least two relevant audio objects, wherein a number of the at least two relevant audio objects is lower than a total number of the plurality of audio objects, and an output interface (200) for outputting an encoded audio signal comprising information on the parameter data for the at least two relevant audio objects for the one or more frequency bins.
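The per-bin selection described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the function name `relevant_object_parameters`, the input layout `spectra[i][k]`, and the choice to rank relevance by signal power in each bin are all assumptions made for illustration. The sketch only shows the general idea of keeping parameter data for a small number of objects (at least two, fewer than the total) per frequency bin of one time frame.

```python
from typing import List, Tuple


def relevant_object_parameters(
    spectra: List[List[complex]], num_relevant: int = 2
) -> List[Tuple[List[int], List[float]]]:
    """Hypothetical helper: per-bin parameters for one time frame.

    spectra[i][k] is the complex spectral value of object i in bin k.
    Returns, for every bin, (indices of the selected objects,
    power ratios of those objects, which sum to 1).
    """
    num_objects = len(spectra)
    if not 2 <= num_relevant < num_objects:
        raise ValueError("need at least 2 relevant objects, fewer than the total")

    num_bins = len(spectra[0])
    params = []
    for k in range(num_bins):
        powers = [abs(spectra[i][k]) ** 2 for i in range(num_objects)]
        # Assumption: an object's relevance is its power in this bin.
        ranked = sorted(range(num_objects), key=lambda i: powers[i], reverse=True)
        selected = ranked[:num_relevant]
        total = sum(powers[i] for i in selected)
        if total > 0:
            ratios = [powers[i] / total for i in selected]
        else:
            # Silent bin: fall back to equal ratios to avoid dividing by zero.
            ratios = [1.0 / num_relevant] * num_relevant
        params.append((selected, ratios))
    return params


# Three objects, two frequency bins (made-up values for illustration).
spectra = [
    [1 + 0j, 0.1j],
    [0.5 + 0j, 2 + 0j],
    [0j, 1 + 1j],
]
frame_params = relevant_object_parameters(spectra)
for k, (indices, ratios) in enumerate(frame_params):
    print(k, indices, [round(r, 3) for r in ratios])
```

In this sketch only the selected object indices and their ratios would be kept per bin, so the amount of parameter data depends on `num_relevant` rather than on the total number of objects. Direction information, which the summary also mentions, is left out.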
Key Information
Publication No.
WO2022079049A2
Family ID
78087392
Publication Date
2022-04-21
Application No.
EP2021078217W
Application Date
2021-10-12
Priority Date
2020-10-13
Granted
Yes (3/16)
Possible Cooperation
For further information please contact the transfer office.