Apparatus and Method for Encoding a Plurality of Audio Objects or Apparatus and Method for Decoding Using Two or More Relevant Audio Objects

Publication: WO2022079049A2

Published: 2022-04-21

Family Size: 16

Granted: Yes (3/16)

Simple Summary

Content extracted from patent full text and abstract with AI.

The invention discloses an apparatus and method for efficiently encoding and decoding audio signals made up of multiple audio objects. Instead of encoding every audio object individually, which is bit-rate intensive, the approach identifies the most relevant audio objects (at least two, but fewer than the total) in each time-frequency segment, computes key parameters for them (such as power ratios and direction), and combines this information with a downmixed audio signal. The encoded data allows high-quality reconstruction of spatial audio with fewer bits and lower computational complexity, supporting immersive and interactive audio applications.

Use Cases

Streaming immersive audio for movies, video games, and virtual/augmented reality platforms, where dozens of audio objects move in 3D space around the listener.
Teleconferencing systems requiring efficient spatial audio reproduction to create realistic environments and accurately localize speakers.
Interactive audio applications such as 360-degree videos or live events that demand efficient, high-quality object-based audio rendering.
Broadcasting technologies (radio, television, internet) that must deliver object-based spatial audio to a wide variety of listener configurations without excessive data rates.
Storage and playback systems for multi-object soundtracks that need to balance storage requirements and playback fidelity across diverse devices.

Benefits

Significant bit-rate savings by encoding only the most perceptually relevant audio objects per time-frequency segment, rather than all objects at all times.
Enables high-quality immersive audio reproduction (spatial localization and realism) at lower bitrates, making it suitable for streaming and bandwidth-constrained scenarios.
Adaptable to different output playback configurations (stereo, 5.1, 7.1, headphones, etc.), since object directions and power information are preserved.
Reduces computational complexity in both encoding and decoding compared to previous methods, via optimized covariance synthesis and parameter grouping.
Supports scalability to a large number of audio objects without a linear increase in data size, enabling richer content in modern multimedia applications.

Technical Classifications (CPCs)

Main Classifications

Physics & Measurement

Sub Classifications

Musical Instruments & Acoustics

CPC Codes

G10L19/008

G10L19/02

G10L25/03

Inventors & Applicants

Inventors

Applicants

Fraunhofer Ges Forschung

Univ Friedrich Alexander Er

Patent Abstract

Apparatus for encoding a plurality of audio objects, comprising: an object parameter calculator (100) configured for calculating, for one or more frequency bins of a plurality of frequency bins related to a time frame, parameter data for at least two relevant audio objects, wherein a number of the at least two relevant audio objects is lower than a total number of the plurality of audio objects, and an output interface (200) for outputting an encoded audio signal comprising information on the parameter data for the at least two relevant audio objects for the one or more frequency bins.
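
The selection step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: ranking objects by per-bin power, normalising the power ratios so they sum to 1, and the names relevant_object_params and encode_frame are all assumptions made for illustration. Direction parameters and the downmix are left out.

```python
from typing import List, Tuple


def relevant_object_params(
    bin_powers: List[float], num_relevant: int = 2
) -> Tuple[List[int], List[float]]:
    """Pick the strongest objects in one frequency bin of one time frame.

    bin_powers[i] is the (hypothetical) power of object i in this bin.
    Returns the indices of the selected objects and their power ratios.
    """
    # The abstract requires at least two relevant objects, fewer than the total.
    if not 2 <= num_relevant < len(bin_powers):
        raise ValueError("need at least 2 relevant objects, fewer than the total")

    # Assumption: relevance is ranked by power, strongest first.
    ranked = sorted(
        range(len(bin_powers)), key=lambda i: bin_powers[i], reverse=True
    )
    selected = ranked[:num_relevant]

    # Assumption: ratios are normalised over the selected objects only.
    total = sum(bin_powers[i] for i in selected)
    if total == 0.0:
        ratios = [1.0 / num_relevant] * num_relevant
    else:
        ratios = [bin_powers[i] / total for i in selected]
    return selected, ratios


def encode_frame(
    frame_powers: List[List[float]], num_relevant: int = 2
) -> List[Tuple[List[int], List[float]]]:
    """frame_powers[b][i] is the power of object i in bin b of one frame."""
    return [relevant_object_params(p, num_relevant) for p in frame_powers]


# Four objects, two frequency bins (made-up powers).
frame = [
    [0.1, 4.0, 0.5, 1.0],  # bin 0: objects 1 and 3 dominate
    [3.0, 0.0, 1.0, 0.2],  # bin 1: objects 0 and 2 dominate
]
params = encode_frame(frame)
for b, (indices, ratios) in enumerate(params):
    print(b, indices, ratios)
```

In this sketch each bin carries parameters for only two of the four objects, which is the source of the bit-rate saving the summary describes: the amount of per-bin parameter data depends on the number of relevant objects rather than on the total number of objects.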

Key Information

Publication No.

WO2022079049A2

Family ID

78087392

Publication Date

2022-04-21

Application No.

EP2021078217W

Application Date

2021-10-12

Priority Date

2020-10-13

Granted

Yes (3/16)

Possible Cooperation

For further information please contact the transfer office.
