Audio Scene Encoder, Audio Scene Decoder and Related Methods Using Hybrid Encoder/decoder Spatial Analysis
Simple SummaryContent extracted from patent full text and abstract with AI.
This patent describes a hybrid audio scene encoding and decoding system for 3D (spatial) audio. The audio scene is split into multiple portions (by frequency or time), where some are encoded with detailed (waveform-preserving) methods, while others use more efficient, parametric methods. In addition, for certain spectral or temporal segments, spatial audio parameters (such as direction and diffuseness) are estimated and transmitted from the encoder, while for other segments these spatial parameters are computed at the decoder. This approach allows optimizing the trade-off between bitrate, audio quality, and the accuracy of spatial cues for different parts of the audio signal, leading to more efficient and higher-quality spatial audio reproduction, particularly at lower bitrates.
Use CasesContent extracted from patent full text and abstract with AI.
- Delivering immersive 3D audio content in streaming platforms (e.g., movies, VR, live music) with limited bandwidth.
- Broadcasting surround sound or Ambisonics audio over radio, TV, or the internet efficiently.
- Encoding, storing, and transmitting high-quality spatial audio for gaming or virtual reality experiences.
- Implementing efficient multi-channel audio in low-power or memory-constrained devices such as mobile phones or portable speakers.
- Enabling teleconferencing systems with spatialized audio to enhance clarity and immersion.
- Archiving and distributing 3D sound for museums, education, or simulation environments.
BenefitsContent extracted from patent full text and abstract with AI.
- Enables high-quality 3D/spatial audio reproduction at reduced bitrates, lowering storage and transmission costs.
- Maintains important spatial cues (direction and diffuseness), leading to a more realistic and immersive listening experience.
- Offers a flexible trade-off between audio quality, spatial resolution, and data rate according to application needs.
- Reduces artifacts and inaccuracies in spatial audio reconstruction, especially for portions of the signal encoded with parametric methods.
- Allows adaptive encoding to varying bandwidth and processing constraints, benefiting both consumer and professional audio applications.
- Compatible with different audio scene formats (e.g., channel-based, object-based, Ambisonics), increasing versatility.
Technical Classifications (CPCs)
Main Classifications
Electrical & Electronic Tech
Physics & Measurement
Sub Classifications
Electric Communication Technique
Musical Instruments & Acoustics
CPC Codes
Inventors & Applicants
Inventors
Applicants
Fraunhofer Ges Forschung
Univ Friedrich Alexander Er
Patent Abstract
An audio scene encoder for encoding an audio scene, the audio scene comprising at least two component signals, comprises: a core encoder (160) for core encoding the at least two component signals, wherein the core encoder (160) is configured to generate a first encoded representation (310) for a first portion of the at least two component signals, and to generate a second encoded representation (320) for a second portion of the at least two component signals, a spatial analyzer (200) for analyzing the audio scene to derive one or more spatial parameters (330) or one or more spatial parameter sets for the second portion; and an output interface (300) for forming the encoded audio scene signal (340), the encoded audio scene signal (340) comprising the first encoded representation (310), the second encoded representation (320), and the one or more spatial parameters (330) or one or more spatial parameter sets for the second portion.
Key Information
Publication No.
WO2019149845A1
Family ID
65276183
Publication Date
2019-08-08
Application No.
EP2019052428W
Application Date
2019-01-31
Priority Date
2018-02-01
Granted
Yes (12/30)
Possible Cooperation
For further information please contact the transfer office.