Decoder and Method for a Generalized Spatial-Audio-Object-Coding Parametric Concept for Multichannel Downmix/Upmix Cases
Simple Summary
Content extracted from patent full text and abstract with AI.
This patent describes an improved audio decoder and method for spatial audio object coding, which can handle multichannel audio signals more flexibly and reliably. The invention adapts the threshold used for separating audio components based on the signal and noise energy of both individual audio objects and downmix channels. By dynamically adjusting this threshold for different time-frequency sections of audio, the decoder can better separate and render audio objects from complex, multi-channel downmixes with fewer artifacts and improved quality.
Use Cases
- Home theater systems and soundbars that need to upmix stereo or multi-channel audio to more immersive formats like 5.1, 7.1, or 3D audio.
- Professional audio production and post-production environments where individual audio objects (e.g., instruments, voices) need to be re-mixed or spatialized after transmission or storage.
- Live broadcasting (e.g., sports, concerts) where end-users might want to adjust commentator and atmosphere levels independently.
- Virtual reality (VR) and augmented reality (AR) applications needing precise, dynamic spatial audio rendering based on user position or preference.
- Teleconferencing systems where separating and enhancing different speakers' voices improves intelligibility and meeting experiences.
- Music education and karaoke applications enabling users to isolate, learn, or mute specific musical parts.
Benefits
- Enables more flexible and accurate separation and rendering of audio objects from multi-channel downmixes, leading to improved audio quality.
- Reduces common artifacts during audio object separation by adaptively setting thresholds according to real-time signal and noise measurements.
- Works with an arbitrary number of audio channels and is compatible with existing and future audio coding standards.
- Supports per-time-frequency adaptation, allowing for optimized separation in dynamic and complex audio content.
- Allows end-users greater customization of their audio experience (e.g., changing instrument or voice levels, spatial positioning).
- Improves bitrate efficiency in transmission and storage by enabling robust object-based reconstruction from fewer transmitted channels.
- Facilitates new interactive and immersive audio applications in entertainment, VR/AR, communication, and education.
Technical Classifications (CPCs)
Main Classifications
Electrical & Electronic Tech
Physics & Measurement
Sub Classifications
Electric Communication Technique
Musical Instruments & Acoustics
CPC Codes
Inventors & Applicants
Applicants
Fraunhofer Ges Forschung
Univ Friedrich Alexander Er
Patent Abstract
A decoder for generating an audio output signal comprising one or more audio output channels from a downmix signal comprising one or more downmix channels is provided. The downmix signal encodes one or more audio object signals. The decoder comprises a threshold determiner (110) for determining a threshold value depending on a signal energy and/or a noise energy of at least one of the one or more audio object signals and/or depending on a signal energy and/or a noise energy of at least one of the one or more downmix channels. Moreover, the decoder comprises a processing unit (120) for generating the one or more audio output channels from the one or more downmix channels depending on the threshold value.
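The two components named in the abstract can be illustrated with a minimal Python sketch for a single time-frequency tile and a single downmix channel. Everything specific here is an assumption made for illustration and is not taken from the patent text: the function names `determine_threshold` and `render_tile`, the parameters `relative_floor` and `absolute_floor`, the rule that the threshold scales with the strongest energy in the tile, and the use of the threshold to limit a division by the downmix energy.

```python
def determine_threshold(object_energies, downmix_energies,
                        noise_energy=0.0, relative_floor=1e-4,
                        absolute_floor=1e-12):
    """Hypothetical threshold determiner for one time-frequency tile.

    Assumed rule: the threshold scales with the strongest signal energy
    in the tile and never drops below the noise energy or a tiny
    absolute floor.
    """
    reference = max(list(object_energies) + list(downmix_energies))
    return max(relative_floor * reference, noise_energy, absolute_floor)


def render_tile(downmix_sample, object_energies, downmix_gains,
                render_gains, threshold):
    """Hypothetical processing unit: one output channel, one downmix channel.

    Assumes uncorrelated objects, so the downmix energy implied by the
    object energies is q = sum(d_i^2 * e_i).
    """
    q = sum(d * d * e for d, e in zip(downmix_gains, object_energies))
    # Regularization: never divide by less than the threshold.
    q_reg = max(q, threshold)
    # Parametric estimate of each object: s_i ~ (e_i * d_i / q_reg) * x
    unmix = [e * d / q_reg for e, d in zip(object_energies, downmix_gains)]
    # Apply the rendering gains to the estimated objects.
    return sum(r * g for r, g in zip(render_gains, unmix)) * downmix_sample


# Per-tile usage: the threshold is recomputed for every tile.
tiles = [
    {"x": 2.0, "energies": [1.0, 0.25]},   # active tile
    {"x": 0.0, "energies": [0.0, 0.0]},    # silent tile
]
outputs = []
for tile in tiles:
    gains = [1.0, 1.0]
    q = sum(g * g * e for g, e in zip(gains, tile["energies"]))
    t = determine_threshold(tile["energies"], [q], noise_energy=1e-6)
    outputs.append(render_tile(tile["x"], tile["energies"], gains,
                               [1.0, 1.0], t))
```

Because the threshold is recomputed for every tile, a silent tile does not cause a division by zero, while a loud tile is left essentially unaffected by the regularization. A multichannel downmix would need a matrix inversion in place of the scalar division; that case is not shown here.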
Key Information
Publication No.
WO2014020182A2
Family ID
49150906
Publication Date
2014-02-06
Application No.
EP2013066405W
Application Date
2013-08-05
Priority Date
2012-08-03
Granted
Yes (16/32)
Possible Cooperation
For further information please contact the transfer office.