Decoder and Method for a Generalized Spatial-Audio-Object-Coding Parametric Concept for Multichannel Downmix/Upmix Cases

Publication: WO2014020182A2

Published: 2014-02-06

Family Size: 32

Granted: Yes (16/32)

Simple Summary

Content extracted from patent full text and abstract with AI.

This patent describes an improved audio decoder and method for spatial audio object coding, which can handle multichannel audio signals more flexibly and reliably. The invention adapts the threshold used for separating audio components based on the signal and noise energy of both individual audio objects and downmix channels. By dynamically adjusting this threshold for different time-frequency sections of audio, the decoder can better separate and render audio objects from complex, multi-channel downmixes with fewer artifacts and improved quality.
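The per-section adaptation described above can be illustrated with a minimal sketch. The summary does not give the threshold formula, so the rule used here (the larger of the noise energy and a small fraction of the signal energy) is an assumption chosen only for illustration. The names `adaptive_threshold`, `thresholds_per_tile`, and `relative_floor` are hypothetical and do not come from the patent.

```python
from typing import Dict, Tuple

# A time-frequency tile is identified by (time frame index, frequency band index).
Tile = Tuple[int, int]


def adaptive_threshold(signal_energy: float,
                       noise_energy: float,
                       relative_floor: float = 0.01) -> float:
    """Return a threshold for one tile.

    ASSUMPTION: the rule below is illustrative, not the patent's formula.
    The threshold never drops below the noise energy and never below a
    fixed fraction (relative_floor) of the signal energy.
    """
    if signal_energy < 0 or noise_energy < 0:
        raise ValueError("energies must be non-negative")
    return max(relative_floor * signal_energy, noise_energy)


def thresholds_per_tile(energies: Dict[Tile, Tuple[float, float]],
                        relative_floor: float = 0.01) -> Dict[Tile, float]:
    """Compute one threshold per tile from (signal_energy, noise_energy) pairs."""
    return {
        tile: adaptive_threshold(signal, noise, relative_floor)
        for tile, (signal, noise) in energies.items()
    }


if __name__ == "__main__":
    # Two tiles of the same frame: a loud band and a quiet band.
    example = {(0, 0): (8.0, 0.5), (0, 1): (1.0, 0.5)}
    for tile, value in thresholds_per_tile(example, relative_floor=0.25).items():
        print(tile, value)
```

In this sketch a loud tile is governed by its signal energy and a quiet tile by the noise energy, which is one way a threshold can differ from one time-frequency section to the next.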

Use Cases

Home theater systems and soundbars that need to upmix stereo or multi-channel audio to more immersive formats like 5.1, 7.1, or 3D audio.
Professional audio production and post-production environments where individual audio objects (e.g., instruments, voices) need to be re-mixed or spatialized after transmission or storage.
Live broadcasting (e.g., sports, concerts) where end-users might want to adjust commentator and atmosphere levels independently.
Virtual reality (VR) and augmented reality (AR) applications needing precise, dynamic spatial audio rendering based on user position or preference.
Teleconferencing systems where separating and enhancing different speakers' voices improves intelligibility and meeting experiences.
Music education and karaoke applications enabling users to isolate, learn, or mute specific musical parts.

Benefits

Enables more flexible and accurate separation and rendering of audio objects from multi-channel downmixes, leading to improved audio quality.
Reduces common artifacts during audio object separation by adaptively setting thresholds according to real-time signal and noise measurements.
Works with an arbitrary number of audio channels and is compatible with existing and future audio coding standards.
Supports per-time-frequency adaptation, allowing for optimized separation in dynamic and complex audio content.
Allows end-users greater customization of their audio experience (e.g., changing instrument or voice levels, spatial positioning).
Improves bitrate efficiency in transmission and storage by enabling robust object-based reconstruction from fewer transmitted channels.
Facilitates new interactive and immersive audio applications in entertainment, VR/AR, communication, and education.

Technical Classifications (CPCs)

Main Classifications

Electrical & Electronic Tech

Physics & Measurement

Sub Classifications

Electric Communication Technique

Musical Instruments & Acoustics

CPC Codes

G10L13/07
G10L19/008
G10L19/02
H04S1/007
H04S3/02
H04S5/02

Inventors & Applicants

Inventors

Applicants

Fraunhofer Ges Forschung

Univ Friedrich Alexander Er

Patent Abstract

A decoder for generating an audio output signal comprising one or more audio output channels from a downmix signal comprising one or more downmix channels is provided. The downmix signal encodes one or more audio object signals. The decoder comprises a threshold determiner (110) for determining a threshold value depending on a signal energy and/or a noise energy of at least one of the one or more audio object signals and/or depending on a signal energy and/or a noise energy of at least one of the one or more downmix channels. Moreover, the decoder comprises a processing unit (120) for generating the one or more audio output channels from the one or more downmix channels depending on the threshold value.
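The two-unit structure in the abstract can be sketched as follows. This is an illustration under stated assumptions, not the patented method: the abstract does not say how the threshold is computed or how the processing unit uses it. Here the threshold is assumed to act as a lower bound on the denominator of an MMSE-style object estimate for a single downmix channel with uncorrelated objects. The class names, the `relative_floor` constant, and all method signatures are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class ThresholdDeterminer:
    """Hypothetical stand-in for the threshold determiner (110)."""
    relative_floor: float = 0.001  # ASSUMED tuning constant, not from the patent

    def determine(self,
                  object_energies: Sequence[float],
                  downmix_energies: Sequence[float],
                  noise_energy: float) -> float:
        # ASSUMED rule: a fraction of the largest signal energy seen among
        # the objects and downmix channels, but never below the noise energy.
        peak = max(list(object_energies) + list(downmix_energies), default=0.0)
        return max(self.relative_floor * peak, noise_energy)


class ProcessingUnit:
    """Hypothetical stand-in for the processing unit (120), mono downmix only."""

    def render(self,
               downmix_sample: float,
               downmix_gains: Sequence[float],
               object_energies: Sequence[float],
               render_gains: Sequence[Sequence[float]],
               threshold: float) -> List[float]:
        # Modeled downmix energy, assuming mutually uncorrelated objects.
        modeled = sum(d * d * e for d, e in zip(downmix_gains, object_energies))
        # The threshold keeps the division well conditioned in quiet sections.
        denominator = max(modeled, threshold)
        if denominator <= 0.0:
            raise ValueError("threshold must be positive when all energies are zero")
        # MMSE-style estimate of each object from the single downmix channel.
        estimates = [d * e / denominator * downmix_sample
                     for d, e in zip(downmix_gains, object_energies)]
        # Each output channel is a weighted sum of the estimated objects.
        return [sum(r * s for r, s in zip(row, estimates)) for row in render_gains]


if __name__ == "__main__":
    determiner = ThresholdDeterminer(relative_floor=0.25)
    unit = ProcessingUnit()
    energies = [3.0, 1.0]
    threshold = determiner.determine(energies, [4.0], noise_energy=0.5)
    # Identity rendering: output channel c carries estimated object c.
    print(unit.render(4.0, [1.0, 1.0], energies, [[1.0, 0.0], [0.0, 1.0]], threshold))
```

Keeping the threshold computation separate from the rendering step mirrors the split between units (110) and (120). In a silent section, where all object energies are zero, the threshold alone sets the denominator and the output stays at zero instead of dividing by zero.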

Key Information

Publication No.: WO2014020182A2

Family ID: 49150906

Publication Date: 2014-02-06

Application No.: EP2013066405W

Application Date: 2013-08-05

Priority Date: 2012-08-03

Granted: Yes (16/32)

Possible Cooperation

For further information please contact the transfer office.
