Apparatus and Method Employing a Perception-Based Distance Metric for Spatial Audio

Publication: EP4346235A1

Published: 2024-04-03

Family Size: 2

Granted: No

Simple SummaryContent extracted from patent full text and abstract with AI.

This invention introduces a system and method for improving the processing and rendering of spatial (3D) audio by using a perception-based distance metric. Instead of evaluating physical distances between sound sources or objects, the system uses a sophisticated model that quantifies how differences in spatial audio are perceived by human listeners. The model includes a perceptual coordinate system, a spatial masking model, and a 3D directional loudness map, enabling more efficient clustering and processing of audio objects while maintaining immersive and high-quality audio experiences.

Use CasesContent extracted from patent full text and abstract with AI.

Virtual reality and augmented reality applications requiring immersive spatial audio rendering.
Cinema and home theater systems that utilize object-based audio for a 3D sound experience.
Gaming engines seeking to render believable and efficient three-dimensional soundscapes.
Audio streaming and storage services optimizing transmission of spatial/object-based audio content.
Assistive technologies that deliver personalized spatial audio for hearing aids or AR glasses.
Teleconferencing platforms aiming to provide spatial separation of speakers for clarity.
Automotive infotainment systems with multi-zone 3D audio experiences.

BenefitsContent extracted from patent full text and abstract with AI.

Enables real-time processing of complex 3D audio scenes with reduced computational and memory requirements.
Improves audio quality by clustering audio objects in ways that preserve human perceptual differences, avoiding perceptible distortions.
Optimizes data storage and transmission for object-based or spatial audio formats by reducing the number of audio objects without noticeable loss in quality.
Allows for adaptive rendering, quantization, or culling of sound sources based on perceptual irrelevance (i.e., inaudible objects can be removed).
Facilitates more personalized and realistic audio experiences by utilizing head-related transfer function (HRTF) data.
Supports a wide range of playback devices, including headphones, speakers, and personalized audio systems.
Enables robust, scalable, and perceptually-driven sound scene rendering for future audio applications.

Technical Classifications (CPCs)

Main Classifications

Electrical & Electronic Tech

Sub Classifications

Electric Communication Technique

CPC Codes

H04S7/30

Inventors & Applicants

Inventors

Sascha Dick

Jürgen Herre

Pablo Delgado

Applicants

Fraunhofer Ges Forschung

Univ Friedrich Alexander Er

Patent Abstract

An apparatus (100) according to an embodiment is provided. The apparatus comprises an input interface (110) for receiving a plurality of audio objects of an audio sound scene. Moreover, the apparatus (100) comprises a processor (120). Each of the plurality of audio objects represents a sound source being different from any other sound source being represented by any other audio object of the plurality of audio objects; or at least two of the plurality of audio objects represent a same sound source at different locations. The processor (120) is configured to obtain information on a perceptual difference between two audio objects of the plurality of audio objects depending on a distance metric, wherein the distance metric represents perceptual differences in spatial properties of the audio sound scene. And/or, the processor (120) is configured to process the plurality of audio objects to obtain a plurality of audio object clusters or a plurality of processed audio objects depending on the distance metric.

Key Information

Publication No.

EP4346235A1

Family ID

83508456

Publication Date

2024-04-03

Application No.

EP22198848A

Application Date

2022-09-29

Priority Date

2022-09-29

Granted

Possible Cooperation

For further information please contact the transfer office.

See full document in Espacenet