Apparatus and Method Employing a Perception-Based Distance Metric for Spatial Audio
Simple SummaryContent extracted from patent full text and abstract with AI.
This invention introduces a system and method for improving the processing and rendering of spatial (3D) audio by using a perception-based distance metric. Instead of evaluating physical distances between sound sources or objects, the system uses a sophisticated model that quantifies how differences in spatial audio are perceived by human listeners. The model includes a perceptual coordinate system, a spatial masking model, and a 3D directional loudness map, enabling more efficient clustering and processing of audio objects while maintaining immersive and high-quality audio experiences.
Use CasesContent extracted from patent full text and abstract with AI.
- Virtual reality and augmented reality applications requiring immersive spatial audio rendering.
- Cinema and home theater systems that utilize object-based audio for a 3D sound experience.
- Gaming engines seeking to render believable and efficient three-dimensional soundscapes.
- Audio streaming and storage services optimizing transmission of spatial/object-based audio content.
- Assistive technologies that deliver personalized spatial audio for hearing aids or AR glasses.
- Teleconferencing platforms aiming to provide spatial separation of speakers for clarity.
- Automotive infotainment systems with multi-zone 3D audio experiences.
BenefitsContent extracted from patent full text and abstract with AI.
- Enables real-time processing of complex 3D audio scenes with reduced computational and memory requirements.
- Improves audio quality by clustering audio objects in ways that preserve human perceptual differences, avoiding perceptible distortions.
- Optimizes data storage and transmission for object-based or spatial audio formats by reducing the number of audio objects without noticeable loss in quality.
- Allows for adaptive rendering, quantization, or culling of sound sources based on perceptual irrelevance (i.e., inaudible objects can be removed).
- Facilitates more personalized and realistic audio experiences by utilizing head-related transfer function (HRTF) data.
- Supports a wide range of playback devices, including headphones, speakers, and personalized audio systems.
- Enables robust, scalable, and perceptually-driven sound scene rendering for future audio applications.
Technical Classifications (CPCs)
Main Classifications
Electrical & Electronic Tech
Sub Classifications
Electric Communication Technique
CPC Codes
Inventors & Applicants
Inventors
Applicants
Fraunhofer Ges Forschung
Univ Friedrich Alexander Er
Patent Abstract
An apparatus (100) according to an embodiment is provided. The apparatus comprises an input interface (110) for receiving a plurality of audio objects of an audio sound scene. Moreover, the apparatus (100) comprises a processor (120). Each of the plurality of audio objects represents a sound source being different from any other sound source being represented by any other audio object of the plurality of audio objects; or at least two of the plurality of audio objects represent a same sound source at different locations. The processor (120) is configured to obtain information on a perceptual difference between two audio objects of the plurality of audio objects depending on a distance metric, wherein the distance metric represents perceptual differences in spatial properties of the audio sound scene. And/or, the processor (120) is configured to process the plurality of audio objects to obtain a plurality of audio object clusters or a plurality of processed audio objects depending on the distance metric.
Key Information
Publication No.
EP4346235A1
Family ID
83508456
Publication Date
2024-04-03
Application No.
EP22198848A
Application Date
2022-09-29
Priority Date
2022-09-29
Granted
No
Possible Cooperation
For further information please contact the transfer office.