Apparatus and Method Employing a Perception-Based Distance Metric for Spatial Audio

Publication: EP4346235A1
Published: 2024-04-03
Family Size: 2
Granted: No

Simple Summary
Content extracted from patent full text and abstract with AI.

This invention describes an apparatus and method for processing and rendering spatial (3D) audio using a perception-based distance metric. Instead of measuring physical distances between sound sources or objects, the system uses a model that quantifies how human listeners perceive differences in spatial audio. The model comprises a perceptual coordinate system, a spatial masking model, and a 3D directional loudness map. It allows audio objects to be clustered and processed more efficiently while preserving an immersive, high-quality listening experience.
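To illustrate the difference between a physical and a perception-oriented distance, here is a minimal sketch. It is not the metric defined in the patent: the listener-centred axis convention, the log-scaled radius, and the weights `w_az`, `w_el`, and `w_r` are all assumptions chosen for illustration, loosely motivated by the general finding that listeners resolve azimuth more finely than elevation.

```python
import math

def to_spherical(x, y, z):
    """Convert listener-centred Cartesian coordinates to (azimuth, elevation, radius).

    Assumed convention: x points to the front, y to the left, z up.
    """
    r = math.sqrt(x * x + y * y + z * z)
    az = math.atan2(y, x)
    el = math.asin(z / r) if r > 0 else 0.0
    return az, el, r

def wrap_angle(a):
    """Wrap an angle in radians to the interval [-pi, pi)."""
    return (a + math.pi) % (2 * math.pi) - math.pi

def perceptual_distance(p, q, w_az=1.0, w_el=0.5, w_r=0.25, eps=1e-9):
    """Illustrative weighted distance between two source positions p and q.

    The weights are hypothetical placeholders, not values from the patent.
    """
    az1, el1, r1 = to_spherical(*p)
    az2, el2, r2 = to_spherical(*q)
    d_az = wrap_angle(az1 - az2)
    d_el = el1 - el2
    # Compare radii on a log scale so that ratios, not absolute offsets, matter.
    d_r = math.log(max(r1, eps) / max(r2, eps))
    return math.sqrt((w_az * d_az) ** 2 + (w_el * d_el) ** 2 + (w_r * d_r) ** 2)

front = (1.0, 0.0, 0.0)
left = (0.0, 1.0, 0.0)
above = (0.0, 0.0, 1.0)

# Both pairs are the same Euclidean distance apart (sqrt(2)),
# but the sketch rates the horizontal shift as the larger difference.
d_horizontal = perceptual_distance(front, left)
d_vertical = perceptual_distance(front, above)
```

With these placeholder weights, two source pairs that are physically equidistant receive different scores, which is the property a perception-based metric is meant to capture.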

Use Cases

  • Virtual reality and augmented reality applications requiring immersive spatial audio rendering.
  • Cinema and home theater systems that utilize object-based audio for a 3D sound experience.
  • Gaming engines seeking to render believable and efficient three-dimensional soundscapes.
  • Audio streaming and storage services optimizing transmission of spatial/object-based audio content.
  • Assistive technologies that deliver personalized spatial audio for hearing aids or AR glasses.
  • Teleconferencing platforms aiming to provide spatial separation of speakers for clarity.
  • Automotive infotainment systems with multi-zone 3D audio experiences.

Benefits

  • Enables real-time processing of complex 3D audio scenes with reduced computational and memory requirements.
  • Improves audio quality by clustering audio objects in ways that preserve human perceptual differences, avoiding perceptible distortions.
  • Optimizes data storage and transmission for object-based or spatial audio formats by reducing the number of audio objects without noticeable loss in quality.
  • Allows for adaptive rendering, quantization, or culling of sound sources based on perceptual irrelevance (i.e., inaudible objects can be removed).
  • Facilitates more personalized and realistic audio experiences by utilizing head-related transfer function (HRTF) data.
  • Supports a wide range of playback devices, including headphones, speakers, and personalized audio systems.
  • Enables robust, scalable, and perceptually driven sound scene rendering.
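The culling benefit listed above can be pictured with a small sketch. It is a crude stand-in for a spatial masking model, not the one in the patent: the `AudioObject` fields, the angular window, the masking margin, and the audibility floor are all hypothetical values chosen for illustration.

```python
from dataclasses import dataclass

@dataclass
class AudioObject:
    name: str
    azimuth_deg: float  # direction of the source around the listener
    level_db: float     # playback level relative to an arbitrary reference

def angular_gap(a, b):
    """Smallest absolute difference between two azimuths, in degrees."""
    d = abs(a - b) % 360.0
    return min(d, 360.0 - d)

def cull_masked(objects, window_deg=30.0, margin_db=40.0, floor_db=-80.0):
    """Drop objects assumed to be inaudible.

    An object is dropped if it lies below `floor_db`, or if a neighbour
    within `window_deg` is at least `margin_db` louder. All three
    parameters are illustrative assumptions.
    """
    kept = []
    for obj in objects:
        if obj.level_db < floor_db:
            continue
        masked = any(
            other is not obj
            and angular_gap(obj.azimuth_deg, other.azimuth_deg) <= window_deg
            and other.level_db - obj.level_db >= margin_db
            for other in objects
        )
        if not masked:
            kept.append(obj)
    return kept

scene = [
    AudioObject("voice", 0.0, -10.0),
    AudioObject("hiss", 10.0, -60.0),     # close to the much louder voice
    AudioObject("bird", 120.0, -55.0),    # quiet, but far from any louder source
    AudioObject("rumble", 200.0, -95.0),  # below the assumed audibility floor
]
kept_names = [o.name for o in cull_masked(scene)]
```

In this toy scene the hiss and the rumble are removed, while the equally quiet bird survives because no louder source sits near it. A real masking model would also account for frequency content and time, which this sketch ignores.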

Technical Classifications (CPCs)

Main Classifications

Electrical & Electronic Tech

Sub Classifications

Electric Communication Technique

CPC Codes

H04S7/30

Inventors & Applicants

Applicants

Fraunhofer Ges Forschung

Univ Friedrich Alexander Er

Patent Abstract

An apparatus (100) according to an embodiment is provided. The apparatus comprises an input interface (110) for receiving a plurality of audio objects of an audio sound scene. Moreover, the apparatus (100) comprises a processor (120). Each of the plurality of audio objects represents a sound source being different from any other sound source being represented by any other audio object of the plurality of audio objects; or at least two of the plurality of audio objects represent a same sound source at different locations. The processor (120) is configured to obtain information on a perceptual difference between two audio objects of the plurality of audio objects depending on a distance metric, wherein the distance metric represents perceptual differences in spatial properties of the audio sound scene. And/or, the processor (120) is configured to process the plurality of audio objects to obtain a plurality of audio object clusters or a plurality of processed audio objects depending on the distance metric.
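The clustering step described in the abstract can be sketched as follows. This is a simple greedy threshold scheme used only for illustration; the abstract does not specify a clustering algorithm, and the function name `cluster_objects`, the `threshold` parameter, and the azimuth-only distance in the usage example are assumptions. Any distance function, including a perceptual one, can be passed in.

```python
from typing import Callable, List, Sequence, TypeVar

T = TypeVar("T")

def cluster_objects(
    objects: Sequence[T],
    distance: Callable[[T, T], float],
    threshold: float,
) -> List[List[T]]:
    """Greedily group objects whose distance to a cluster's first member
    does not exceed `threshold`.

    Each object joins the nearest qualifying cluster, or starts a new one.
    """
    clusters: List[List[T]] = []
    for obj in objects:
        best = None
        best_d = threshold
        for cluster in clusters:
            # The first member acts as the cluster representative.
            d = distance(obj, cluster[0])
            if d <= best_d:
                best, best_d = cluster, d
        if best is None:
            clusters.append([obj])
        else:
            best.append(obj)
    return clusters

# Usage with a placeholder distance: absolute azimuth difference in degrees.
azimuths = [0.0, 5.0, 90.0, 93.0, 180.0]
groups = cluster_objects(azimuths, lambda a, b: abs(a - b), threshold=10.0)
```

Five objects collapse into three clusters here. Swapping the placeholder distance for a perception-based one changes which objects merge without changing the clustering code, which is the separation of concerns the abstract implies.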

Key Information

Publication No.: EP4346235A1
Family ID: 83508456
Publication Date: 2024-04-03
Application No.: EP22198848A
Application Date: 2022-09-29
Priority Date: 2022-09-29
Granted: No
