Apparatus, Method and Computer Program for Synthesizing a Spatially Extended Sound Source Using Elementary Spatial Sectors

Publication: WO2023083752A1
Published: 2023-05-19
Family Size: 9
Granted: Yes (1/9)

Simple Summary

Content extracted from patent full text and abstract with AI.

This invention covers an apparatus, a method, and a computer program for synthesizing or rendering a spatially extended sound source (SESS), that is, a source that occupies a region of space rather than a single point. Instead of treating the sound as a point source, the approach divides the space around the listener into elementary spatial sectors (small angular regions) and stores pre-computed rendering data for each sector. At run time, the system identifies which sectors the source covers, given the listener's position and orientation, combines the stored data for those sectors into target rendering data, and uses that data to process the audio signal. Modifications such as occlusion (e.g., bushes blocking part of a source) or distance-based effects can be handled by weighting the affected sectors. The intended result is efficient 3D and VR audio rendering that models source size, directionality, and occlusion.
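The sector identification step described above can be sketched as follows. This is a minimal illustration, not the patented method: it assumes a 2D scene, equal-width azimuth sectors, and a circular source, and the function name `sector_indices` and all of its parameters are hypothetical.

```python
import math

def sector_indices(listener_xy, listener_yaw, source_xy, source_radius, n_sectors=36):
    """Return indices of the azimuth sectors overlapped by a circular source.

    Assumption: sector i spans [i * width, (i + 1) * width) radians, measured
    counter-clockwise from the listener's facing direction (listener_yaw).
    """
    dx = source_xy[0] - listener_xy[0]
    dy = source_xy[1] - listener_xy[1]
    dist = math.hypot(dx, dy)
    if dist <= source_radius:
        # The listener is inside the source, so it covers every sector.
        return list(range(n_sectors))
    width = 2 * math.pi / n_sectors
    centre = math.atan2(dy, dx) - listener_yaw  # source direction relative to the head
    half = math.asin(source_radius / dist)      # half of the source's angular extent
    first = math.floor((centre - half) / width)
    last = math.floor((centre + half) / width)
    # Wrap indices so that sectors behind or to the right of 0 rad stay in range.
    return [i % n_sectors for i in range(first, last + 1)]
```

With 36 sectors of 10 degrees each, a source of radius 1 located 10 units straight ahead spans roughly 11.5 degrees and therefore touches the two sectors on either side of the facing direction. Turning the listener only changes `listener_yaw`; the stored per-sector data would stay the same.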

Use Cases

  • Virtual reality and augmented reality applications where realistic 3D sound environments are needed.
  • Gaming, to provide immersive and directional soundscapes that respond to player movement.
  • Film and multimedia production, rendering more natural environmental and ambient sounds for surround or binaural playback.
  • Simulation and training environments (e.g., flight simulators, industrial safety training) where accurate spatial audio cues improve realism and user orientation.
  • Hearing aid or audio enhancement devices, to model spatial characteristics of real-world sources for better localization and clarity.
  • Interactive museum or art installations that use spatial audio to create engaging experiences.
  • Broadcast and live event audio where sound needs to be adapted for listeners in different positions and environments.

Benefits

  • Enables highly realistic, immersive binaural and spatial audio rendering, faithfully reproducing the apparent size, shape, and occlusion of sound sources.
  • Efficient computation and reduced memory use, because rendering data is pre-calculated per sector and looked up at run time.
  • Dynamic adaptation to listener movement, head orientation, and changes in scene geometry (e.g., objects moving or becoming occluded).
  • Supports complex source and environmental modeling, including selective spatial modifications like occlusions and distance effects, with minimal runtime computational overhead.
  • User-centric and scalable approach: suitable for both headphone and loudspeaker reproduction, in 2D or 3D audio formats.
  • Improves the accuracy of spatial auditory cues (like level, phase, and coherence), enhancing source localization and the quality of virtual sound scenes.
  • Easily extensible and compatible with standardized audio metadata and formats (e.g., MPEG-I, object-based audio).
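
The selective modifications listed above (occlusion and distance effects applied by weighting sectors) can be pictured with the short sketch below. It assumes, purely for illustration, that each sector's rendering data is a (left, right) gain pair and that occlusion is a single attenuation factor; the names `occlusion_weights` and `weighted_target` are hypothetical and do not come from the patent.

```python
def occlusion_weights(source_sectors, occluded_sectors, attenuation=0.25):
    """Give unobstructed sectors weight 1.0 and occluded sectors `attenuation`."""
    return {s: (attenuation if s in occluded_sectors else 1.0) for s in source_sectors}

def weighted_target(rendering_data, weights):
    """Weighted mean of the per-sector (left, right) gains.

    rendering_data maps a sector index to an assumed (left_gain, right_gain) pair.
    """
    total = sum(weights.values())
    if total == 0:
        # Every sector is fully occluded, so nothing is audible.
        return (0.0, 0.0)
    left = sum(w * rendering_data[s][0] for s, w in weights.items()) / total
    right = sum(w * rendering_data[s][1] for s, w in weights.items()) / total
    return (left, right)
```

Because the result is normalized by the total weight, occlusion in this sketch shifts the balance toward the unobstructed sectors without lowering the overall level. Dropping the normalization would make occluded sources quieter as well; which behavior is appropriate is a design choice this sketch does not settle.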

Technical Classifications (CPCs)

Main Classifications

Electrical & Electronic Tech

Sub Classifications

Electric Communication Technique

CPC Codes

H04S7/302

H04S7/303

H04S7/307

Inventors & Applicants

Applicants

Fraunhofer Ges Forschung

Univ Friedrich Alexander Er

Patent Abstract

An apparatus for synthesizing a spatially extended sound source (SESS) (7000), comprises: a storage (200, 2000) for storing rendering data items for different elementary spatial sectors covering a rendering range for a listener; a sector identification processor (4000) for identifying, from the different elementary spatial sectors, a set of elementary spatial sectors belonging to the spatially extended sound source based on listener data and spatially extended sound source data; a target data calculator (5000) for calculating target rendering data from the rendering data items for the set of elementary spatial sectors; and an audio processor (300, 3000) for processing an audio signal representing the spatially extended sound source using the target rendering data.
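
The data flow between the four components named in the abstract (storage, sector identification processor, target data calculator, audio processor) can be sketched as below. This is an assumption-laden illustration only: the rendering data is reduced to a left/right gain pair, the target data is a plain average, and `SectorData`, `calculate_target`, `process_audio`, and `synthesize` are hypothetical names rather than anything defined in the patent.

```python
from dataclasses import dataclass

@dataclass
class SectorData:
    """Assumed rendering data item for one elementary spatial sector."""
    left_gain: float
    right_gain: float

def calculate_target(storage, sector_ids):
    """Target data calculator: average the stored items of the selected sectors."""
    if not sector_ids:
        raise ValueError("the source covers no sector")
    n = len(sector_ids)
    left = sum(storage[s].left_gain for s in sector_ids) / n
    right = sum(storage[s].right_gain for s in sector_ids) / n
    return SectorData(left, right)

def process_audio(mono, target):
    """Audio processor: turn a mono signal into (left, right) sample pairs."""
    return [(x * target.left_gain, x * target.right_gain) for x in mono]

def synthesize(storage, identify, listener, source, mono):
    """Chain the components: identify sectors, derive target data, process audio."""
    sector_ids = identify(listener, source)          # sector identification processor
    target = calculate_target(storage, sector_ids)   # target data calculator
    return process_audio(mono, target)               # audio processor
```

Here `storage` is any mapping from sector index to `SectorData`, and `identify` is a caller-supplied function that maps listener data and source data to a list of sector indices. Keeping identification as a parameter mirrors the separation of components in the abstract without committing to a particular geometry.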

Key Information

Publication No.

WO2023083752A1

Family ID

78709219

Publication Date

2023-05-19

Application No.

EP2022080996W

Application Date

2022-11-07

Priority Date

2021-11-09

Granted

Yes (1/9)

Possible Cooperation

For further information please contact the transfer office.