Apparatus and method for merging geometry-based spatial audio coding streams

Publication: EP2600343A1
Published: 2013-06-05
Family Size: 36
Granted: Yes (14/36)

Simple Summary

(Content extracted from patent full text and abstract with AI.)

This patent describes an apparatus and method for merging multiple geometry-based spatial audio coding streams into a single audio data stream. A demultiplexer splits each multi-layer input stream into single-layer streams, and a merging module combines them, based on audio parameters such as sound pressure, position, and diffuseness, into a merged stream of one or more layers. This makes it possible to compose a customized sound scene from different audio sources or environments, giving flexibility in sound scene manipulation and efficient audio data handling, for example in immersive audio applications.

Use Cases

(Content extracted from patent full text and abstract with AI.)

  • Teleconferencing or videoconferencing systems where multiple participants in different environments need to be seamlessly integrated into a single virtual sound space.
  • Merging sound scenes from different recording environments in media production (film, TV, VR).
  • Interactive audio experiences in virtual reality (VR) or augmented reality (AR), where dynamic spatial audio scenes are composed on the fly.
  • Live event broadcasting, where multiple stage or room audio sources are combined for an enhanced spatial audio experience for remote listeners.
  • Game audio engines, enabling the combination of multiple in-game sound environments.
  • Post-production audio mixing for creating immersive soundtracks by merging different audio recordings.

Benefits

(Content extracted from patent full text and abstract with AI.)

  • Allows flexible and efficient merging of spatial audio streams from different sources or environments.
  • Enables real-time manipulation of the spatial properties (position, orientation, diffuseness) of sound sources within merged scenes.
  • Supports high-quality, immersive spatial audio experiences for end-users, regardless of loudspeaker setups or listening positions.
  • Reduces network and computational load by merging streams before synthesis or transmission, optimizing bandwidth usage.
  • Permits insertion of artificial (synthetic) sound sources for mixed reality or enhanced audio effects.
  • Facilitates advanced audio scene editing, including selective enhancement, suppression, or movement of sound sources, thus supporting a wide range of applications from teleconferencing to entertainment.

Technical Classifications (CPCs)

Main Classifications

Electrical & Electronic Tech

Physics & Measurement

Sub Classifications

Computing & Calculating

Electric Communication Technique

Musical Instruments & Acoustics

CPC Codes

G06F18/2134, G10L19/008, G10L19/18, G10L21/0272, H04R5/00, H04S3/008, H04S7/30

Inventors & Applicants

Applicants

Fraunhofer Ges Forschung

Univ Friedrich Alexander Er

Patent Abstract

An apparatus for generating a merged audio data stream is provided. The apparatus comprises a demultiplexer (180) for obtaining a plurality of single-layer audio data streams, wherein the demultiplexer (180) is adapted to receive one or more input audio data streams, wherein each input audio data stream comprises one or more layers, wherein the demultiplexer (180) is adapted to demultiplex each one of the input audio data streams having one or more layers into two or more demultiplexed audio data streams having exactly one layer, such that the two or more demultiplexed audio data streams together comprise the one or more layers of the input audio data stream. Furthermore, the apparatus comprises a merging module (190) for generating the merged audio data stream, having one or more layers, based on the plurality of single-layer audio data streams. Each layer of the input audio data streams, of the demultiplexed audio data streams, of the single-layer audio data streams and of the merged audio data stream comprises a pressure value of a pressure signal, a position value and a diffuseness value as audio data.
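The data flow in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: the names `Layer`, `AudioDataStream`, `demultiplex`, `direct_power` and `merge` are hypothetical, the 2D position and the [0, 1] diffuseness range are assumptions, and the rule of keeping the layers with the largest non-diffuse power is a stand-in chosen only to make the example concrete. The abstract itself does not say how the merging module selects or combines layers.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Layer:
    """One layer of audio data, with the three values named in the abstract."""
    pressure: complex               # pressure value of a pressure signal
    position: Tuple[float, float]   # position value (2D is an assumption)
    diffuseness: float              # diffuseness value, assumed to lie in [0, 1]


@dataclass
class AudioDataStream:
    """An audio data stream made of one or more layers."""
    layers: List[Layer]


def demultiplex(stream: AudioDataStream) -> List[AudioDataStream]:
    """Split a stream into streams that each hold exactly one layer."""
    return [AudioDataStream([layer]) for layer in stream.layers]


def direct_power(layer: Layer) -> float:
    """Hypothetical ranking score: the non-diffuse share of the layer's power."""
    return abs(layer.pressure) ** 2 * (1.0 - layer.diffuseness)


def merge(singles: List[AudioDataStream], max_layers: int) -> AudioDataStream:
    """Merge single-layer streams into one stream with at most max_layers layers.

    The selection rule (keep the strongest layers) is an assumption made
    for illustration only; it is not taken from the patent.
    """
    layers = [s.layers[0] for s in singles]
    layers.sort(key=direct_power, reverse=True)
    return AudioDataStream(layers[:max_layers])


# Example with made-up values: two input streams merged into two layers.
stream_a = AudioDataStream([
    Layer(1.0 + 0.0j, (0.0, 1.0), 0.1),
    Layer(0.5 + 0.0j, (2.0, 0.0), 0.5),
])
stream_b = AudioDataStream([Layer(2.0 + 0.0j, (1.0, 1.0), 0.9)])

singles = demultiplex(stream_a) + demultiplex(stream_b)
merged = merge(singles, max_layers=2)
```

In this sketch a single-layer input passes through `demultiplex` as one single-layer stream, and `merge` simply discards the weakest layers. A real merging module would more likely combine the audio data of the layers than drop it.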

Key Information

Publication No.: EP2600343A1
Family ID: 45047686
Publication Date: 2013-06-05
Application No.: EP11191816A
Application Date: 2011-12-02
Priority Date: 2011-12-02
Granted: Yes (14/36)

Possible Cooperation

For further information please contact the transfer office.