Apparatus and Method for Encoding or Decoding Directional Audio Coding Parameters Using Quantization and Entropy Coding

Publication: WO2019097018A1
Published: 2019-05-23
Family Size: 69
Granted: Yes (27/69)

Simple SummaryContent extracted from patent full text and abstract with AI.

This invention provides an apparatus and method for compressing and encoding the parameters used in directional audio coding (DirAC), which describe how sound is located and spread in space (diffuseness and directional parameters). The method intelligently compresses these parameters using quantization and entropy coding, significantly reducing the bit-rate needed to transmit or store spatial audio, while maintaining high audio quality and spatial accuracy. The quantization is adaptive, balancing precision based on the importance of different audio regions, allowing efficient representation of 3D spatial audio scenes at low data rates.

Use CasesContent extracted from patent full text and abstract with AI.

  • Immersive 3D audio in virtual reality (VR) and augmented reality (AR) environments, enabling efficient streaming of spatial audio metadata.
  • Spatial audio for online gaming or simulations, providing accurate sound localization with minimal bandwidth.
  • Teleconferencing systems that require clear, spatially accurate audio representation with low transmission rates.
  • Entertainment and live broadcasting applications needing surround or 3D audio without high data overhead.
  • Hearing aids or assistive listening devices that benefit from low-latency, spatially accurate audio rendering.
  • Cinema audio post-production, allowing flexible manipulation and efficient storage or transmission of spatial sound information.

BenefitsContent extracted from patent full text and abstract with AI.

  • Enables efficient compression of directional spatial audio metadata (such as DirAC) to very low bit-rates, reducing bandwidth and storage requirements.
  • Preserves high spatial accuracy and audio quality despite compression, maintaining an immersive listening experience.
  • Adaptive quantization leverages perceptual audio cues, providing higher precision where human listeners are more sensitive and less where sound is less critical.
  • Constant or logarithmic complexity encoding/decoding allows for real-time processing and implementation on resource-constrained devices.
  • Flexible framework supports various spatial audio formats and playback configurations (e.g., Ambisonics, surround sound, object-based audio).
  • Robustness to variable audio scenes, supporting both speech and complex, dynamic environments with multiple moving sources.

Technical Classifications (CPCs)

Main Classifications

Electrical & Electronic Tech

Physics & Measurement

Sub Classifications

Electronic Circuitry

Musical Instruments & Acoustics

CPC Codes

G10L19/00G10L19/008G10L19/0204G10L19/032G10L19/038G10L19/167G10L19/26G10L25/15G10L25/21H03M7/3082H03M7/6005H03M7/6011

Inventors & Applicants

Applicants

Fraunhofer Ges Forschung

Univ Friedrich Alexander Er

Patent Abstract

An apparatus for encoding directional audio coding parameters comprising diffuseness parameters and direction parameters, comprises: a parameter quantizer (210) for quantizing the diffuseness parameters and the direction parameters; a parameter encoder (220) for encoding quantized diffuseness parameters and quantized direction parameters; and an output interface (230) for generating an encoded parameter representation comprising information on encoded diffuseness parameters and encoded direction parameters.

Key Information

Publication No.

WO2019097018A1

Family ID

60515115

Publication Date

2019-05-23

Application No.

EP2018081623W

Application Date

2018-11-16

Priority Date

2017-11-17

Granted

Yes (27/69)

Possible Cooperation

For further information please contact the transfer office.