Masking Threshold Determinator, Audio Encoder, Method and Computer Program for Determining a Masking Threshold Information

Publication: WO2024008928A1
Published: 2024-01-11
Family Size: 3
Granted: No

Simple SummaryContent extracted from patent full text and abstract with AI.

This patent describes an advanced method and system for determining masking thresholds in audio signals, which are thresholds below which sounds are not perceptible to the human ear due to the presence of other sounds (masking effect). The invention uses a bank of filters (such as all-pole gammatone filters) with different bandwidths to analyze an audio signal in multiple frequency bands. By processing and combining these filtered signals—taking into account magnitude, phase, and non-linear psychoacoustic properties—the system can accurately estimate frequency-dependent masking thresholds. This information is then used to optimize how audio is encoded (compressed), improving audio quality or reducing required data rates.

Use CasesContent extracted from patent full text and abstract with AI.

  • Integration in audio codecs for music streaming services to improve sound quality at lower bitrates.
  • Optimization of audio compression in portable devices (such as smartphones or media players) to reduce storage or transmission requirements.
  • Implementation in professional audio production software to enhance perceptual coding and mastering processes.
  • Use in hearing aids or sound processing devices for better adaptation to human auditory perception.
  • Deployment in broadcasting systems (radio, television) to maintain audio quality while saving bandwidth.
  • Enhancement of virtual or augmented reality audio systems for more realistic sound rendering.

BenefitsContent extracted from patent full text and abstract with AI.

  • More accurate modeling of human auditory masking, resulting in better perceptual audio quality at equal or lower bitrates.
  • Allows audio encoders to allocate more bits to perceptually important parts of the signal and fewer bits to masked or inaudible components, increasing coding efficiency.
  • Supports both magnitude and phase information as well as non-linear psychoacoustic effects, leading to improved psychoacoustic model realism.
  • Reduces computational complexity compared to prior approaches while maintaining or improving audio fidelity.
  • Enables flexible adaptation to various audio content and encoding scenarios (speech, music, complex soundscapes).
  • Can be implemented as software or hardware, and is compatible with current and future audio coding standards.

Technical Classifications (CPCs)

Main Classifications

Electrical & Electronic Tech

Physics & Measurement

Sub Classifications

Electronic Circuitry

Musical Instruments & Acoustics

CPC Codes

G10L19/032G10L25/48H03M7/3066H03M7/6011

Inventors & Applicants

Applicants

Fraunhofer Ges Forschung

Univ Friedrich Alexander Er

Patent Abstract

Embodiments are related to a masking threshold determinator (100, 200, 340, 1200, 1300, 3200), wherein the masking threshold determinator is configured to obtain a plurality of bandpass signals (111, 211, 311, 1211, 1311, 3211) using a plurality of filters (110, 210, 310, 1210, 1310, 3210) having different bandwidths; and wherein the masking threshold determinator is configured to obtain a masking threshold information associated with a given frequency region on the basis of bandpass signal values of at least two bandpass signals. Furthermore, audio encoders, methods and computer programs are disclosed.

Key Information

Publication No.

WO2024008928A1

Family ID

82403405

Publication Date

2024-01-11

Application No.

EP2023068858W

Application Date

2023-07-07

Priority Date

2022-07-07

Granted

No

Possible Cooperation

For further information please contact the transfer office.