Masking Threshold Determinator, Audio Encoder, Method and Computer Program for Determining a Masking Threshold Information
Simple SummaryContent extracted from patent full text and abstract with AI.
This patent describes an advanced method and system for determining masking thresholds in audio signals, which are thresholds below which sounds are not perceptible to the human ear due to the presence of other sounds (masking effect). The invention uses a bank of filters (such as all-pole gammatone filters) with different bandwidths to analyze an audio signal in multiple frequency bands. By processing and combining these filtered signals—taking into account magnitude, phase, and non-linear psychoacoustic properties—the system can accurately estimate frequency-dependent masking thresholds. This information is then used to optimize how audio is encoded (compressed), improving audio quality or reducing required data rates.
Use CasesContent extracted from patent full text and abstract with AI.
- Integration in audio codecs for music streaming services to improve sound quality at lower bitrates.
- Optimization of audio compression in portable devices (such as smartphones or media players) to reduce storage or transmission requirements.
- Implementation in professional audio production software to enhance perceptual coding and mastering processes.
- Use in hearing aids or sound processing devices for better adaptation to human auditory perception.
- Deployment in broadcasting systems (radio, television) to maintain audio quality while saving bandwidth.
- Enhancement of virtual or augmented reality audio systems for more realistic sound rendering.
BenefitsContent extracted from patent full text and abstract with AI.
- More accurate modeling of human auditory masking, resulting in better perceptual audio quality at equal or lower bitrates.
- Allows audio encoders to allocate more bits to perceptually important parts of the signal and fewer bits to masked or inaudible components, increasing coding efficiency.
- Supports both magnitude and phase information as well as non-linear psychoacoustic effects, leading to improved psychoacoustic model realism.
- Reduces computational complexity compared to prior approaches while maintaining or improving audio fidelity.
- Enables flexible adaptation to various audio content and encoding scenarios (speech, music, complex soundscapes).
- Can be implemented as software or hardware, and is compatible with current and future audio coding standards.
Technical Classifications (CPCs)
Main Classifications
Electrical & Electronic Tech
Physics & Measurement
Sub Classifications
Electronic Circuitry
Musical Instruments & Acoustics
CPC Codes
Inventors & Applicants
Applicants
Fraunhofer Ges Forschung
Univ Friedrich Alexander Er
Patent Abstract
Embodiments are related to a masking threshold determinator (100, 200, 340, 1200, 1300, 3200), wherein the masking threshold determinator is configured to obtain a plurality of bandpass signals (111, 211, 311, 1211, 1311, 3211) using a plurality of filters (110, 210, 310, 1210, 1310, 3210) having different bandwidths; and wherein the masking threshold determinator is configured to obtain a masking threshold information associated with a given frequency region on the basis of bandpass signal values of at least two bandpass signals. Furthermore, audio encoders, methods and computer programs are disclosed.
Key Information
Publication No.
WO2024008928A1
Family ID
82403405
Publication Date
2024-01-11
Application No.
EP2023068858W
Application Date
2023-07-07
Priority Date
2022-07-07
Granted
No
Possible Cooperation
For further information please contact the transfer office.