Apparatus, Method, Computer Program and Bitstream for Quality Control And/or Enhancement of Audio Scenes

Publication: WO2025022009A1

Published: 2025-01-30

Family Size: 1

Granted: No

Simple SummaryContent extracted from patent full text and abstract with AI.

This patent introduces an apparatus, method, computer program, and bitstream for quality control and/or enhancement of audio scenes. The core invention analyzes audio content to automatically detect 'critical passages' where speech intelligibility is compromised due to background noise or music. By measuring short-term intensity differences between speech (dialogue) and background portions, the system generates reports, metadata, and/or adjustments to improve listening comfort and clarity. It can process both fully mixed and multi-channel/object-based audio formats and supports integration in audio production, streaming, broadcast, and playback devices. Enhancements can be performed either during production or dynamically at playback, based on metadata or user/device/environment settings.

Use CasesContent extracted from patent full text and abstract with AI.

Broadcast television and streaming services to ensure that dialogue in movies and TV shows remains clear and intelligible, even with dynamic background music or sound effects.
Automated quality control during audio production, post-production, and mastering to identify problematic audio passages and suggest or apply corrections.
Integration in Next Generation Audio (NGA) systems (e.g., MPEG-H Audio, AAC, AC-4) to enable dialogue enhancement, especially for users with hearing impairments or in noisy environments.
On-device audio enhancement in TVs, smartphones, tablets, and hearing aids, where playback can be dynamically optimized based on the audio scene and current listening environment.
Providing accessibility features for audiences with special needs by identifying and enhancing sections with poor speech intelligibility.
Streaming platforms can use the generated metadata to enable user-selectable dialogue enhancement or automatic adaptation based on environmental noise or personal preference.

BenefitsContent extracted from patent full text and abstract with AI.

Significantly improves speech intelligibility in audio content by automatically detecting and enhancing problematic passages, especially in complex mixes with loud backgrounds.
Low computational complexity: Focuses on low-level intensity measures rather than requiring complex cognitive or linguistic analysis, making it suitable for real-time processing and broad deployment.
Flexible integration: Can be implemented as a standalone tool, production plugin, or embedded in existing audio encoding, broadcast, or playback frameworks as an extension (e.g., via metadata).
Supports a wide range of audio formats and delivery channels, including traditional mixes and modern object-based audio scenes.
Enables individualized listening experiences and accessibility by allowing enhancement to be tailored to user preferences, device capabilities, and the listening environment.
Allows content creators and broadcasters to ensure regulatory compliance and higher quality of experience with minimal manual effort.

Technical Classifications (CPCs)

Main Classifications

Physics & Measurement

Sub Classifications

Musical Instruments & Acoustics

CPC Codes

G10L19/00G10L21/00G10L25/21G10L25/30G10L25/45G10L25/60

Inventors & Applicants

Inventors

Applicants

Fraunhofer Ges Forschung

Univ Friedrich Alexander Er

Patent Abstract

Embodiments according to the invention comprise apparatuses, methods, computer programs and bitstreams for quality control and/or enhancement of audio scenes. Embodiments according to the invention are related to apparatuses and methods for quality control and enhancement of audio scenes.

Key Information

Publication No.

WO2025022009A1

Family ID

87517141

Publication Date

2025-01-30

Application No.

EP2024071374W

Application Date

2024-07-26

Priority Date

2023-07-26

Granted

Possible Cooperation

For further information please contact the transfer office.

See full document in Espacenet