Apparatus, Method, Computer Program and Bitstream for Quality Control And/or Enhancement of Audio Scenes
Simple SummaryContent extracted from patent full text and abstract with AI.
This patent introduces an apparatus, method, computer program, and bitstream for quality control and/or enhancement of audio scenes. The core invention analyzes audio content to automatically detect 'critical passages' where speech intelligibility is compromised due to background noise or music. By measuring short-term intensity differences between speech (dialogue) and background portions, the system generates reports, metadata, and/or adjustments to improve listening comfort and clarity. It can process both fully mixed and multi-channel/object-based audio formats and supports integration in audio production, streaming, broadcast, and playback devices. Enhancements can be performed either during production or dynamically at playback, based on metadata or user/device/environment settings.
Use CasesContent extracted from patent full text and abstract with AI.
- Broadcast television and streaming services to ensure that dialogue in movies and TV shows remains clear and intelligible, even with dynamic background music or sound effects.
- Automated quality control during audio production, post-production, and mastering to identify problematic audio passages and suggest or apply corrections.
- Integration in Next Generation Audio (NGA) systems (e.g., MPEG-H Audio, AAC, AC-4) to enable dialogue enhancement, especially for users with hearing impairments or in noisy environments.
- On-device audio enhancement in TVs, smartphones, tablets, and hearing aids, where playback can be dynamically optimized based on the audio scene and current listening environment.
- Providing accessibility features for audiences with special needs by identifying and enhancing sections with poor speech intelligibility.
- Streaming platforms can use the generated metadata to enable user-selectable dialogue enhancement or automatic adaptation based on environmental noise or personal preference.
BenefitsContent extracted from patent full text and abstract with AI.
- Significantly improves speech intelligibility in audio content by automatically detecting and enhancing problematic passages, especially in complex mixes with loud backgrounds.
- Low computational complexity: Focuses on low-level intensity measures rather than requiring complex cognitive or linguistic analysis, making it suitable for real-time processing and broad deployment.
- Flexible integration: Can be implemented as a standalone tool, production plugin, or embedded in existing audio encoding, broadcast, or playback frameworks as an extension (e.g., via metadata).
- Supports a wide range of audio formats and delivery channels, including traditional mixes and modern object-based audio scenes.
- Enables individualized listening experiences and accessibility by allowing enhancement to be tailored to user preferences, device capabilities, and the listening environment.
- Allows content creators and broadcasters to ensure regulatory compliance and higher quality of experience with minimal manual effort.
Technical Classifications (CPCs)
Main Classifications
Physics & Measurement
Sub Classifications
Musical Instruments & Acoustics
CPC Codes
Inventors & Applicants
Applicants
Fraunhofer Ges Forschung
Univ Friedrich Alexander Er
Patent Abstract
Embodiments according to the invention comprise apparatuses, methods, computer programs and bitstreams for quality control and/or enhancement of audio scenes. Embodiments according to the invention are related to apparatuses and methods for quality control and enhancement of audio scenes.
Key Information
Publication No.
WO2025022009A1
Family ID
87517141
Publication Date
2025-01-30
Application No.
EP2024071374W
Application Date
2024-07-26
Priority Date
2023-07-26
Granted
No
Possible Cooperation
For further information please contact the transfer office.