Encoding by Reconstructing Phase Information Using a Structure Tensor on Audio Spectrograms

Publication: WO2017158105A1
Published: 2017-09-21
Family Size: 11
Granted: Yes (5/11)

Simple Summary

Content extracted from patent full text and abstract with AI.

This patent describes an apparatus and method for reconstructing the phase of an audio signal from its magnitude spectrogram using the structure tensor, a mathematical tool best known from image processing. The structure tensor yields the direction and rate of frequency change for each time-frequency bin of the spectrogram; these estimates drive the phase reconstruction and also help distinguish harmonic, percussive, and residual components. The method is particularly effective for frequency-modulated sounds (e.g., vibrato), which traditional methods have trouble classifying.
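The per-bin analysis described above can be illustrated with a minimal sketch. It is not taken from the patent text: the function names, the central-difference derivatives, the box-shaped smoothing window, and the anisotropy threshold `eps` are all assumptions made for illustration. The only fixed relation is the unit conversion: a slope of one frequency bin per time frame equals fs²/(hop · n_fft) Hz per second.

```python
import math

def local_frequency_change(S, b, k, fs, hop, n_fft, radius=2, eps=1e-12):
    """Structure-tensor estimate at time frame b, frequency bin k.

    S is a magnitude spectrogram indexed as S[k][b] (hypothetical layout).
    Returns (alpha, rate_hz_per_s, anisotropy).
    """
    t11 = t12 = t22 = 0.0
    # Sum gradient products over a (2*radius+1)^2 box window (assumed smoothing).
    for kk in range(k - radius, k + radius + 1):
        for bb in range(b - radius, b + radius + 1):
            d_b = (S[kk][bb + 1] - S[kk][bb - 1]) / 2.0  # derivative along time
            d_k = (S[kk + 1][bb] - S[kk - 1][bb]) / 2.0  # derivative along frequency
            t11 += d_b * d_b
            t12 += d_b * d_k
            t22 += d_k * d_k

    # Eigenvalues of the symmetric 2x2 tensor [[t11, t12], [t12, t22]].
    mean = (t11 + t22) / 2.0
    dev = math.hypot((t11 - t22) / 2.0, t12)
    lam_max, lam_min = mean + dev, mean - dev

    # Angle of the dominant gradient; the ridge runs perpendicular to it.
    theta = 0.5 * math.atan2(2.0 * t12, t11 - t22)
    alpha = theta + math.pi / 2.0
    if alpha > math.pi / 2.0:
        alpha -= math.pi  # wrap into (-pi/2, pi/2]

    # tan(alpha) is the slope in bins per frame; convert it to Hz per second.
    rate = fs * fs / (hop * n_fft) * math.tan(alpha)

    # Anisotropy near 1 means a clearly oriented structure, near 0 means none.
    total = lam_max + lam_min
    anisotropy = ((lam_max - lam_min) / total) ** 2 if total >= eps else 0.0
    return alpha, rate, anisotropy

def ridge_spectrogram(n_bins, n_frames, k0, slope, width=2.0):
    """Synthetic test input: one Gaussian ridge rising `slope` bins per frame."""
    return [[math.exp(-0.5 * ((k - (k0 + slope * b)) / width) ** 2)
             for b in range(n_frames)] for k in range(n_bins)]

# A ridge rising one bin per frame, as a steadily gliding partial would.
S = ridge_spectrogram(32, 32, k0=4, slope=1.0)
alpha, rate, anisotropy = local_frequency_change(
    S, b=12, k=16, fs=16000, hop=256, n_fft=1024)
print(alpha, rate, anisotropy)
```

In this sketch a horizontal ridge (a steady tone) gives an angle near zero, a vertical one (a transient) gives an angle near ±π/2, and a vibrato-like glide falls in between, which is the property that makes the angle useful for separating harmonic from percussive content.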

Use Cases

  • Audio signal compression and coding, by transmitting only magnitude spectrograms and reconstructing phases at the decoder for efficient storage or transmission.
  • Music information retrieval, including tasks like instrument separation, beat tracking, and vocal detection.
  • Audio restoration or enhancement, particularly for remastering, cleaning, or isolating certain components from historical or noisy recordings.
  • Real-time audio effects and remixing, enabling more precise manipulation of percussive and harmonic elements independently (e.g., dynamic upmixing or equalization).
  • Automatic music transcription, chord detection, and rhythm analysis in digital audio workstations (DAWs) and research tools.

Benefits

  • Provides more accurate separation and reconstruction of audio components, especially for signals with frequency modulation (e.g., vibrato), where earlier methods fall short.
  • Enables lossy audio coding at lower bitrates, since only magnitude data needs to be transmitted and the phase is reconstructed at the receiver or decoder.
  • Reduces computational complexity compared to prior state-of-the-art phase reconstruction methods, making it practical for real-time or embedded applications.
  • Improves perceived audio quality when reconstructing signals from spectrograms, due to better phase estimation.
  • Versatile implementation: The method can be applied in software, hardware, or hybrid systems, and integrated into existing digital audio processing pipelines.

Technical Classifications (CPCs)

Main Classifications

Electrical & Electronic Tech

Physics & Measurement

Sub Classifications

Computing & Calculating

Electric Communication Technique

Musical Instruments & Acoustics

CPC Codes

G06F16/683, G10L19/00, G10L21/00, G10L25/18, H04R3/04, H04S7/307

Inventors & Applicants

Applicants

Fraunhofer Ges Forschung

Friedrich-Alexander-Universitaet Erlangen-Nuernberg

Patent Abstract

An apparatus for phase reconstruction from a magnitude spectrogram of an audio signal is provided. The apparatus comprises a frequency change determiner (110) being configured to determine a change of a frequency for each time-frequency bin of a plurality of time-frequency bins of the magnitude spectrogram of the audio signal depending on the magnitude spectrogram of the audio signal, and a phase reconstructor (140) being configured to generate phase values for the plurality of time-frequency bins depending on the changes of the frequencies determined for the plurality of the time-frequency bins.
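As a rough illustration of how phase values could follow from per-bin frequency changes, the sketch below propagates the phase of a single sinusoidal track from frame to frame. It is an assumption-laden reading of the abstract, not the claimed method: it assumes the frequency varies linearly within each hop (a chirp model), and `propagate_phase` and its parameters are hypothetical names. Under that model the phase advance over one hop of duration dt is 2π(f·dt + ½·R·dt²), where f is the frequency at the start of the hop and R is the frequency change rate in Hz per second.

```python
import math

def propagate_phase(phi0, freqs_hz, rates_hz_per_s, fs, hop):
    """Accumulate phase over frames for one sinusoidal track.

    freqs_hz[b] is the frequency at the start of frame b and
    rates_hz_per_s[b] is its estimated change rate; both are assumed
    to come from an earlier analysis step. Returns wrapped phases,
    one per frame boundary, starting with phi0.
    """
    dt = hop / fs  # hop duration in seconds
    phases = [phi0 % (2.0 * math.pi)]
    for f, r in zip(freqs_hz, rates_hz_per_s):
        # Integral of the linearly changing frequency f + r*t over one hop.
        advance = 2.0 * math.pi * (f * dt + 0.5 * r * dt * dt)
        phases.append((phases[-1] + advance) % (2.0 * math.pi))
    return phases

# Usage: a tone gliding upward from 440 Hz at 2000 Hz per second.
fs, hop = 16000, 256
dt = hop / fs
freqs = [440.0 + 2000.0 * b * dt for b in range(10)]
phases = propagate_phase(0.0, freqs, [2000.0] * 10, fs, hop)
print(phases[:3])
```

Setting every rate to zero reduces this to the plain phase advance used for stationary tones, so the rate term is what lets the sketch follow vibrato-like frequency glides.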

Key Information

Publication No.: WO2017158105A1
Family ID: 55646317
Publication Date: 2017-09-21
Application No.: EP2017056263W
Application Date: 2017-03-16
Priority Date: 2016-03-18
Granted: Yes (5/11)
