A quasi-orthogonal, invertible, and perceptually relevant time-frequency transform for audio codingReport as inadecuate




A quasi-orthogonal, invertible, and perceptually relevant time-frequency transform for audio coding - Download this document for free, or read online. Document in PDF available to download.

* Corresponding author 1 Sons LMA - Laboratoire de Mécanique et d-Acoustique Marseille 2 Sons ARI - Acoustics Research Institute

Abstract : We describe ERB-MDCT, an invertible real-valued time-frequency transform based on MDCT, which is widely used in audio coding e.g. MP3 and AAC. ERB-MDCT was designed similarly to ERBLet, a recent invertible transform with a resolution evolving across frequency to match the perceptual ERB frequency scale, while the frequency scale in most invertible transforms e.g. MDCT is uniform. ERB-MDCT has mostly the same frequency scale as ERBLet, but the main improvement is that atoms are quasi-orthogonal, i.e. its redundancy is close to 1. Furthermore, the energy is more sparse in the time-frequency plane. Thus, it is more suitable for audio coding than ERBLet.

Keywords : Non-stationary time-frequency transforms ERB filters MDCT Audio coding





Author: Olivier Derrien - Thibaud Necciari - Peter Balazs -

Source: https://hal.archives-ouvertes.fr/



DOWNLOAD PDF




Related documents