Deep prior audio compression

Švento, Michal; Balušík, Peter

Files

226-eeict-2024.pdf (881.13 KB)

Date

2024

Authors

Švento, Michal

Balušík, Peter

Publisher

Vysoké učení technické v Brně, Fakulta elektrotechniky a komunikačních technologií

Abstract

Audio compression remains an active research topic because the demand for large data streams is rapidly increasing. Deep learning has introduced new algorithms that lower bitrates while maintaining good perceptual quality. A novel approach in generative artificial intelligence is to produce new data from a prior stored in the network parameters, called a deep prior. The deep audio prior framework has proven successful in various tasks such as inpainting, declipping, and bandwidth extension, but it has not been tested for compression. In this paper, we test this method with a network prebuilt for inpainting. Our compression idea is based on reducing the number of time-frequency coefficients in the spectrogram while still allowing high-quality reconstruction of the original signal.
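The coefficient-reduction step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say which coefficients are discarded or how the network fills them back in, so everything below is an assumption made for illustration: the helper names `stft` and `drop_coefficients`, the `keep_ratio` parameter, the random selection of retained coefficients, and the toy sine input are all hypothetical. The deep-prior reconstruction itself is not shown.

```python
import numpy as np


def stft(signal, frame_len=256, hop=128):
    """Short-time Fourier transform with a Hann window.

    Returns a complex array of shape (n_frames, frame_len // 2 + 1).
    """
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    return np.fft.rfft(frames, axis=1)


def drop_coefficients(spec, keep_ratio, seed=0):
    """Zero out all but a random subset of time-frequency coefficients.

    ASSUMPTION: random selection is used here only for illustration;
    the paper may select coefficients differently.
    Returns the masked spectrogram and the boolean mask of kept entries.
    """
    rng = np.random.default_rng(seed)
    n_keep = int(round(keep_ratio * spec.size))
    flat_mask = np.zeros(spec.size, dtype=bool)
    flat_mask[rng.choice(spec.size, size=n_keep, replace=False)] = True
    mask = flat_mask.reshape(spec.shape)
    return np.where(mask, spec, 0), mask


# Toy input: one second of a 440 Hz sine sampled at 8 kHz (hypothetical data).
sample_rate = 8000
t = np.arange(sample_rate) / sample_rate
signal = np.sin(2 * np.pi * 440 * t)

spec = stft(signal)
masked, mask = drop_coefficients(spec, keep_ratio=0.5)
kept_fraction = mask.mean()  # share of coefficients that would be stored
```

In a scheme of this kind, only the retained coefficients and the mask positions would need to be stored, and an inpainting-style model would be asked to restore the zeroed entries at decoding time.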

Keywords

audio processing, deep learning, deep audio prior, compression

Citation

Proceedings I of the 30th Conference STUDENT EEICT 2024: General papers. pp. 226-230. ISBN 978-80-214-6231-1
https://www.eeict.cz/eeict_download/archiv/sborniky/EEICT_2024_sbornik_1.pdf

Document type

Peer-reviewed

Document version

Published version

Language of document

en

Document licence