Deep prior audio compression
Loading...
Date
2024
Authors
Švento, Michal
Balušík, Peter
ORCID
Advisor
Referee
Mark
Journal Title
Journal ISSN
Volume Title
Publisher
Vysoké učení technické v Brně, Fakulta elektrotechniky a komunikačních technologií
Abstract
Audio compression is still an up-to-date topic because the demand for big data streams is rapidly increasing. Deep learning has brought up new algorithms that decrease bitrates with good perception quality. The novel approach in generative artificial intelligence is to produce new data from prior stored in network parameters, called a deep prior. The deep audio prior framework shows its success in various tasks such as inpainting, declipping, and bandwidth extension, but it has not been tested for compression. In this paper, we test this method with a prebuilt network for inpainting. Our idea of compression is based on reducing the number of time-frequency coefficients in the spectrogram while allowing the reconstruction of the original signal with high quality.
Description
Citation
Proceedings I of the 30st Conference STUDENT EEICT 2024: General papers. s. 226-230. ISBN 978-80-214-6231-1
https://www.eeict.cz/eeict_download/archiv/sborniky/EEICT_2024_sbornik_1.pdf
https://www.eeict.cz/eeict_download/archiv/sborniky/EEICT_2024_sbornik_1.pdf
Document type
Peer-reviewed
Document version
Published version
Date of access to the full text
Language of document
en
Study field
Comittee
Date of acceptance
Defence
Result of defence
Document licence
© Vysoké učení technické v Brně, Fakulta elektrotechniky a komunikačních technologií