Hybrid Deep Learning Model for Singing Voice Separation

Amer, Rusul; Al Tmeme, Ahmed

doi:10.13164/mendel.2021.2.044

Files

139-Article Text-364-4-10-20220124.pdf (479.49 KB)

Date

2021-12-21

Authors

Amer, Rusul

Al Tmeme, Ahmed

Publisher

Institute of Automation and Computer Science, Brno University of Technology

Abstract

Monaural source separation is challenging because only a single channel is available, so the problem admits an unlimited range of possible solutions. This paper presents a monaural source separation model based on a hybrid deep learning architecture that consists of a convolutional neural network (CNN), a dense neural network (DNN), and a recurrent neural network (RNN). A trial-and-error method is used to optimize the number of layers in the proposed model. The effects of the learning rate, the optimization algorithm, and the number of epochs on separation performance are also explored. The model was evaluated on the MIR-1K dataset for singing voice separation. The proposed approach achieves a 4.81 dB GNSDR gain, a 7.28 dB GSIR gain, and a 3.39 dB GSAR gain in comparison to current approaches.
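
The abstract reports results as global metrics (GNSDR, GSIR, GSAR). As a minimal sketch, assuming the convention common in MIR-1K singing voice separation work (the record itself does not define the metrics), a global score is the length-weighted mean of per-clip scores, and NSDR is the SDR of the estimate minus the SDR of the unprocessed mixture. The function names below are hypothetical and are not taken from the paper.

```python
from typing import Sequence


def nsdr(sdr_estimate_db: float, sdr_mixture_db: float) -> float:
    """Normalized SDR: improvement of the estimate over the raw mixture, in dB."""
    return sdr_estimate_db - sdr_mixture_db


def global_weighted_mean(values_db: Sequence[float],
                         lengths: Sequence[float]) -> float:
    """Length-weighted mean of per-clip scores (assumed form of GNSDR/GSIR/GSAR)."""
    if len(values_db) == 0 or len(values_db) != len(lengths):
        raise ValueError("values_db and lengths must be non-empty and equal in length")
    total = sum(lengths)
    if total <= 0:
        raise ValueError("total clip length must be positive")
    # Each clip contributes in proportion to its duration.
    return sum(v * w for v, w in zip(values_db, lengths)) / total


# Illustrative numbers only; these are not results from the paper.
per_clip_nsdr = [nsdr(6.0, 2.0), nsdr(5.0, 3.0)]   # [4.0, 2.0]
clip_lengths = [3.0, 1.0]                          # e.g. seconds
gnsdr = global_weighted_mean(per_clip_nsdr, clip_lengths)
```

Weighting by clip length keeps short clips from dominating the aggregate, which is why this convention is often preferred over a plain average.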

Keywords

Monaural Source Separation, Hybrid Deep Learning, Time Frequency Masking, Convolution Neural Network, Dense Neural Network, Recurrent Neural Network

Citation

Mendel. 2021, vol. 27, no. 2, pp. 44-50. ISSN 1803-3814
https://mendel-journal.org/index.php/mendel/article/view/139

Document type

Peer-reviewed

Document version

Published version

Language of document

en

DOI

10.13164/mendel.2021.2.044

URI

http://hdl.handle.net/11012/203391

Collections

Vol. 27, No. 2

Creative Commons license

Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International license

Except where otherwise noted, this item's license is described as Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International license
