Neural Networks With Dilated Convolutions For Sound Event Recognition

but.event.date: 27.04.2021 [cs]
but.event.title: STUDENT EEICT 2021 [cs]
dc.contributor.author: Miklanek, Stepan
dc.date.accessioned: 2021-07-21T07:06:55Z
dc.date.available: 2021-07-21T07:06:55Z
dc.date.issued: 2021 [cs]
dc.description.abstract: Convolutional neural networks, most commonly deployed in image classification tasks, typically use square-shaped convolutional kernels, which are well suited for feature extraction from two-dimensional data. This study explores the effect of utilizing spectrally aware dilated convolutions specialized for sound event recognition. By extending the base kernels in the time or the frequency dimension, the features extracted from the spectral audio representations should, in theory, better capture the temporal and timbral information of different sound events. The baseline neural network model with squared kernels was compared against three models, which used an increasing dilation factor in the subsequent convolutional layers. The three models were purposefully tuned to focus towards the frequency and time feature extraction. The results have shown that the models with dilated convolutions performed noticeably better in comparison with the baseline model. [en]
dc.format: text [cs]
dc.format.extent: 581-585 [cs]
dc.format.mimetype: application/pdf [en]
dc.identifier.citation: Proceedings I of the 27st Conference STUDENT EEICT 2021: General papers. s. 581-585. ISBN 978-80-214-5942-7 [cs]
dc.identifier.isbn: 978-80-214-5942-7
dc.identifier.uri: http://hdl.handle.net/11012/200699
dc.language.iso: en [cs]
dc.publisher: Vysoké učení technické v Brně, Fakulta elektrotechniky a komunikačních technologií [cs]
dc.relation.ispartof: Proceedings I of the 27st Conference STUDENT EEICT 2021: General papers [en]
dc.relation.uri: https://conf.feec.vutbr.cz/eeict/index/pages/view/ke_stazeni [cs]
dc.rights: © Vysoké učení technické v Brně, Fakulta elektrotechniky a komunikačních technologií [cs]
dc.rights.access: openAccess [en]
dc.subject: sound event recognition; convolutional neural networks; dilated convolution [en]
dc.title: Neural Networks With Dilated Convolutions For Sound Event Recognition [en]
dc.type.driver: conferenceObject [en]
dc.type.status: Peer-reviewed [en]
dc.type.version: publishedVersion [en]
eprints.affiliatedInstitution.department: Fakulta elektrotechniky a komunikačních technologií [cs]
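
The abstract describes extending the base kernels along the time or the frequency axis by dilation. As a rough illustration of what that means for the area a network can "see", the sketch below computes the span of a dilated kernel, d * (k - 1) + 1, and the resulting receptive field of a stack of stride-1 convolutions. The 3x3 kernel size, the three-layer depth, and the dilation factors 1, 2, 4 are assumptions chosen for illustration; the record does not state the configurations used in the paper, and the function names are hypothetical.

```python
def effective_kernel(kernel, dilation):
    """Span covered by a dilated kernel along each axis: d * (k - 1) + 1."""
    return tuple(d * (k - 1) + 1 for k, d in zip(kernel, dilation))


def receptive_field(layers):
    """Receptive field (frequency, time) of stacked stride-1 convolutions.

    `layers` is a list of (kernel, dilation) pairs, each a 2-tuple
    ordered as (frequency, time).
    """
    rf = [1, 1]
    for kernel, dilation in layers:
        for axis, span in enumerate(effective_kernel(kernel, dilation)):
            rf[axis] += span - 1  # each layer widens the field by span - 1
    return tuple(rf)


# Assumed example stacks (not taken from the paper); axes are (frequency, time).
baseline = [((3, 3), (1, 1))] * 3                      # square, undilated kernels
time_dilated = [((3, 3), (1, d)) for d in (1, 2, 4)]   # dilation grows along time only

print(receptive_field(baseline))      # (7, 7)
print(receptive_field(time_dilated))  # (7, 15)
```

With the same number of weights per layer, the time-dilated stack covers roughly twice as many time frames as the baseline while leaving the frequency extent unchanged; swapping the tuple order in the dilation would do the same along frequency.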
Files
Original bundle
Name: 581_eeict-2021_1.pdf
Size: 163.22 KB
Format: Adobe Portable Document Format