Cost-Efficient Development of Acoustic Models for Speech Recognition of Related Languages
Date
2013-09
Authors
Nouza, Jan
Cerva, Petr
Kucharova, Michaela
Publisher
Společnost pro radioelektronické inženýrství
Abstract
When adapting an existing speech recognition system to a new language, the major development costs are associated with creating an appropriate acoustic model (AM). Training it requires a certain amount of recorded and annotated speech. In this paper, we show that not only the annotation process but also the process of speech acquisition can be automated to minimize the need for human and expert work. We demonstrate the proposed methodology on the Croatian language, for which the target AM has been built via cross-lingual adaptation of a Czech AM in two ways: a) using the commercially available GlobalPhone database, and b) by automatic speech data mining from the HRT radio archive. The latter approach is cost-free, yet it yields comparable or better results in LVCSR experiments conducted on three Croatian test sets.
Citation
Radioengineering. 2013, vol. 22, no. 3, pp. 866-873. ISSN 1210-2512
http://www.radioeng.cz/fulltexts/2013/13_03_0866_0873.pdf
Document type
Peer-reviewed
Document version
Published version
Language of document
en