Cost-Efficient Development of Acoustic Models for Speech Recognition of Related Languages

Nouza, Jan; Cerva, Petr; Kucharova, Michaela

Files

13_03_0866_0873.pdf (284.22 KB)

Date

2013-09

Authors

Nouza, Jan

Cerva, Petr

Kucharova, Michaela

Publisher

Společnost pro radioelektronické inženýrství

Abstract

When adapting an existing speech recognition system to a new language, major development costs are associated with the creation of an appropriate acoustic model (AM). Its training requires a certain amount of recorded and annotated speech. In this paper, we show that not only the annotation process but also the process of speech acquisition can be automated to minimize the need for human and expert work. We demonstrate the proposed methodology on the Croatian language, for which the target AM has been built via cross-lingual adaptation of a Czech AM in two ways: a) using the commercially available GlobalPhone database, and b) by automatic speech data mining from the HRT radio archive. The latter approach is cost-free, yet it yields comparable or better results in LVCSR experiments conducted on three Croatian test sets.

Keywords

Speech recognition, acoustic model, cross-lingual adaptation, Slavic languages

Citation

Radioengineering. 2013, vol. 22, no. 3, pp. 866-873. ISSN 1210-2512
http://www.radioeng.cz/fulltexts/2013/13_03_0866_0873.pdf

Document type

Peer-reviewed

Document version

Published version

Language of document

en

URI

http://hdl.handle.net/11012/36938

Collections

2013/3

Creative Commons license

Except where otherwise noted, this item's license is described as Creative Commons Attribution 3.0 Unported License
