Study and Application of Silence Model Adaptation for Use in Telephone Speech Recognition System

Authors

Novotny, Jan
Sovka, Pavel
Uhlir, Jan

Publisher

Společnost pro radioelektronické inženýrství

Abstract

This paper addresses the mismatch between a silence model and background noise that often occurs in telephone speech recognition system (SRS) applications. First, the use of parallel model combination (PMC) methods is studied with respect to this application. Second, the effectiveness of adapting a silence model to various background noises is confirmed. Finally, an original method is proposed that combines log-add PMC with noise power spectral density estimation based on minimum statistics. Tests show that the suggested method improves speech recognition results, a benefit attributed to more stable speech vector selection under various background noises. Its advantages are that it needs no extra voice activity detector and has a relatively low computational load.
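The two building blocks named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the record does not give the paper's feature domain, parameters, or update rule, so everything below is an assumption. The function names `log_add_combine` and `min_stats_noise_psd`, the fixed smoothing constant `alpha`, and the `window` length are hypothetical. The minimum-statistics tracker is deliberately simplified (recursive smoothing followed by a sliding-window minimum) and omits the bias compensation and time-varying smoothing used in full formulations. The log-add step uses the commonly cited approximation for log-spectral means, log(g·exp(μ_speech) + exp(μ_noise)).

```python
import math
from collections import deque


def log_add_combine(mu_model, mu_noise, gain=1.0):
    """Log-add approximation on log-spectral means, per channel:
    log(gain * exp(mu_model) + exp(mu_noise)).
    `gain` must be positive; it is an assumed level-matching term."""
    combined = []
    for s, n in zip(mu_model, mu_noise):
        a = s + math.log(gain)
        m = max(a, n)  # factor out the larger term for numerical stability
        combined.append(m + math.log(math.exp(a - m) + math.exp(n - m)))
    return combined


def min_stats_noise_psd(periodograms, alpha=0.85, window=50):
    """Simplified minimum-statistics tracker (assumption: fixed smoothing,
    no bias compensation). Each frame is a list of per-bin power values.
    Returns one noise PSD estimate per frame."""
    n_bins = len(periodograms[0])
    smoothed = list(periodograms[0])
    history = deque(maxlen=window)  # keeps only the last `window` smoothed frames
    estimates = []
    for frame in periodograms:
        # First-order recursive smoothing of the periodogram
        smoothed = [alpha * p + (1.0 - alpha) * x for p, x in zip(smoothed, frame)]
        history.append(smoothed)
        # Noise estimate: per-bin minimum over the sliding window
        estimates.append([min(h[k] for h in history) for k in range(n_bins)])
    return estimates
```

Under these assumptions, a silence-model mean could be adapted by taking the logarithm of the latest noise estimate and passing it as `mu_noise` to `log_add_combine`. Because the minimum is tracked continuously, no separate speech/pause decision is needed, which is consistent with the abstract's remark that no extra voice activity detector is required.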

Citation

Radioengineering. 2004, vol. 13, no. 3, pp. 1-6. ISSN 1210-2512
http://www.radioeng.cz/fulltexts/2004/04_03_01_06.pdf

Document type

Peer-reviewed

Document version

Published version

Language of document

en

Creative Commons license

Except where otherwise noted, this item's license is described as Creative Commons Attribution 3.0 Unported License