Automatic Text-Independent Artifact Detection, Localization, and Classification in the Synthetic Speech

Pribil, Jiri; Pribilova, Anna; Matousek, Jindrich

doi:10.13164/re.2017.1151

dc.contributor.author	Pribil, Jiri
dc.contributor.author	Pribilova, Anna
dc.contributor.author	Matousek, Jindrich
dc.coverage.issue	4	cs
dc.coverage.volume	26	cs
dc.date.accessioned	2018-06-18T10:29:50Z
dc.date.available	2018-06-18T10:29:50Z
dc.date.issued	2017-12	cs
dc.description.abstract	The paper describes experiments with statistical approaches to the automatic detection, localization, and classification of basic artifact types in synthetic speech produced by a Czech text-to-speech system using the unit selection method. The first experiment addresses artifact detection by analysis of variance (ANOVA) and hypothesis testing. The second experiment focuses on localizing the detected artifacts with Gaussian mixture models (GMM). Finally, the developed open-set artifact classifier is described. The influence of the feature vector length and structure on the resulting artifact detection accuracy is analyzed together with other factors affecting the stability of the artifact detection process. Further investigation showed that the number of mixtures and the type of covariance matrix have a relatively strong influence on the artifact classification error rate as well as on the computational complexity. The experimental results confirm the functionality of the artifact detector based on ANOVA and hypothesis tests, and of the GMM-based artifact localizer and classifier. The described statistical approaches offer alternatives to standard listening tests and manual labeling of artifacts.	en
dc.format	text	cs
dc.format.extent	1151-1160	cs
dc.format.mimetype	application/pdf	en
dc.identifier.citation	Radioengineering. 2017, vol. 26, no. 4, pp. 1151-1160. ISSN 1210-2512	cs
dc.identifier.doi	10.13164/re.2017.1151	en
dc.identifier.issn	1210-2512
dc.identifier.uri	http://hdl.handle.net/11012/82947
dc.language.iso	en	cs
dc.publisher	Společnost pro radioelektronické inženýrství	cs
dc.relation.ispartof	Radioengineering	cs
dc.relation.uri	http://www.radioeng.cz/fulltexts/2017/17_01_1151_1160.pdf	cs
dc.rights	Creative Commons Attribution 4.0 International	en
dc.rights.access	openAccess	en
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/	en
dc.subject	Quality of synthetic speech	en
dc.subject	analysis of variances (ANOVA)	en
dc.subject	Gaussian mixture models (GMM) classification	en
dc.subject	text-to-speech (TTS) system	en
dc.title	Automatic Text-Independent Artifact Detection, Localization, and Classification in the Synthetic Speech	en
dc.type.driver	article	en
dc.type.status	Peer-reviewed	en
dc.type.version	publishedVersion	en
eprints.affiliatedInstitution.faculty	Fakulta elektrotechniky a komunikačních technologií	cs

Files

Original bundle

Name: 17_04_1151_1160.pdf
Size: 426.46 KB
Format: Adobe Portable Document Format

Collections

2017/4