Analysis of Closing-To-Opening Phase Ratio in Top-To-Bottom Glottal Pulse Segmentation for Psychological Stress Detection

dc.contributor.authorStaněk, Miroslavcs
dc.contributor.authorSigmund, Milancs
dc.coverage.issue5cs
dc.coverage.volume22cs
dc.date.accessioned2021-12-10T11:52:15Z
dc.date.available2021-12-10T11:52:15Z
dc.date.issued2016-10-10cs
dc.description.abstractThis paper is focused on investigating the differences in glottal pulses estimated by two algorithms; Direct Inverse Filtering (DIF) and Iterative and Adaptive Inverse Filtering (IAIF) for normal and stressed speech. Individual glottal pulses are mined from recorded speech signal and then normalized in two dimensions. Each normalized pulse is divided into a closing and opening phase and further segmented into npercentage sectors in Top-To-Bottom (TTB) amplitude domain. Three parameters, the kurtosis, skewness and pulse area, as well as their Closing-To-Opening phase ratios, are analysed. Designed GMM classifier is trained on speakers from Czech ExamStress database a further applied on other part of ExamStress database and also for English database SUSAS to investigate the independency of presented approach on spoken language and speech signal quality. The results achieved by DIF indicate independency on language and records quality (contrary to methods using IAIF). The best npercentage sectors in the TTB segments can be seen between 5 % and 40 %. In this case, methods based on DIF reached a psychological stress recognition efficiency of 88.5 % in average. The average stress detection efficiency of methods based on IAIF approached 73.3 %.en
dc.description.abstractČlánek se zabývá použitím nových příznaků hlasivkových pulsů TTB-CTO pro rozpoznávání stresového stavu mluvčího. Jako klasifikátor jsou zvoleny Gaussovské smíšené modely, a jazyková nezávislost je ověřena na anglicé databázi SUSAS.cs
dc.formattextcs
dc.format.extent79-83cs
dc.format.mimetypeapplication/pdfcs
dc.identifier.citationElektronika Ir Elektrotechnika. 2016, vol. 22, issue 5, p. 79-83.en
dc.identifier.doi10.5755/j01.eie.22.5.16348cs
dc.identifier.issn1392-1215cs
dc.identifier.other128773cs
dc.identifier.urihttp://hdl.handle.net/11012/203146
dc.language.isoencs
dc.publisherKaunas University of Technologycs
dc.relation.ispartofElektronika Ir Elektrotechnikacs
dc.relation.urihttp://www.eejournal.ktu.lt/index.php/elt/article/view/16348cs
dc.rightsCreative Commons Attribution 4.0 Internationalcs
dc.rights.accessopenAccesscs
dc.rights.sherpahttp://www.sherpa.ac.uk/romeo/issn/1392-1215/cs
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/cs
dc.subjectAnalysis of speaker stateen
dc.subjectpsychological stress detectionen
dc.subjectglottal pulse analysisen
dc.subjectclosing-to-opening phase ratioen
dc.subjectAnalýza stavu reproduktor
dc.subjectpsychické vypětí detekce
dc.subjectglotální pulzní analýza
dc.subjectzavírání a otevírání fáze poměr
dc.titleAnalysis of Closing-To-Opening Phase Ratio in Top-To-Bottom Glottal Pulse Segmentation for Psychological Stress Detectionen
dc.title.alternativeAnalýza TTB-CTO příznaků hlasivkových pulsů pro detekci stresu v řečics
dc.type.driverarticleen
dc.type.statusPeer-revieweden
dc.type.versionpublishedVersionen
sync.item.dbidVAV-128773en
sync.item.dbtypeVAVen
sync.item.insts2021.12.10 12:52:15en
sync.item.modts2021.12.10 12:14:25en
thesis.grantorVysoké učení technické v Brně. Fakulta elektrotechniky a komunikačních technologií. Ústav radioelektronikycs
thesis.grantorVysoké učení technické v Brně. Fakulta elektrotechniky a komunikačních technologií. oddělení-REL-SIXcs
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
16348Article Text4800511020161007.pdf
Size:
1.12 MB
Format:
Adobe Portable Document Format
Description:
16348Article Text4800511020161007.pdf