Psychological Stress Detection in Speech Using Return-to-opening Phase Ratios in Glottis

dc.contributor.authorStaněk, Miroslavcs
dc.contributor.authorSigmund, Milancs
dc.coverage.issue5cs
dc.coverage.volume21cs
dc.date.issued2015-10-12cs
dc.description.abstractThis paper is focused on investigation of psychological stress in speech signal using shapes of normalised glottal pulses. The pulses were estimated by two algorithms: the Direct Inverse Filtering and Iterative and Adaptive Inverse Filtering. Normalised glottal pulses are divided into opening and return phase, and a feature vector characterizing each glottal pulse is calculated for a series of n percentage interval in time domain. Each feature vector is created by parameters describing its return to opening phase ratio, namely chosen intervals, kurtosis, skewness, and area. Further, psychological stress is detected by feature vector and four different classifiers. Experimental results show, that the best accuracy approaching 95 % is reached with Gaussian Mixture Models classifier. All the best results were obtained regarding only the interval of 5 % from both phase durations, i.e. for and after pulse peak, where the most significant differences between normal and stressed speech in feature vector are occurred. Presented experiments were performed on our own speech database containing both real stressed speech and normal speech.en
dc.description.abstractThis paper is focused on investigation of psychological stress in speech signal using shapes of normalised glottal pulses. The pulses were estimated by two algorithms: the Direct Inverse Filtering and Iterative and Adaptive Inverse Filtering. Normalised glottal pulses are divided into opening and return phase, and a feature vector characterizing each glottal pulse is calculated for a series of n percentage interval in time domain. Each feature vector is created by parameters describing its return to opening phase ratio, namely chosen intervals, kurtosis, skewness, and area. Further, psychological stress is detected by feature vector and four different classifiers. Experimental results show, that the best accuracy approaching 95 % is reached with Gaussian Mixture Models classifier. All the best results were obtained regarding only the interval of 5 % from both phase durations, i.e. for and after pulse peak, where the most significant differences between normal and stressed speech in feature vector are occurred. Presented experiments were performed on our own speech database containing both real stressed speech and normal speech.en
dc.formattextcs
dc.format.extent59-63cs
dc.format.mimetypeapplication/pdfcs
dc.identifier.citationElektronika Ir Elektrotechnika. 2015, vol. 21, issue 5, p. 59-63.en
dc.identifier.doi10.5755/j01.eee.21.5.13336cs
dc.identifier.issn1392-1215cs
dc.identifier.orcid0000-0003-3973-3626cs
dc.identifier.other117402cs
dc.identifier.researcheridAAM-3483-2020cs
dc.identifier.scopus7004163486cs
dc.identifier.urihttp://hdl.handle.net/11012/194765
dc.language.isoencs
dc.publisherKaunas University of Technologycs
dc.relation.ispartofElektronika Ir Elektrotechnikacs
dc.relation.urihttp://www.eejournal.ktu.lt/index.php/elt/article/view/13336cs
dc.rightsCreative Commons Attribution 4.0 Internationalcs
dc.rights.accessopenAccesscs
dc.rights.sherpahttp://www.sherpa.ac.uk/romeo/issn/1392-1215/cs
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/cs
dc.subjectemotion recognitionen
dc.subjectglottal pulsesen
dc.subjectpsychological stressen
dc.subjectspeech processingen
dc.subjectemotion recognition
dc.subjectglottal pulses
dc.subjectpsychological stress
dc.subjectspeech processing
dc.titlePsychological Stress Detection in Speech Using Return-to-opening Phase Ratios in Glottisen
dc.title.alternativePsychological Stress Detection in Speech Using Return-to-opening Phase Ratios in Glottisen
dc.type.driverarticleen
dc.type.statusPeer-revieweden
dc.type.versionpublishedVersionen
sync.item.dbidVAV-117402en
sync.item.dbtypeVAVen
sync.item.insts2025.10.14 14:11:18en
sync.item.modts2025.10.14 09:49:31en
thesis.grantorVysoké učení technické v Brně. Fakulta elektrotechniky a komunikačních technologií. Ústav radioelektronikycs

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
13336Article Text3846711020151012.pdf
Size:
1.14 MB
Format:
Adobe Portable Document Format
Description:
13336Article Text3846711020151012.pdf