Analysis of Closing-To-Opening Phase Ratio in Top-To-Bottom Glottal Pulse Segmentation for Psychological Stress Detection
| dc.contributor.author | Staněk, Miroslav | cs |
| dc.contributor.author | Sigmund, Milan | cs |
| dc.coverage.issue | 5 | cs |
| dc.coverage.volume | 22 | cs |
| dc.date.issued | 2016-10-10 | cs |
| dc.description.abstract | This paper is focused on investigating the differences in glottal pulses estimated by two algorithms; Direct Inverse Filtering (DIF) and Iterative and Adaptive Inverse Filtering (IAIF) for normal and stressed speech. Individual glottal pulses are mined from recorded speech signal and then normalized in two dimensions. Each normalized pulse is divided into a closing and opening phase and further segmented into npercentage sectors in Top-To-Bottom (TTB) amplitude domain. Three parameters, the kurtosis, skewness and pulse area, as well as their Closing-To-Opening phase ratios, are analysed. Designed GMM classifier is trained on speakers from Czech ExamStress database a further applied on other part of ExamStress database and also for English database SUSAS to investigate the independency of presented approach on spoken language and speech signal quality. The results achieved by DIF indicate independency on language and records quality (contrary to methods using IAIF). The best npercentage sectors in the TTB segments can be seen between 5 % and 40 %. In this case, methods based on DIF reached a psychological stress recognition efficiency of 88.5 % in average. The average stress detection efficiency of methods based on IAIF approached 73.3 %. | en |
| dc.description.abstract | This paper is focused on investigating the differences in glottal pulses estimated by two algorithms; Direct Inverse Filtering (DIF) and Iterative and Adaptive Inverse Filtering (IAIF) for normal and stressed speech. Individual glottal pulses are mined from recorded speech signal and then normalized in two dimensions. Each normalized pulse is divided into a closing and opening phase and further segmented into npercentage sectors in Top-To-Bottom (TTB) amplitude domain. Three parameters, the kurtosis, skewness and pulse area, as well as their Closing-To-Opening phase ratios, are analysed. Designed GMM classifier is trained on speakers from Czech ExamStress database a further applied on other part of ExamStress database and also for English database SUSAS to investigate the independency of presented approach on spoken language and speech signal quality. The results achieved by DIF indicate independency on language and records quality (contrary to methods using IAIF). The best npercentage sectors in the TTB segments can be seen between 5 % and 40 %. In this case, methods based on DIF reached a psychological stress recognition efficiency of 88.5 % in average. The average stress detection efficiency of methods based on IAIF approached 73.3 %. | en |
| dc.format | text | cs |
| dc.format.extent | 79-83 | cs |
| dc.format.mimetype | application/pdf | cs |
| dc.identifier.citation | Elektronika Ir Elektrotechnika. 2016, vol. 22, issue 5, p. 79-83. | en |
| dc.identifier.doi | 10.5755/j01.eie.22.5.16348 | cs |
| dc.identifier.issn | 1392-1215 | cs |
| dc.identifier.orcid | 0000-0003-3973-3626 | cs |
| dc.identifier.other | 128773 | cs |
| dc.identifier.researcherid | AAM-3483-2020 | cs |
| dc.identifier.scopus | 7004163486 | cs |
| dc.identifier.uri | http://hdl.handle.net/11012/203146 | |
| dc.language.iso | en | cs |
| dc.publisher | Kaunas University of Technology | cs |
| dc.relation.ispartof | Elektronika Ir Elektrotechnika | cs |
| dc.relation.uri | http://www.eejournal.ktu.lt/index.php/elt/article/view/16348 | cs |
| dc.rights | Creative Commons Attribution 4.0 International | cs |
| dc.rights.access | openAccess | cs |
| dc.rights.sherpa | http://www.sherpa.ac.uk/romeo/issn/1392-1215/ | cs |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | cs |
| dc.subject | Analysis of speaker state | en |
| dc.subject | psychological stress detection | en |
| dc.subject | glottal pulse analysis | en |
| dc.subject | closing-to-opening phase ratio | en |
| dc.subject | Analysis of speaker state | |
| dc.subject | psychological stress detection | |
| dc.subject | glottal pulse analysis | |
| dc.subject | closing-to-opening phase ratio | |
| dc.title | Analysis of Closing-To-Opening Phase Ratio in Top-To-Bottom Glottal Pulse Segmentation for Psychological Stress Detection | en |
| dc.title.alternative | Analysis of Closing-To-Opening Phase Ratio in Top-To-Bottom Glottal Pulse Segmentation for Psychological Stress Detection | en |
| dc.type.driver | article | en |
| dc.type.status | Peer-reviewed | en |
| dc.type.version | publishedVersion | en |
| sync.item.dbid | VAV-128773 | en |
| sync.item.dbtype | VAV | en |
| sync.item.insts | 2025.10.14 14:08:24 | en |
| sync.item.modts | 2025.10.14 10:52:07 | en |
| thesis.grantor | Vysoké učení technické v Brně. Fakulta elektrotechniky a komunikačních technologií. Ústav radioelektroniky | cs |
| thesis.grantor | Vysoké učení technické v Brně. Fakulta elektrotechniky a komunikačních technologií. oddělení-REL-SIX | cs |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- 16348Article Text4800511020161007.pdf
- Size:
- 1.12 MB
- Format:
- Adobe Portable Document Format
- Description:
- 16348Article Text4800511020161007.pdf
