Comprehensive Multiparametric Analysis of Human Deepfake Speech Recognition

Malinka, Kamil; Firc, Anton; Šalko, Milan; Prudký, Daniel; Radačovská, Karolína; Hanáček, Petr

doi:10.1186/s13640-024-00641-4

Comprehensive Multiparametric Analysis of Human Deepfake Speech Recognition

dc.contributor.author	Malinka, Kamil	cs
dc.contributor.author	Firc, Anton	cs
dc.contributor.author	Šalko, Milan	cs
dc.contributor.author	Prudký, Daniel	cs
dc.contributor.author	Radačovská, Karolína	cs
dc.contributor.author	Hanáček, Petr	cs
dc.coverage.issue	24	cs
dc.coverage.volume	2024	cs
dc.date.issued	2024-08-30	cs
dc.description.abstract	In this paper, we undertake a novel two-pronged investigation into the human recognition of deepfake speech, addressing critical gaps in existing research. First, we pioneer an evaluation of the impact of prior information on deepfake recognition, setting our work apart by simulating real-world attack scenarios where individuals are not informed in advance of deepfake exposure. This approach simulates the unpredictability of real-world deepfake attacks, providing unprecedented insights into human vulnerability under realistic conditions. Second, we introduce a novel metric to evaluate the quality of deepfake audio. This metric facilitates a deeper exploration into how the quality of deepfake speech influences human detection accuracy. By examining both the effect of prior knowledge about deepfakes and the role of deepfake speech quality, our research reveals the importance of these factors, contributes to understanding human vulnerability to deepfakes, and suggests measures to enhance human detection skills.	en
dc.description.abstract	In this paper, we undertake a novel two-pronged investigation into the human recognition of deepfake speech, addressing critical gaps in existing research. First, we pioneer an evaluation of the impact of prior information on deepfake recognition, setting our work apart by simulating real-world attack scenarios where individuals are not informed in advance of deepfake exposure. This approach simulates the unpredictability of real-world deepfake attacks, providing unprecedented insights into human vulnerability under realistic conditions. Second, we introduce a novel metric to evaluate the quality of deepfake audio. This metric facilitates a deeper exploration into how the quality of deepfake speech influences human detection accuracy. By examining both the effect of prior knowledge about deepfakes and the role of deepfake speech quality, our research reveals the importance of these factors, contributes to understanding human vulnerability to deepfakes, and suggests measures to enhance human detection skills.	en
dc.format	text	cs
dc.format.extent	1-25	cs
dc.format.mimetype	application/pdf	cs
dc.identifier.citation	EURASIP Journal on Image and Video Processing. 2024, vol. 2024, issue 24, p. 1-25.	en
dc.identifier.doi	10.1186/s13640-024-00641-4	cs
dc.identifier.issn	1687-5176	cs
dc.identifier.orcid	0000-0002-9009-2193	cs
dc.identifier.orcid	0000-0002-4717-1910	cs
dc.identifier.orcid	0009-0004-9604-168X	cs
dc.identifier.orcid	0000-0001-5507-0768	cs
dc.identifier.other	189344	cs
dc.identifier.researcherid	AAB-5046-2022	cs
dc.identifier.researcherid	HJP-8074-2023	cs
dc.identifier.scopus	24824985000	cs
dc.identifier.scopus	57699371300	cs
dc.identifier.scopus	6508388287	cs
dc.identifier.uri	http://hdl.handle.net/11012/252348
dc.language.iso	en	cs
dc.publisher	Springer Nature	cs
dc.relation.ispartof	EURASIP Journal on Image and Video Processing	cs
dc.relation.uri	https://jivp-eurasipjournals.springeropen.com/articles/10.1186/s13640-024-00641-4	cs
dc.rights	Creative Commons Attribution 4.0 International	cs
dc.rights.access	openAccess	cs
dc.rights.sherpa	http://www.sherpa.ac.uk/romeo/issn/1687-5176/	cs
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/	cs
dc.subject	Deepfake	en
dc.subject	Synthetic speech	en
dc.subject	Deepfake detection	en
dc.subject	Human perception	en
dc.subject	Speech quality	en
dc.subject	Cybersecurity<br>	en
dc.subject	Deepfake
dc.subject	Synthetic speech
dc.subject	Deepfake detection
dc.subject	Human perception
dc.subject	Speech quality
dc.subject	Cybersecurity<br>
dc.title	Comprehensive Multiparametric Analysis of Human Deepfake Speech Recognition	en
dc.title.alternative	Comprehensive Multiparametric Analysis of Human Deepfake Speech Recognition	en
dc.type.driver	article	en
dc.type.status	Peer-reviewed	en
dc.type.version	publishedVersion	en
sync.item.dbid	VAV-189344	en
sync.item.dbtype	VAV	en
sync.item.insts	2025.10.14 14:13:17	en
sync.item.modts	2025.10.14 09:47:28	en
thesis.grantor	Vysoké učení technické v Brně. Fakulta informačních technologií. Ústav inteligentních systémů	cs

Files

Original bundle

Now showing 1 - 1 of 1

Name:: s13640024006414.pdf
Size:: 1.78 MB
Format:: Adobe Portable Document Format
Description:: file s13640024006414.pdf

Download

Collections

Ústav inteligentních systémů