Comprehensive Multiparametric Analysis of Human Deepfake Speech Recognition
| dc.contributor.author | Malinka, Kamil | cs |
| dc.contributor.author | Firc, Anton | cs |
| dc.contributor.author | Šalko, Milan | cs |
| dc.contributor.author | Prudký, Daniel | cs |
| dc.contributor.author | Radačovská, Karolína | cs |
| dc.contributor.author | Hanáček, Petr | cs |
| dc.coverage.issue | 24 | cs |
| dc.coverage.volume | 2024 | cs |
| dc.date.issued | 2024-08-30 | cs |
| dc.description.abstract | In this paper, we undertake a novel two-pronged investigation into the human recognition of deepfake speech, addressing critical gaps in existing research. First, we pioneer an evaluation of the impact of prior information on deepfake recognition, setting our work apart by simulating real-world attack scenarios where individuals are not informed in advance of deepfake exposure. This approach simulates the unpredictability of real-world deepfake attacks, providing unprecedented insights into human vulnerability under realistic conditions. Second, we introduce a novel metric to evaluate the quality of deepfake audio. This metric facilitates a deeper exploration into how the quality of deepfake speech influences human detection accuracy. By examining both the effect of prior knowledge about deepfakes and the role of deepfake speech quality, our research reveals the importance of these factors, contributes to understanding human vulnerability to deepfakes, and suggests measures to enhance human detection skills. | en |
| dc.description.abstract | In this paper, we undertake a novel two-pronged investigation into the human recognition of deepfake speech, addressing critical gaps in existing research. First, we pioneer an evaluation of the impact of prior information on deepfake recognition, setting our work apart by simulating real-world attack scenarios where individuals are not informed in advance of deepfake exposure. This approach simulates the unpredictability of real-world deepfake attacks, providing unprecedented insights into human vulnerability under realistic conditions. Second, we introduce a novel metric to evaluate the quality of deepfake audio. This metric facilitates a deeper exploration into how the quality of deepfake speech influences human detection accuracy. By examining both the effect of prior knowledge about deepfakes and the role of deepfake speech quality, our research reveals the importance of these factors, contributes to understanding human vulnerability to deepfakes, and suggests measures to enhance human detection skills. | en |
| dc.format | text | cs |
| dc.format.extent | 1-25 | cs |
| dc.format.mimetype | application/pdf | cs |
| dc.identifier.citation | EURASIP Journal on Image and Video Processing. 2024, vol. 2024, issue 24, p. 1-25. | en |
| dc.identifier.doi | 10.1186/s13640-024-00641-4 | cs |
| dc.identifier.issn | 1687-5176 | cs |
| dc.identifier.orcid | 0000-0002-9009-2193 | cs |
| dc.identifier.orcid | 0000-0002-4717-1910 | cs |
| dc.identifier.orcid | 0009-0004-9604-168X | cs |
| dc.identifier.orcid | 0000-0001-5507-0768 | cs |
| dc.identifier.other | 189344 | cs |
| dc.identifier.researcherid | AAB-5046-2022 | cs |
| dc.identifier.researcherid | HJP-8074-2023 | cs |
| dc.identifier.scopus | 24824985000 | cs |
| dc.identifier.scopus | 57699371300 | cs |
| dc.identifier.scopus | 6508388287 | cs |
| dc.identifier.uri | http://hdl.handle.net/11012/252348 | |
| dc.language.iso | en | cs |
| dc.publisher | Springer Nature | cs |
| dc.relation.ispartof | EURASIP Journal on Image and Video Processing | cs |
| dc.relation.uri | https://jivp-eurasipjournals.springeropen.com/articles/10.1186/s13640-024-00641-4 | cs |
| dc.rights | Creative Commons Attribution 4.0 International | cs |
| dc.rights.access | openAccess | cs |
| dc.rights.sherpa | http://www.sherpa.ac.uk/romeo/issn/1687-5176/ | cs |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | cs |
| dc.subject | Deepfake | en |
| dc.subject | Synthetic speech | en |
| dc.subject | Deepfake detection | en |
| dc.subject | Human perception | en |
| dc.subject | Speech quality | en |
| dc.subject | Cybersecurity<br> | en |
| dc.subject | Deepfake | |
| dc.subject | Synthetic speech | |
| dc.subject | Deepfake detection | |
| dc.subject | Human perception | |
| dc.subject | Speech quality | |
| dc.subject | Cybersecurity<br> | |
| dc.title | Comprehensive Multiparametric Analysis of Human Deepfake Speech Recognition | en |
| dc.title.alternative | Comprehensive Multiparametric Analysis of Human Deepfake Speech Recognition | en |
| dc.type.driver | article | en |
| dc.type.status | Peer-reviewed | en |
| dc.type.version | publishedVersion | en |
| sync.item.dbid | VAV-189344 | en |
| sync.item.dbtype | VAV | en |
| sync.item.insts | 2025.10.14 14:13:17 | en |
| sync.item.modts | 2025.10.14 09:47:28 | en |
| thesis.grantor | Vysoké učení technické v Brně. Fakulta informačních technologií. Ústav inteligentních systémů | cs |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- s13640024006414.pdf
- Size:
- 1.78 MB
- Format:
- Adobe Portable Document Format
- Description:
- file s13640024006414.pdf
