Direct Gene Detection in Raw Nanopore Signals Using Transformer Neural Networks

Loading...
Thumbnail Image

Date

Authors

Vorochta, Jevhenij
Vítková, Helena
Jakubíček, Roman

Advisor

Referee

Mark

Journal Title

Journal ISSN

Volume Title

Publisher

Vysoké učení technické v Brně, Fakulta elektrotechniky a komunikačních technologií

ORCID

Altmetrics

Abstract

Nanopore sequencing has transformed genomics by enabling real-time analysis of DNA and RNA in a compact, cost-effective device. However, conventional workflows require a separate basecalling step to convert raw electrical signals into nucleotide sequences, which can introduce errors and delay downstream analyses such as gene detection. Here, we present a novel approach that bypasses basecalling by directly analyzing raw nanopore signals using a transformer-based neural network. By adapting a model originally designed for ECG classification, we developed a system capable of detecting specific antibiotic resistance genes in Klebsiella pneumoniae samples. Raw signals were preprocessed through downsampling, z-normalization, and segmentation into 5,000-sample windows, yielding a dataset of 13,080 labeled segments. Experimental results demonstrate that our model effectively distinguishes gene-containing segments from non-target signals, achieving up to 80% accuracy in the “no target gene” category. In contrast, accuracy for other gene categories was lower, indicating that further optimization of the model is required. This direct-signal approach not only reduces the computational burden associated with basecalling but also streamlines the workflow, promising faster diagnostic turnaround times. These findings provide a significant step toward integrating advanced deep learning methods with nanopore sequencing for rapid, on-site genomic analysis and have potential applications in clinical diagnostics and epidemiological surveillance.

Description

Citation

Proceedings II of the 31st Conference STUDENT EEICT 2025: Selected papers. s. 13-16. ISBN 978-80-214-6320-2
https://www.eeict.cz/eeict_download/archiv/sborniky/EEICT_2025_sbornik_2.pdf

Document type

Peer-reviewed

Document version

Published version

Date of access to the full text

Language of document

en

Study field

Comittee

Date of acceptance

Defence

Result of defence

Endorsement

Review

Supplemented By

Referenced By

Citace PRO