Semi-Supervised Deep Learning Approach For Breaking Geocaching Captchas

Loading...
Thumbnail Image

Date

Authors

Bostik, Ondrej

Advisor

Referee

Mark

Journal Title

Journal ISSN

Volume Title

Publisher

Vysoké učení technické v Brně, Fakulta elektrotechniky a komunikačních technologií

ORCID

Abstract

For nearly two decades, a substantial part of developed anti-abuse and anti-spam systems for web applications called CAPTCHA is based on imperfections in OCR (Optical Character Recognition) algorithms. But with improvements in Deep Learning in OCR, these systems are now obsolete. More and more systems can now break various text Captchas with great accuracy. Now with sufficient training dataset, almost every text-based Captcha scheme can be broken. The focus of this work is to present an idea of a semi-supervised method for reading text-based Captcha which needs only a small initial dataset. The main part of this article is dealing with the problem of training a deep learning system with only a small sample of target Captcha scheme via transfer learning.

Description

Citation

Proceedings II of the 26st Conference STUDENT EEICT 2020: Selected Papers. s. 166-170. ISBN 978-80-214-5868-0
https://conf.feec.vutbr.cz/eeict/EEICT2020

Document type

Peer-reviewed

Document version

Published version

Date of access to the full text

Language of document

en

Study field

Comittee

Date of acceptance

Defence

Result of defence

DOI

Endorsement

Review

Supplemented By

Referenced By

Citace PRO