Make Complex CAPTCHAs Simple: A Fast Text Captcha Solver Based on a Small Number of Samples

Yao Wang, Yuliang Wei, Mingjin Zhang, Yang Liu, Bailing Wang
DOI: https://doi.org/10.1016/j.ins.2021.07.040
IF: 8.1
2021-01-01
Information Sciences
Abstract: Text-based captchas are still widely used by many websites, such as Wikipedia and Microsoft, despite the emergence of many alternative captchas. Recently, the design of text-based captchas has become increasingly complex to resist attacks from automatic cracking programs. However, most existing captcha-solving methods have shortcomings, such as insufficient accuracy, poor generalization performance, and the need for a large number of labeled samples. This study proposes a fast captcha solver that can effectively break text-based captchas with complex security features using a small amount of labeled data. The solver was built by constructing a captcha transformation model based on generative adversarial networks to simplify the captcha images before character segmentation and recognition. Results showed that the proposed captcha solver achieved over 96% character accuracy and 74% captcha accuracy for all evaluated schemes. Moreover, the average time to process a single captcha image using a laptop GPU was only 4-8 ms. The effectiveness of this work may encourage captcha designers to consider a more secure human-machine distinction mechanism. (c) 2021 Elsevier Inc. All rights reserved.
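The abstract reports two different metrics, character accuracy and captcha accuracy, without defining them. The sketch below shows one common way to compute both, assuming character accuracy is the fraction of ground-truth characters predicted correctly at the same position, and captcha accuracy is the fraction of images whose whole string is correct. These definitions, the function name `captcha_metrics`, and the sample strings are illustrative assumptions, not taken from the paper.

```python
from typing import Sequence, Tuple


def captcha_metrics(predictions: Sequence[str], labels: Sequence[str]) -> Tuple[float, float]:
    """Return (character_accuracy, captcha_accuracy) for paired predictions and labels.

    Assumed definitions (not confirmed by the abstract):
      - character accuracy: position-wise matches divided by total ground-truth characters
      - captcha accuracy: exact string matches divided by number of captchas
    """
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")

    total_chars = sum(len(truth) for truth in labels)
    if total_chars == 0:
        raise ValueError("labels must contain at least one character")

    correct_chars = 0
    correct_captchas = 0
    for pred, truth in zip(predictions, labels):
        # zip stops at the shorter string, so missing or extra characters earn no credit.
        correct_chars += sum(p == t for p, t in zip(pred, truth))
        if pred == truth:
            correct_captchas += 1

    return correct_chars / total_chars, correct_captchas / len(labels)


# Hypothetical example: one wrong character in the second captcha.
char_acc, captcha_acc = captcha_metrics(
    ["abcd", "abxd", "wxyz"],
    ["abcd", "abcd", "wxyz"],
)
print(f"character accuracy: {char_acc:.3f}, captcha accuracy: {captcha_acc:.3f}")
```

Under these definitions a single wrong character fails the whole captcha, which is why captcha accuracy can sit well below character accuracy. As a rough illustration only, if per-character errors were independent, 96% character accuracy on a hypothetical 7-character captcha would give about 0.96^7 ≈ 0.75 captcha accuracy.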