Skip to main content

Pure Python, lightweight, Pillow-based solver for the Amazon's text captcha.

Project description

Amazon Captcha Solver

Motivation behind creation of this library is taking its start from the genuinely simple idea: "I don't want to use pytesseract or some other non-amazon-specific OCR services, nor do I want to install some executables to just solve a captcha. My desire is to get a solution within 1-2 lines of code without any heavy add-ons. Using a pure Python."


Pure Python, lightweight, Pillow-based solver for the Amazon's text captcha.

Accuracy Timing Size Version Python version Downloads

Installation

pip install amazoncaptcha

Quick Snippet

from amazoncaptcha import AmazonCaptcha

captcha = AmazonCaptcha('captcha.jpg')
solution = captcha.solve()

# Or: solution = AmazonCaptcha('captcha.jpg').solve()

Status

Status Build Status codecov Requirements Status CodeFactor Grade Implementation

Popular Usage

from amazoncaptcha import AmazonCaptcha
from selenium import webdriver

driver = webdriver.Chrome()
driver.get('https://www.amazon.com/errors/validateCaptcha')

captcha = AmazonCaptcha.from_webdriver(driver)
solution = captcha.solve()

For Whom?

  • Data extraction and web scraping specialists could use this tool, obviously, to bypass the Amazon captcha.
  • Machine learning developers could use captchas folder (currently contains 13000 unique solved captchas) based on a demand.

Issues

  • If you constantly receive 'Not solved' output, feel free to create an issue and describe details.
  • If you received an output, different from solution itself or 'Not solved', please, create an issue or contact me.
  • If you've somehow met an Exception, which you don't understand - you know what to do :)

Changes

  • Version 0.0.10:
    1. Reached 10000 training samples.
    2. Reached 90%+ accuracy.
  • Version 0.0.11:
    1. Fixed error with captcha images that were taken from BytesIO.
  • Version 0.0.12:
    1. Code adjustments and improvements.
    2. Program can now solve images where last letter is corrupted.
  • Version 0.0.13:
    1. Added and tested 'from_webdriver' classmethod.
  • Version 0.1.0:
    1. 100,000 captchas crash test, accuracy is 98.5%.
  • Version 0.1.1 - 0.1.5:
    1. Code adjustments and improvements.
    2. Added tests.
  • Version 0.2.0:
    1. Second crash test through 120k+ captchas.
    2. Accuracy increased to 99.1%
    3. Code coverage is 100%
  • Version 0.3.0:
    1. Program can now solve images where letters are intercepted.
    2. Third crash test through 140k+ captchas.
    3. Accuracy increased to 99.998%

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

amazoncaptcha-0.3.6.tar.gz (870.1 kB view hashes)

Uploaded Source

Built Distribution

amazoncaptcha-0.3.6-py3-none-any.whl (930.2 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page