Skip to main content

Pure Python, lightweight, Pillow-based solver for Amazon\s text captcha.

Project description

The motivation behind the creation of this library is taking its start from the genuinely simple idea: "I don't want to use pytesseract or some other non-amazon-specific OCR services, nor do I want to install some executables to just solve a captcha. I desire to get a solution within 1-2 lines of code without any heavy add-ons, using a pure Python."


Pure Python, lightweight, Pillow-based solver for Amazon's text captcha.

Accuracy Timing Size Version Python version Downloads

Recent News

  • October 8, 2020: tested and approved compatibility with Chromedriver 86.0.4240.22
  • October 7, 2020: tested and approved compatibility with Python 3.9
  • September 20, 2020: dropped support for Python 3.5

Installation

pip install amazoncaptcha

Quick Snippet

An example of the constructor usage. Scroll a bit down to see some tasty class methods.

from amazoncaptcha import AmazonCaptcha

captcha = AmazonCaptcha('captcha.jpg')
solution = captcha.solve()

# Or: solution = AmazonCaptcha('captcha.jpg').solve()

Status

Status Build Status codecov Requirements Status CodeFactor Grade Docs

Usage and Class Methods

Browsing Amazon using selenium and stuck on captcha? The class method below will do all the "dirty" work of extracting an image from the webpage for you. Practically, it takes a screenshot from your webdriver, crops the captcha, and stores it into bytes array, which is then used to create an AmazonCaptcha instance. This also means avoiding any local savings.

from amazoncaptcha import AmazonCaptcha
from selenium import webdriver

driver = webdriver.Chrome() # This is a simplified example
driver.get('https://www.amazon.com/errors/validateCaptcha')

captcha = AmazonCaptcha.fromdriver(driver)
solution = captcha.solve()

If you are not using selenium or the previous method is not just the case for you, it is possible to use a captcha link directly. This class method will request the url, check the content type and store the response content into bytes array to create an instance of AmazonCaptcha.

from amazoncaptcha import AmazonCaptcha

link = 'https://images-na.ssl-images-amazon.com/captcha/usvmgloq/Captcha_kwrrnqwkph.jpg'

captcha = AmazonCaptcha.fromlink(link)
solution = captcha.solve()

In addition, if you are a machine learning or neural network developer and are looking for some training data, check this repository, which was created to store images and other non-script data for the solver.

Help the Development

If you are willing to help the development, consider setting keep_logs argument of the solve method to True. Here is the example, if you are using fromdriver class method. If set to True, all the links of the unsolved captcha will be stored, so later you can open the issue and send the logs.

from amazoncaptcha import AmazonCaptcha
from selenium import webdriver

driver = webdriver.Chrome() # This is a simplified example
driver.get('https://www.amazon.com/errors/validateCaptcha')

captcha = AmazonCaptcha.fromdriver(driver)
solution = captcha.solve(keep_logs=True)

If you have any suggestions or ideas of additional instances and methods, which you would like to see in this library, please, feel free to contact the owner via email or fork'n'pull to repository. Any contribution is highly appreciated!

Additional

  • If you want to see the History of Changes, Code of Conduct, Contributing Policy, or License, use these inline links to navigate based on your needs.
  • If you are facing any errors, please, report your situation via an issue.
  • This project is for educational and research purposes only. Any actions and/or activities related to the material contained on this GitHub Repository is solely your responsibility. The author will not be held responsible in the event any criminal charges be brought against any individuals misusing the information in this GitHub Repository to break the law.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

amazoncaptcha-0.4.7.tar.gz (877.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

amazoncaptcha-0.4.7-py3-none-any.whl (937.3 kB view details)

Uploaded Python 3

File details

Details for the file amazoncaptcha-0.4.7.tar.gz.

File metadata

  • Download URL: amazoncaptcha-0.4.7.tar.gz
  • Upload date:
  • Size: 877.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.8.6

File hashes

Hashes for amazoncaptcha-0.4.7.tar.gz
Algorithm Hash digest
SHA256 c28f730a76e061c45d6269670bedc1e3032afa03fbca6de51dc47e9a1b1afdd4
MD5 aec38a2a6254ddc9041d1923f33a123b
BLAKE2b-256 c24653509f441927db36efd24137512630e92bd7578c0a5134e3c079816f01ef

See more details on using hashes here.

File details

Details for the file amazoncaptcha-0.4.7-py3-none-any.whl.

File metadata

  • Download URL: amazoncaptcha-0.4.7-py3-none-any.whl
  • Upload date:
  • Size: 937.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.8.6

File hashes

Hashes for amazoncaptcha-0.4.7-py3-none-any.whl
Algorithm Hash digest
SHA256 462e29e470e242a16ac76d0b6ae52a7e953674a87b4de0c9ed13fac29d706b16
MD5 40608702a9d818513949eebd6905465b
BLAKE2b-256 08ef50b1bf9e3e13bd147d1f1927617e887fa08b6bc5cba16843629f460d8f58

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page