Pure Python, lightweight, Pillow-based solver for Amazon's text captcha.
Amazon Captcha Solver
The motivation behind this library comes from a genuinely simple idea: "I don't want to use pytesseract or any other non-Amazon-specific OCR service, nor do I want to install executables just to solve a captcha. I want a solution in 1-2 lines of code, without heavy add-ons, using pure Python."
Installation
pip install amazoncaptcha
Quick Snippet
from amazoncaptcha import AmazonCaptcha
captcha = AmazonCaptcha('captcha.jpg')
solution = captcha.solve()
# Or: solution = AmazonCaptcha('captcha.jpg').solve()
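Since `solve()` can return the string 'Not solved' (see Issues), calling code may want to separate that case from a real solution. Below is a minimal sketch of one way to do so. The helper name `solve_or_none` and its `solver` parameter are hypothetical and not part of the library; the parameter exists only so the helper can be exercised without a real image.

```python
from typing import Callable, Optional

# Sentinel string that solve() returns when it cannot read the image.
NOT_SOLVED = "Not solved"


def solve_or_none(image_path: str,
                  solver: Optional[Callable[[str], str]] = None) -> Optional[str]:
    """Return the captcha text, or None when the solver reports 'Not solved'.

    `solver` is a hypothetical hook (not part of amazoncaptcha) that maps an
    image path to a result string; by default the library itself is used.
    """
    if solver is None:
        # Default path: the same two-line usage as in the Quick Snippet.
        from amazoncaptcha import AmazonCaptcha
        solver = lambda path: AmazonCaptcha(path).solve()
    result = solver(image_path)
    return None if result == NOT_SOLVED else result
```

With this wrapper, `if (text := solve_or_none('captcha.jpg')) is not None:` reads more naturally than comparing against the sentinel string at every call site.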
Popular Usage
from amazoncaptcha import AmazonCaptcha
from selenium import webdriver
driver = webdriver.Chrome()
driver.get('https://www.amazon.com/errors/validateCaptcha')
captcha = AmazonCaptcha.from_webdriver(driver)
solution = captcha.solve()
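Once the captcha is solved, the text still has to be typed into the page. The sketch below shows one possible way to do that with Selenium 4. The function name `submit_solution` is hypothetical, and the default element id `captchacharacters` is an assumption about Amazon's form that should be checked against the live page.

```python
def submit_solution(driver, solution, field_id="captchacharacters"):
    """Type a solved captcha into the page's input field and submit the form.

    Returns False without touching the page when the captcha was not solved.
    `field_id` is an assumed element id, not something the library provides.
    """
    if solution == "Not solved":
        return False
    # "id" is the string value of Selenium's By.ID locator strategy.
    field = driver.find_element("id", field_id)
    field.send_keys(solution)
    # WebElement.submit() submits the form that contains the element.
    field.submit()
    return True
```

Called as `submit_solution(driver, solution)` right after the snippet above, it leaves the page untouched when the solver gave up, so the caller can reload the captcha and retry.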
For Whom?
- Data extraction and web scraping specialists can use this tool to get past the Amazon captcha.
- Machine learning developers can use the captchas folder (currently contains 13,000 unique solved captchas) on demand.
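For the machine learning use case, a folder of solved captchas can be turned into (image, label) pairs with the standard library alone. This is only a sketch: it assumes each file is named after its solution (for example `ABCDEF.jpg`), which should be verified against the actual folder layout. The function name `load_labeled_captchas` is hypothetical.

```python
from pathlib import Path
from typing import List, Tuple


def load_labeled_captchas(folder: str, suffix: str = ".jpg") -> List[Tuple[Path, str]]:
    """Pair each image file with a label taken from its file name.

    Assumption: the file stem is the solution, e.g. 'ABCDEF.jpg' -> 'ABCDEF'.
    """
    pairs = []
    for path in sorted(Path(folder).glob(f"*{suffix}")):
        pairs.append((path, path.stem))
    return pairs
```

The resulting list can be fed to any training pipeline, opening each path with Pillow's `Image.open` as needed.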
Issues
- If you constantly receive 'Not solved' as output, feel free to create an issue and describe the details.
- If you receive an output that is neither a solution nor 'Not solved', please create an issue or contact me.
- If you run into an exception you don't understand, you know what to do :)
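The three cases above can be told apart programmatically before deciding whether to file an issue. The sketch below assumes that a valid solution is exactly six uppercase ASCII letters; that format is an assumption about Amazon's text captcha rather than something the library guarantees, and `classify_output` is a hypothetical helper.

```python
import re

NOT_SOLVED = "Not solved"
# Assumption: a valid solution is exactly six uppercase ASCII letters.
SOLUTION_PATTERN = re.compile(r"[A-Z]{6}")


def classify_output(output) -> str:
    """Return 'solved', 'not_solved', or 'unexpected' for a solve() result."""
    if output == NOT_SOLVED:
        return "not_solved"
    if isinstance(output, str) and SOLUTION_PATTERN.fullmatch(output):
        return "solved"
    # Anything else matches the second bullet: worth reporting.
    return "unexpected"
```

Logging every 'unexpected' result together with the source image makes a bug report much easier to act on.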
Changes
- Version 0.0.10:
  - Reached 10,000 training samples.
  - Reached 90%+ accuracy.
- Version 0.0.11:
  - Fixed an error with captcha images loaded from BytesIO.
- Version 0.0.12:
  - Code adjustments and improvements.
  - The program can now solve images where the last letter is corrupted.
- Version 0.0.13:
  - Added and tested the 'from_webdriver' classmethod.
- Version 0.1.0:
  - Crash test on 100,000 captchas; accuracy is 98.5%.
- Version 0.1.1 - 0.1.5:
  - Code adjustments and improvements.
  - Added tests.
- Version 0.2.0:
  - Second crash test on 120k+ captchas.
  - Accuracy increased to 99.1%.
  - Code coverage is 100%.
- Version 0.3.0:
  - The program can now solve images where letters intersect.
  - Third crash test on 140k+ captchas.
  - Accuracy increased to 99.998%.