Python tool that allows you to take multiple full page screenshots of web pages without ads.
Project description
Webarchiver
Version: 0.12.0
Python tool that takes full-page screenshots of web pages without ads.
Supports batching: list multiple links in a text file, or pass them on the command line separated by commas.
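The two batching inputs can be pictured with a small sketch. This is not webarchiver's own code: the helper name `collect_urls` is hypothetical, and it assumes the text file holds one URL per line, which the description above does not state.

```python
from pathlib import Path


def collect_urls(links_file=None, links_arg=None):
    """Merge URLs from a text file and from a comma-separated string.

    Assumption: the file contains one URL per line; blank lines are skipped.
    """
    urls = []
    if links_file:
        text = Path(links_file).read_text(encoding="utf-8")
        for line in text.splitlines():
            line = line.strip()
            if line:
                urls.append(line)
    if links_arg:
        # Same shape as the value given to -l / --links: "URL1,URL2,URL3"
        urls.extend(part.strip() for part in links_arg.split(",") if part.strip())
    # Drop duplicates while keeping first-seen order.
    return list(dict.fromkeys(urls))
```

Both sources are optional here, so the same helper covers a file only, a comma-separated list only, or both together.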
Requirements:
One of the following browser environments:
- Chrome/Chromium browser
- Firefox
- Selenoid Server
Usage:
| Short Flag | Long Flag | Description |
|---|---|---|
| -h | --help | See Usage |
| -b | --browser | Specify browser: Chrome / Firefox / Selenoid |
| -c | --clean | Convert mobile sites to regular sites |
| -d | --directory | Location where the images will be saved |
| | --dpi | DPI for the image |
| -e | --executor | Execution environment: Local, or selenoid\|Selenoid URL (e.g. selenoid\|http://selenoid.com/wd/hub) |
| -f | --file | Text file to read the URLs from |
| -l | --links | Comma-separated URLs (no spaces) |
| -i | --image-type | Save images as PNG or JPEG |
| -p | --processes | Number of processes to run concurrently |
| -u | --url-filter | Filter URLs that contain this string |
| -z | --zoom | The zoom to use on the browser |
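To make the flag table concrete, here is a minimal `argparse` sketch that mirrors it. It is an illustration only, not webarchiver's actual parser: the program name `webarchiver-sketch`, every default value, and the choice of types are assumptions.

```python
import argparse


def build_parser():
    """Hypothetical parser mirroring the flag table; all defaults are assumed."""
    parser = argparse.ArgumentParser(prog="webarchiver-sketch")
    parser.add_argument("-b", "--browser", default="Chrome")
    parser.add_argument("-c", "--clean", action="store_true")
    parser.add_argument("-d", "--directory", default=".")
    # --dpi has no short flag in the table.
    parser.add_argument("--dpi", type=float, default=1.0)
    parser.add_argument("-e", "--executor", default="Local")
    parser.add_argument("-f", "--file")
    # Comma-separated URLs with no spaces are split into a list.
    parser.add_argument(
        "-l", "--links",
        type=lambda value: [url for url in value.split(",") if url],
        default=[],
    )
    parser.add_argument(
        "-i", "--image-type", type=str.lower, choices=["png", "jpeg"], default="png"
    )
    parser.add_argument("-p", "--processes", type=int, default=1)
    parser.add_argument("-u", "--url-filter")
    parser.add_argument("-z", "--zoom", type=int, default=100)
    return parser
```

`-h` / `--help` is added by `argparse` automatically, so it is not declared explicitly.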
Example:
webarchiver -c -f <links_file.txt> -l "<URL1,URL2,URL3>" -i 'jpeg' -d "~/Downloads" -z 100 --dpi 1 --browser "Firefox"
webarchiver -c -f <links_file.txt> -l "<URL1,URL2,URL3>" -i 'png' -d "~/Downloads" -z 100 --dpi 1 --executor "selenoid|http://selenoid.com/wd/hub" --browser "Chrome"
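The same invocations can be assembled from a Python script. The sketch below only uses flags shown in the examples above; the helper names `build_command` and `run_webarchiver` are hypothetical, and it assumes the `webarchiver` command is on the `PATH`.

```python
import os
import shlex
import subprocess


def build_command(urls, directory, image_type="png", browser="Chrome", executor=None):
    """Build an argument list equivalent to the example commands above."""
    cmd = [
        "webarchiver",
        "-c",
        "-l", ",".join(urls),  # comma-separated, no spaces
        "-i", image_type,
        # No shell is involved, so expand "~" here.
        "-d", os.path.expanduser(directory),
        "--browser", browser,
    ]
    if executor:
        # e.g. "selenoid|http://selenoid.com/wd/hub"; no quoting is needed
        # because the value is passed as a single list element.
        cmd += ["--executor", executor]
    return cmd


def run_webarchiver(cmd):
    """Print the command, then run it; raises CalledProcessError on failure."""
    print(shlex.join(cmd))
    subprocess.run(cmd, check=True)
```

Passing the arguments as a list avoids shell quoting issues with the `|` in the Selenoid executor value.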
Install Instructions
Install Python Package
python -m pip install webarchiver
Build Instructions
Build Python Package
sudo chmod +x ./*.py
pip install .
python setup.py bdist_wheel --universal
# Test PyPI
twine upload --repository-url https://test.pypi.org/legacy/ dist/* --verbose -u "Username" -p "Password"
# Prod PyPI
twine upload dist/* --verbose -u "Username" -p "Password"
File details
Details for the file webarchiver-0.12.0-py2.py3-none-any.whl.
File metadata
- Download URL: webarchiver-0.12.0-py2.py3-none-any.whl
- Upload date:
- Size: 5.9 MB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.6
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 5b58928fcc041deb76d8de3c53d4790420867eee35a22db14a7cd0bf8b2660ab |
| MD5 | 9f0e334efe1cef16a378f64062efdb99 |
| BLAKE2b-256 | d91c248ab6e82cd091e6890013f606dfa53c50e4101d7296adfd26fe1350925c |
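A downloaded wheel can be checked against the SHA256 digest listed above. The following is a minimal sketch using only the standard library; the helper names `sha256_of` and `matches_digest` are hypothetical.

```python
import hashlib


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_digest(path, expected_hex):
    """Compare a file's SHA256 digest with an expected hex string."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For example, `matches_digest("webarchiver-0.12.0-py2.py3-none-any.whl", "<SHA256 from the table>")` should return `True` for an intact download.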