Python tool that allows you to take multiple full page screenshots of web pages without ads.

Webarchiver

Version: 0.12.0

Python tool that allows you to take full page screenshots of pages without ads

Supports batching by adding multiple links in a text file, or by adding links to the command line separated by commas.
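The two batching inputs described above can be combined roughly as follows. This is a minimal sketch, not the tool's actual code: the helper name `collect_urls` is hypothetical, and it assumes the text file holds one URL per line, which the description above does not specify.

```python
from pathlib import Path


def collect_urls(links_arg=None, links_file=None):
    """Merge URLs from a comma-separated string and a text file.

    Hypothetical helper; assumes the file lists one URL per line.
    """
    urls = []
    if links_arg:
        # Command-line style input: "URL1,URL2,URL3"
        urls.extend(part.strip() for part in links_arg.split(","))
    if links_file:
        text = Path(links_file).read_text(encoding="utf-8")
        urls.extend(line.strip() for line in text.splitlines())

    # Drop empty entries and duplicates while preserving first-seen order.
    seen = set()
    unique = []
    for url in urls:
        if url and url not in seen:
            seen.add(url)
            unique.append(url)
    return unique
```

Calling `collect_urls("https://a.example,https://b.example", "links.txt")` would return the command-line URLs first, followed by any new ones found in the file.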

Requirements:

One of the following browsers:

  • Chrome/Chromium browser
  • Firefox
  • Selenoid Server

Usage:

Short Flag  Long Flag     Description
-h          --help        See usage
-b          --browser     Specify browser: Chrome / Firefox / Selenoid
-c          --clean       Convert mobile sites to regular site
-d          --directory   Location where the images will be saved
            --dpi         DPI for the image
-e          --executor    Execution environment: Local / Selenoid Host|Selenoid URL
-f          --file        Text file to read the URLs from
-l          --links       Comma separated URLs (No spaces)
-i          --image-type  Save images as PNG or JPEG
-p          --processes   Number of processes to run concurrently
-u          --url-filter  Filter URLs that contain this string
-z          --zoom        The zoom to use on the browser
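For readers who want to see how a flag set like this maps onto code, here is a sketch using Python's standard `argparse` module. It mirrors the flags listed above but is not webarchiver's own parser: the program name `webarchiver-sketch`, every default value, and the value types are assumptions made for illustration.

```python
import argparse


def build_parser():
    """Illustrative parser mirroring the documented flags (defaults are assumed)."""
    parser = argparse.ArgumentParser(prog="webarchiver-sketch")  # -h/--help is added automatically
    parser.add_argument("-b", "--browser", default="Chrome")
    parser.add_argument("-c", "--clean", action="store_true")
    parser.add_argument("-d", "--directory", default=".")
    parser.add_argument("--dpi", type=float, default=1)
    parser.add_argument("-e", "--executor", default="Local")
    parser.add_argument("-f", "--file")
    # Split "URL1,URL2" into a list, ignoring empty pieces.
    parser.add_argument(
        "-l", "--links",
        type=lambda s: [u for u in s.split(",") if u],
        default=[],
    )
    # Normalise case first, then validate against the two documented formats.
    parser.add_argument(
        "-i", "--image-type",
        type=str.lower, choices=["png", "jpeg"], default="png",
    )
    parser.add_argument("-p", "--processes", type=int, default=1)
    parser.add_argument("-u", "--url-filter")
    parser.add_argument("-z", "--zoom", type=int, default=100)
    return parser


if __name__ == "__main__":
    print(build_parser().parse_args())
```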

Example:

webarchiver -c -f <links_file.txt> -l "<URL1,URL2,URL3>" -i 'jpeg' -d "~/Downloads" -z 100 --dpi 1 --browser "Firefox"
webarchiver -c -f <links_file.txt> -l "<URL1,URL2,URL3>" -i 'png' -d "~/Downloads" -z 100 --dpi 1 --executor "selenoid|http://selenoid.com/wd/hub" --browser "Chrome"
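The second example passes the executor as a name and a URL joined by `|`. A minimal sketch of splitting such a value is shown below; the function name `parse_executor` is hypothetical, and the exact format the tool accepts is inferred only from the example above.

```python
def parse_executor(value):
    """Split an executor string such as 'selenoid|http://host/wd/hub'.

    Hypothetical helper; the accepted format is an assumption based on
    the example command, not on the tool's source.
    """
    name, sep, url = value.partition("|")
    name = name.strip().lower()
    url = url.strip()
    if name == "local":
        # Local execution needs no remote URL.
        return ("local", None)
    if name == "selenoid" and sep and url:
        return ("selenoid", url)
    raise ValueError(f"Unrecognised executor value: {value!r}")
```

With this sketch, `parse_executor("selenoid|http://selenoid.com/wd/hub")` yields the pair `("selenoid", "http://selenoid.com/wd/hub")`, while a Selenoid value without a URL raises `ValueError`.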

Install Instructions

Install Python Package

python -m pip install webarchiver

Build Instructions

Build Python Package

sudo chmod +x ./*.py
pip install .
python setup.py bdist_wheel --universal
# Test Pypi
twine upload --repository-url https://test.pypi.org/legacy/ dist/* --verbose -u "Username" -p "Password"
# Prod Pypi
twine upload dist/* --verbose -u "Username" -p "Password"
