Zyte's Page Object pattern for web scraping

Project description

web-poet is a Python 3.10+ implementation of the page object pattern for web scraping. It enables writing portable, reusable web parsing code.
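
To make the idea concrete, here is a minimal sketch of the page object pattern using only the Python standard library. It deliberately does not use web-poet's own API: the names `BookPage`, `_TitleParser`, and `to_item`, as well as the example URL, are hypothetical and chosen for illustration. The point it shows is that the parsing logic lives in a class that receives the page content as input, so it can be reused and tested without performing any download.

```python
from dataclasses import dataclass
from html.parser import HTMLParser


class _TitleParser(HTMLParser):
    """Collects the text inside the first <title> element."""

    def __init__(self):
        super().__init__()
        self._in_title = False
        self.title = None

    def handle_starttag(self, tag, attrs):
        # Only the first <title> is recorded.
        if tag == "title" and self.title is None:
            self._in_title = True
            self.title = ""

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


@dataclass
class BookPage:
    """Hypothetical page object: holds its inputs and exposes parsed data."""

    url: str
    html: str

    def to_item(self) -> dict:
        # Parsing depends only on the inputs, never on how they were fetched.
        parser = _TitleParser()
        parser.feed(self.html)
        title = parser.title.strip() if parser.title else None
        return {"url": self.url, "title": title}


# The same page object works with HTML from any source (HTTP client, cache, test fixture).
page = BookPage(
    url="https://example.com/book/1",
    html="<html><head><title> A Book </title></head></html>",
)
item = page.to_item()
```

Because `BookPage` never fetches anything itself, the same class can be driven by any downloader or by a saved HTML file in a test.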

See the documentation.

Developing

Set up your local Python environment via:

  1. pip install -r requirements-dev.txt

  2. pre-commit install

Now every time you perform a git commit, these tools will run against the staged files:

  • black

  • isort

  • flake8

You can also run them without performing a commit by invoking pre-commit run --all-files or tox -e linters directly.

Download files

Source Distribution

web_poet-0.23.2.tar.gz (72.4 kB view details)

Uploaded Source

Built Distribution

web_poet-0.23.2-py3-none-any.whl (47.7 kB view details)

Uploaded Python 3

File details

Details for the file web_poet-0.23.2.tar.gz.

File metadata

  • Download URL: web_poet-0.23.2.tar.gz
  • Upload date:
  • Size: 72.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for web_poet-0.23.2.tar.gz
Algorithm Hash digest
SHA256 fd11eb09f0b4c7f7e82d5ba142319c268386f6d5709123491491b14fdc4042eb
MD5 704cb3295720657be9f5070612428f00
BLAKE2b-256 75a0d749937d3a7fdaeefbbccb1f3021282380a80c2e97cb25745b8de6640a66

See more details on using hashes here.
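
As a rough sketch of how a published digest can be checked against a downloaded file, the following uses Python's standard `hashlib` module. The helper names `sha256_of` and `matches_published_sha256` are hypothetical, and the sketch assumes the file has already been downloaded to a local path.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as f:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_sha256(path, expected_hex):
    """Compare a file's SHA256 digest with a published hex digest."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For example, calling `matches_published_sha256` with the path of a downloaded copy of web_poet-0.23.2.tar.gz and the SHA256 value listed above should return `True` if the file is intact.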

Provenance

The following attestation bundles were made for web_poet-0.23.2.tar.gz:

Publisher: publish.yml on scrapinghub/web-poet

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file web_poet-0.23.2-py3-none-any.whl.

File metadata

  • Download URL: web_poet-0.23.2-py3-none-any.whl
  • Upload date:
  • Size: 47.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for web_poet-0.23.2-py3-none-any.whl
Algorithm Hash digest
SHA256 ff9a30ae0bcfd2ba497ae4409e71d0b39b7e40c270b4379617b655dfd6362600
MD5 5dd8b42e486da5614f3a97e34a3c61f2
BLAKE2b-256 248ac61631257d92e3af69e92ed6475feee007d488db2b06d371c00cd33925c0

Provenance

The following attestation bundles were made for web_poet-0.23.2-py3-none-any.whl:

Publisher: publish.yml on scrapinghub/web-poet
