Skip to main content

A framework for creating web content extractors

Project description

Travis-CI Build Status Downloads Latest Version

Scrapple is a framework for creating web scrapers and web crawlers according to a key-value based configuration file. It provides a command line interface to run the script on a given JSON-based configuration input, as well as a web interface to provide the necessary input.

You can install Scrapple by using

$ sudo apt-get install libxml2-dev libxslt-dev python-dev lib32z1-dev
$ pip install scrapple

You can read the complete documentation.

History

0.2.2 - 2015-02-22

  • Fix bug in generate script template

0.2.1 - 2015-02-21

  • Update tests

0.2.0 - 2015-02-20

  • Include implementation for scrapple run and scrapple generate for crawlers

  • Modify web interface for editing scraper config files

  • Revise skeleton configuration files

0.1.1 - 2015-02-10

  • Release on PyPI with revisions

  • Include web interface for editing scraper config files

  • Modified implementations of certain functions

0.1.0 - 2015-02-04

  • First release on PyPI

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

scrapple-0.2.2.tar.gz (443.8 kB view details)

Uploaded Source

scrapple-0.2.2.linux-i686.tar.gz (314.9 kB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

scrapple-0.2.2-py2.py3-none-any.whl (321.5 kB view details)

Uploaded Python 2Python 3

scrapple-0.2.2-py2.7.egg (327.9 kB view details)

Uploaded Egg

scrapple-0.2.2-py2-none-any.whl (321.5 kB view details)

Uploaded Python 2

File details

Details for the file scrapple-0.2.2.tar.gz.

File metadata

  • Download URL: scrapple-0.2.2.tar.gz
  • Upload date:
  • Size: 443.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for scrapple-0.2.2.tar.gz
Algorithm Hash digest
SHA256 b8a4285730481b29a53e283eef65695a68260743c0ac8dd171bced6b1c674438
MD5 e8236d5e92e68a78ff2032f5e885a2cb
BLAKE2b-256 9aec36946dad2984b30e3c7102816cf9bf7d2c6a669a831bc45bd50111982a02

See more details on using hashes here.

File details

Details for the file scrapple-0.2.2.linux-i686.tar.gz.

File metadata

File hashes

Hashes for scrapple-0.2.2.linux-i686.tar.gz
Algorithm Hash digest
SHA256 b0b6ab4ab31553fda1c1333d8b07f1946f5c0a40c940023cd9219c9930e15d23
MD5 8d7fb029d633f1a72b9964a620b7ca0e
BLAKE2b-256 a41c8c75185c04f5ae4e12e5e55da8a4d1d5b83858ba7ac2b6c9820d9de39baa

See more details on using hashes here.

File details

Details for the file scrapple-0.2.2-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for scrapple-0.2.2-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 4d4c79545529e934f3e1cfdc7f583255f30232d2df9e893b49c8f9ebcb5debc8
MD5 a1a4ee994db42475c20a47ddbef51271
BLAKE2b-256 9dfc55237bdd9c3992f6562754e8b7277220a9eafaae7739a187f88dead984cb

See more details on using hashes here.

File details

Details for the file scrapple-0.2.2-py2.7.egg.

File metadata

  • Download URL: scrapple-0.2.2-py2.7.egg
  • Upload date:
  • Size: 327.9 kB
  • Tags: Egg
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for scrapple-0.2.2-py2.7.egg
Algorithm Hash digest
SHA256 d3338640b42e2d6fb9d4e2eb2b6b0290fc6e3202a42ea8c7fc5e3c16d269bafe
MD5 a361183b5c17daeb344dfd288487b014
BLAKE2b-256 76e781e2e5c4887fe111b9b8e765ffb7236c618d1f1ec220b833cf9f58f7eb91

See more details on using hashes here.

File details

Details for the file scrapple-0.2.2-py2-none-any.whl.

File metadata

File hashes

Hashes for scrapple-0.2.2-py2-none-any.whl
Algorithm Hash digest
SHA256 1382aeb6658b0801bb98594ec886cc03d84c4ca2f5db58544b58b987d904c3f7
MD5 aa24528625e586f7198fcea60f2aa1a5
BLAKE2b-256 18ed1a539358d8834b8ad8d4dd463e5a2d4b64843f2600a9ea76d436fe9cac88

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page