Skip to main content

Scrapy extension to sync `.scrapy` folder to an S3 bucket

Project description

Scrapy extension to sync .scrapy folder to an S3 bucket.

Installation

You can install scrapy-dotpersistence using pip:

pip install scrapy-dotpersistence

You can then enable the extension in your settings.py:

EXTENSIONS = {
    ...
    'scrapy_dotpersistence.DotScrapyPersistence': 0
}

How to use it

Enable extension through settings.py:

DOTSCRAPY_ENABLED = True

Configure the exension through settings.py:

ADDONS_AWS_ACCESS_KEY_ID = "ABC"
ADDONS_AWS_SECRET_ACCESS_KEY = "DEF"
ADDONS_AWS_USERNAME = "username"
ADDONS_S3_BUCKET = "test-bucket-name"

You can change a dotpersistence folder path with environ:

export DOTSCRAPY_DIR='/tmp/.scrapy'

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scrapy-dotpersistence-0.2.0.tar.gz (2.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

scrapy_dotpersistence-0.2.0-py2-none-any.whl (4.2 kB view details)

Uploaded Python 2

File details

Details for the file scrapy-dotpersistence-0.2.0.tar.gz.

File metadata

File hashes

Hashes for scrapy-dotpersistence-0.2.0.tar.gz
Algorithm Hash digest
SHA256 34eeeaebbab2cd92374a9ec7aa2b9451cc6bfe5c6e6bb857d7dcec08d58f810c
MD5 afa58771bfcd3b0ec788a439dfd7eca0
BLAKE2b-256 1e6c3a5505068b31045cb6e78a8231d46dc78e5033da92c84c5424d105e7ec6e

See more details on using hashes here.

File details

Details for the file scrapy_dotpersistence-0.2.0-py2-none-any.whl.

File metadata

File hashes

Hashes for scrapy_dotpersistence-0.2.0-py2-none-any.whl
Algorithm Hash digest
SHA256 0500f9b25e43673e71730678d098257cf23f1bdd821473312ec355cabeed5e19
MD5 71599621f9ed19c6f7232e171ec561fb
BLAKE2b-256 20524206dfe77d46559b77927bd160742063d89bb360bfb7299013330f64f2e8

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page