Scrapfly SDK for Scrapfly
Project description
Scrapfly SDK
Installation
pip install scrapfly-sdk
You can also install extra dependencies
pip install "scrapfly-sdk[seepdup]"for performance improvementpip install "scrapfly-sdk[concurrency]"for concurrency out of the box (asyncio / thread)pip install "scrapfly-sdk[scrapy]"for scrapy integrationpip install "scrapfly-sdk[all]"Everything!
For use of built-in HTML parser (via ScrapeApiResponse.selector property) additional requirement of either parsel or scrapy is required.
Get Your API Key
You can create a free account on Scrapfly to get your API Key.
Migration
Migrate from 0.7.x to 0.8
asyncio-pool dependency has been dropped
scrapfly.concurrent_scrape is now an async generator. If the concurrency is None or not defined, the max concurrency allowed by
your current subscription is used.
async for result in scrapfly.concurrent_scrape(concurrency=10, scrape_configs=[ScrapConfig(...), ...]):
print(result)
brotli args is deprecated and will be removed in the next minor. There is not benefit in most of case versus gzip regarding and size and use more CPU.
What's new
0.8.x
- Better error log
- Async/Improvement for concurrent scrape with asyncio
- Scrapy media pipeline are now supported out of the box
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file scrapfly-sdk-0.8.10.tar.gz.
File metadata
- Download URL: scrapfly-sdk-0.8.10.tar.gz
- Upload date:
- Size: 24.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
76d35fb846bfc40e444985d71f52097c7c3131c77071e16225ec2ff67caea411
|
|
| MD5 |
3f6b534ed808ee41187e3148691f17de
|
|
| BLAKE2b-256 |
8c1e82b8441cf21f56b6f335e1206663000c1ec433a3e7c94498824129078bc5
|
File details
Details for the file scrapfly_sdk-0.8.10-py3-none-any.whl.
File metadata
- Download URL: scrapfly_sdk-0.8.10-py3-none-any.whl
- Upload date:
- Size: 28.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ee662fef4b9b3e2560b9ed1d60c979e8f97b8041778fabec793ef7acd86ed4be
|
|
| MD5 |
73297e41b42b2466d8eb7b7b6f5cc59d
|
|
| BLAKE2b-256 |
ba97aa8085ee683b91a15b1b077d64cb25383ce9200329ddac516528702f0ff1
|