API for Scrapy spiders
Project description
Arachne provides a wrapper around your Scrapy Spider objects so that you can run them through a Flask app. All you have to do is set up SPIDER_SETTINGS in the settings file.
Installation
You can install Arachne with pip:
pip install Arachne
Sample settings
This is a sample settings file for the spiders in your project. The file must be named settings.py for Arachne to find it, and looks like this:
# settings.py file
SPIDER_SETTINGS = [
    {
        'endpoint': 'dmoz',
        'location': 'spiders.DmozSpider',
        'spider': 'DmozSpider'
    }
]
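A malformed entry in this list is easy to miss, so it can help to check the list before starting the app. The sketch below is a hypothetical helper, not part of Arachne. It assumes that every entry needs all three keys shown above and that endpoint names should be unique; neither rule is confirmed by the Arachne documentation.

```python
# Hypothetical helper, not part of Arachne: sanity-check SPIDER_SETTINGS.
# Assumption: each entry needs these three keys and a unique endpoint.
REQUIRED_KEYS = {"endpoint", "location", "spider"}


def check_spider_settings(settings):
    """Return a list of human-readable problems found in the settings list."""
    problems = []
    seen_endpoints = set()
    for index, entry in enumerate(settings):
        # Report any of the three expected keys that are absent.
        missing = REQUIRED_KEYS - set(entry)
        if missing:
            problems.append(
                "entry %d is missing keys: %s" % (index, ", ".join(sorted(missing)))
            )
        # Report an endpoint name that an earlier entry already used.
        endpoint = entry.get("endpoint")
        if endpoint is not None:
            if endpoint in seen_endpoints:
                problems.append("entry %d reuses endpoint %r" % (index, endpoint))
            seen_endpoints.add(endpoint)
    return problems


SPIDER_SETTINGS = [
    {
        "endpoint": "dmoz",
        "location": "spiders.DmozSpider",
        "spider": "DmozSpider",
    }
]

print(check_spider_settings(SPIDER_SETTINGS))
```

An empty list means no problems were found. Returning messages instead of raising lets you report every bad entry in one pass.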
Usage
An Arachne app looks very similar to a Flask app, but since Scrapy depends on the Python Twisted package, we need to run it with Twisted:
from twisted.web.wsgi import WSGIResource
from twisted.web.server import Site
from twisted.internet import reactor
from arachne import Arachne

app = Arachne(__name__)

# Wrap the WSGI app so that Twisted's reactor can serve it
resource = WSGIResource(reactor, reactor.getThreadPool(), app)
site = Site(resource)
reactor.listenTCP(8080, site)

if __name__ == '__main__':
    reactor.run()
Download files
Source Distribution
Arachne-0.3.0.tar.gz (7.5 kB)
File details
Details for the file Arachne-0.3.0.tar.gz.
File metadata
- Download URL: Arachne-0.3.0.tar.gz
- Upload date:
- Size: 7.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 18ad66e0fe90942404182062169107ce6aa5300d290ce4c07c3f9f2b3d2ce80d |
| MD5 | 8def3751d88f6635b4c7257883a10df4 |
| BLAKE2b-256 | 6d15300945a162f31eafc7f0bd5465e13ca88f8bd0c68acd08a357dd3fdac6ab |