Skip to main content

Simple scraper for kleinanzeigen.de searches with notifications for new ads.

Project description

ek-scraper

Simple scraper for kleinanzeigen.de searches with notifications for new ads.

Installation

Install this package from PyPi in a separate virtual environment using pipx.

pipx install ek-scraper

Usage

For the full usage check the ek-scraper --help command

Create a configuration file using

ek-scraper create-config <path/to/config.json>

The example configuration file will look like this:

{
  "filter": {
    "exclude_topads": true,
    "exclude_patterns": []
  },
  "notifications": {
    "pushover": {
        "token": "<your-app-api-token>",
        "user": "<your-user-api-token>",
        "device": []
    },
    "ntfy.sh": {
      "topic": "<your-private-topic>",
      "priority": 3
    },
  },
  "searches": [
    {
      "name": "Wohnungen in Hamburg Altona",
      "url": "https://www.kleinanzeigen.de/s-wohnung-mieten/altona/c203l9497",
      "recursive": true
    }
  ]
}

See Configuration for details on all configuration options.

  • Configure one or more searches in the searches section of the configuration, see Searches for more details
  • Configure notifications in the notifications section of the configuration, see Notifications for details on notification configuration
  • (Optional) Configure filters in the filter section of the configuration, see Filter for more details

Run the following command to initialize the data store without sending any notifications:

ek-scraper run --no-notifications path/to/config.json

Afterwards, run

ek-scraper run path/to/config.json

to receive notifications according to your notifications configuration.

Development

Follow the steps below to set up a development environment for this project.

  1. Clone this repository

    git clone git@github.com:jonasehrlich/ek-scraper.git
    
  2. Change directory into the repository

    cd ek-scraper
    
  3. Create a virtual environment using poetry

    poetry install
    
  4. (Optional) Install pre-commit environment

    $ pre-commit
    [INFO] Installing environment for https://github.com/pre-commit/pre-commit-hooks.
    [INFO] Once installed this environment will be reused.
    [INFO] This may take a few minutes...
    [INFO] Installing environment for https://github.com/psf/black.
    [INFO] Once installed this environment will be reused.
    [INFO] This may take a few minutes...
    Check Yaml...........................................(no files to check)Skipped
    Fix End of Files.....................................(no files to check)Skipped
    Trim Trailing Whitespace.............................(no files to check)Skipped
    black................................................(no files to check)Skipped
    

Configuration

Searches

Searches can be configured in the searches array of the configuration file. Each of the searches can be configured with the following parameters.

Name Description
name Name of the search, use a descriptive one (required)
url URL of the first page of your search (required)
recursive Whether to follow all pages of the search result
(optional, defaults to true)

Filter

Filters can be configured in the filter section of the configuration file to exclude specific ads from your scrape results on the client side. The following settings can be configured.

Name Description
exclude_topads Whether to exclude top ads from the results (optional, defaults to true)
exclude_patterns Case-insensitive regular expression patterns used to exclude ads (optional)

Notifications

Notifications can be configured in the notifications section of the configuration file.

Push notifications using Pushover

Screenshot of a push notification using Pushover

ek-scraper supports push notifications to your devices using Pushover. For further information on the service check their terms and conditions.

The implementation for Pushover notifications will send a single notification per search, if new ads are discovered.

To configure Pushover for notifications from the scraper, first register at the service and create an application (e.g. ek-scraper). To use the service in ek-scraper, add the pushover object to the notifications object in your configuration file and fill the API tokens. Selection of the devices which will receive the notifications, is supported using the device array.

Name Description
token API token of the Pushover app (required)
user API token of the Pushover user (required)
device List of device names to send the notifications to
(optional, defaults to all devices)

Push notifications using ntfy.sh

Screenshot of a push notification using ntfy.sh

ek-scraper supports push notifications to your devices using ntfy.sh. For further information on the service check their terms and conditions.

The implementation for ntfy.sh notifications will send a single notification per search, if new ads are discovered.

To configure ntfy.sh for notifications from the scraper, define a topic and subscribe to it in the mobile app.

Note that topic names are public, so it's wise to choose something that cannot be guessed easily. This can be done by including a UUID, e.g. by running the following command in your shell:

echo "ek-scraper-$(uuidgen)"

To use the service in ek-scraper, add the ntfy.sh object to the notifications object in your configuration file and add the topic you previously subscribed to.

Name Description
topic Topic to publish the notifications to
priority Priority to send the notifications with (optional, defaults to 3)

Running ek-scraper regularly

It should be avoided to run the tool too often to avoid getting your IP address blocked by kleinanzeigen.de

In order to run ek-scraper regularly on a Unix-like system, configure it as a cronjob.

To configure a cronjob, run

crontab -e

Edit the crontab table to run the command you want to run. A handy tool to check schedule configurations for cronjobs is crontab.guru.

For more information on configuring cronjobs use your favorite search engine.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ek_scraper-0.2.3.tar.gz (14.3 kB view hashes)

Uploaded Source

Built Distribution

ek_scraper-0.2.3-py3-none-any.whl (16.0 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page