Skip to main content

Load data about Python packages from PyPI into SQLite

Project description

pypi-to-sqlite

PyPI Changelog Tests License

Load data about Python packages from PyPI into SQLite

Installation

Install this tool using pip:

pip install pypi-to-sqlite

Usage

To create a SQLite database with details of one or more packages, run:

pypi-to-sqlite pypi.db datasette sqlite-utils

You can also process JSON that you have previously saved to disk like so:

curl -o datasette.json https://pypi.org/pypi/datasette/json
pypi-to-sqlite pypi.db -f datasette.json

The tool will create three tables: packages, versions and releases. The full table schema is shown below.

To create the tables with a prefix, use --prefix prefix. For example:

pypi-to-sqlite pypi.db datasette --prefix pypi_

This will create tables called pypi_packages, pypi_versions and pypi_releases.

Demo

You can see examples of tables created using this tool running in Datasette here:

Database schema

CREATE TABLE [packages] (
   [name] TEXT PRIMARY KEY,
   [summary] TEXT,
   [classifiers] TEXT,
   [description] TEXT,
   [author] TEXT,
   [author_email] TEXT,
   [description_content_type] TEXT,
   [home_page] TEXT,
   [keywords] TEXT,
   [license] TEXT,
   [maintainer] TEXT,
   [maintainer_email] TEXT,
   [package_url] TEXT,
   [platform] TEXT,
   [project_url] TEXT,
   [project_urls] TEXT,
   [release_url] TEXT,
   [requires_dist] TEXT,
   [requires_python] TEXT,
   [version] TEXT,
   [yanked] INTEGER,
   [yanked_reason] TEXT
);
CREATE TABLE [versions] (
   [id] TEXT PRIMARY KEY,
   [package] TEXT REFERENCES [packages]([name]),
   [name] TEXT
);
CREATE TABLE [releases] (
   [md5_digest] TEXT PRIMARY KEY,
   [package] TEXT REFERENCES [packages]([name]),
   [version] TEXT REFERENCES [versions]([id]),
   [packagetype] TEXT,
   [filename] TEXT,
   [comment_text] TEXT,
   [digests] TEXT,
   [has_sig] INTEGER,
   [python_version] TEXT,
   [requires_python] TEXT,
   [size] INTEGER,
   [upload_time] TEXT,
   [upload_time_iso_8601] TEXT,
   [url] TEXT,
   [yanked] INTEGER,
   [yanked_reason] TEXT
);

pypi-to-sqlite --help

Usage: pypi-to-sqlite [OPTIONS] DB_PATH [PACKAGE]...

  Load data about Python packages from PyPI into SQLite

  Usage example:

      pypi-to-sqlite pypy.db datasette sqlite-utils

  Use -f to load data from a JSON file instead:

      pypi-to-sqlite pypy.db -f datasette.json

  Created tables will be packages, versions and releases

  To create tables called pypi_packages, pypi_versions, pypi_releases use
  --prefix pypi_:

      pypi-to-sqlite pypy.db datasette sqlite-utils --prefix pypi_

Options:
  --version            Show the version and exit.
  -f, --file FILENAME  Import JSON from this file
  -d, --delay FLOAT    Wait this many seconds between requests
  --prefix TEXT        Prefix to use for the created database tables
  --help               Show this message and exit.

Development

To contribute to this tool, first checkout the code. Then create a new virtual environment:

cd pypi-to-sqlite
python -m venv venv
source venv/bin/activate

Now install the dependencies and test dependencies:

pip install -e '.[test]'

To run the tests:

pytest

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pypi-to-sqlite-0.2.3.tar.gz (9.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pypi_to_sqlite-0.2.3-py3-none-any.whl (8.7 kB view details)

Uploaded Python 3

File details

Details for the file pypi-to-sqlite-0.2.3.tar.gz.

File metadata

  • Download URL: pypi-to-sqlite-0.2.3.tar.gz
  • Upload date:
  • Size: 9.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/4.0.2 CPython/3.11.8

File hashes

Hashes for pypi-to-sqlite-0.2.3.tar.gz
Algorithm Hash digest
SHA256 d6866be63e347d5733d259e0708ffc6e69409744203c82e438ee3bbda44a4e20
MD5 4f27ea2f944de13c32853235156e4772
BLAKE2b-256 1f29cf97b1c72cb1aab81917e44d52e34ba3ffbb7efdfa2125ccf7cedc852602

See more details on using hashes here.

File details

Details for the file pypi_to_sqlite-0.2.3-py3-none-any.whl.

File metadata

  • Download URL: pypi_to_sqlite-0.2.3-py3-none-any.whl
  • Upload date:
  • Size: 8.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/4.0.2 CPython/3.11.8

File hashes

Hashes for pypi_to_sqlite-0.2.3-py3-none-any.whl
Algorithm Hash digest
SHA256 20f720cecb4920b7456deb0c718cefb321fb5124f7058fe1b203840a64ec98c0
MD5 9214205f884e9197a36072a7024d1d03
BLAKE2b-256 edb1b417733d3808d635e5b6aa29ec829df81eddfebaf2f7b37379c17a6a3106

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page