Skip to main content

tools for comparing biological sequences with k-mer sketches

Project description

sourmash

Quickly search, compare, and analyze genomic and metagenomic data sets.

Project Status: Active – The project has reached a stable, usable state and is being actively developed. License: 3-Clause BSD Documentation Gitter

DOI pyOpenSci

Bioconda install PyPI Conda Platforms Debian Testing Debian Unstable

Python 3.11 Python 3.12 Python 3.13 Build Status codecov

Usage:

sourmash sketch dna *.fq.gz
sourmash compare *.sig -o distances.cmp -k 31
sourmash plot distances.cmp

sourmash 1.0 is published on JOSS; please cite that paper if you use sourmash (doi: 10.21105/joss.06830):.

The latest major release is sourmash v4, which has several command-line and Python incompatibilities with previous versions. Please visit our migration guide to upgrade!


sourmash is a k-mer analysis multitool, and we aim to provide stable, robust programmatic and command-line APIs for a variety of sequence comparisons. Some of our special sauce includes:

  • FracMinHash sketching, which enables accurate comparisons (including ANI) between data sets of different sizes
  • sourmash gather, a combinatorial k-mer approach for more accurate metagenomic profiling

Please see the sourmash publications for details.

The name is a riff off of Mash, combined with @ctb's love of whiskey. (Sour mash is used in making whiskey.)

Maintainers: C. Titus Brown (@ctb), Luiz C. Irber, Jr (@luizirber), and N. Tessa Pierce-Ward (@bluegenes).

sourmash was initially developed by the Lab for Data-Intensive Biology at the UC Davis School of Veterinary Medicine, and now includes contributions from the global research and developer community.

Installation

We recommend using conda-forge to install sourmash:

conda install -c conda-forge sourmash-minimal

This will install the latest stable version of sourmash 4.

You can also use pip to install sourmash:

pip install sourmash

A quickstart tutorial is available.

Requirements

sourmash runs under Python 3.11 and later on Windows, Mac OS X, and Linux. The base requirements are screed, cffi, numpy, matplotlib, and scipy. Conda will install everything necessary, and is our recommended installation method (see below).

Installation with conda

conda-forge is a community maintained channel for the conda package manager. installing conda, you can install sourmash by running:

$ conda create -n sourmash_env -c conda-forge sourmash-minimal
$ conda activate sourmash_env
$ sourmash --help

which will install the latest released version.

Support

For questions, please open an issue on Github, or ask in our chat.

Development

Development happens on github at sourmash-bio/sourmash.

sourmash is developed in Python and Rust, and you will need a Rust environment to build it; see the developer notes for our suggested development setup.

After installation, sourmash is the main command-line entry point; run it with python -m sourmash, or do pip install -e /path/to/repo to do a developer install in a virtual environment.

The sourmash/ directory contains the Python library and command-line interface code.

The src/core/ directory contains the Rust library implementing core functionality.

Tests require py.test and can be run with make test.

Please see the developer notes for more information on getting set up with a development environment.

CTB Jan 2024

Project details


Release history Release notifications | RSS feed

This version

4.9.4

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sourmash-4.9.4.tar.gz (25.2 MB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

sourmash-4.9.4-py3-none-win_amd64.whl (3.5 MB view details)

Uploaded Python 3Windows x86-64

sourmash-4.9.4-py3-none-manylinux_2_28_x86_64.whl (5.1 MB view details)

Uploaded Python 3manylinux: glibc 2.28+ x86-64

sourmash-4.9.4-py3-none-manylinux_2_17_aarch64.manylinux2014_aarch64.whl (3.7 MB view details)

Uploaded Python 3manylinux: glibc 2.17+ ARM64

sourmash-4.9.4-py3-none-macosx_11_0_x86_64.whl (4.2 MB view details)

Uploaded Python 3macOS 11.0+ x86-64

sourmash-4.9.4-py3-none-macosx_11_0_arm64.whl (3.9 MB view details)

Uploaded Python 3macOS 11.0+ ARM64

File details

Details for the file sourmash-4.9.4.tar.gz.

File metadata

  • Download URL: sourmash-4.9.4.tar.gz
  • Upload date:
  • Size: 25.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.11.8

File hashes

Hashes for sourmash-4.9.4.tar.gz
Algorithm Hash digest
SHA256 28889d11041e398831875c7d17a367e3e5937b095d006752e45c7f2302f4626d
MD5 fd6f4d8eee58277f18940d87f9d8742d
BLAKE2b-256 ad1407265bede73653a2bd786b6ca9b5b1eb604e581e247505c8894401001487

See more details on using hashes here.

File details

Details for the file sourmash-4.9.4-py3-none-win_amd64.whl.

File metadata

  • Download URL: sourmash-4.9.4-py3-none-win_amd64.whl
  • Upload date:
  • Size: 3.5 MB
  • Tags: Python 3, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.11.8

File hashes

Hashes for sourmash-4.9.4-py3-none-win_amd64.whl
Algorithm Hash digest
SHA256 88cbc2d7fa0ea47a9dc0d0954f16c750eeffb5324a0fdff56dff5d4f08fba09c
MD5 cade637920da7d4f667438f95323358a
BLAKE2b-256 53e88bb05337b13ae165a701555df3ca32553b4bfd5a97ee92fe58a3d515d7d4

See more details on using hashes here.

File details

Details for the file sourmash-4.9.4-py3-none-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for sourmash-4.9.4-py3-none-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 7998eb803b1e75068256381404094c08566180a81ec60aadfada893922cc5bad
MD5 4503f7f87d70ba03bbc575372b7469f5
BLAKE2b-256 ff7b764ec39f604f812c3cef29d5ec1032c96169331eff5948253fd562501743

See more details on using hashes here.

File details

Details for the file sourmash-4.9.4-py3-none-manylinux_2_17_aarch64.manylinux2014_aarch64.whl.

File metadata

File hashes

Hashes for sourmash-4.9.4-py3-none-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 11906e896e68c9d7f58251bc1a539e02540d063f2bab9733dc9c6a2b0dfa3317
MD5 d9256d39ff45759abd65b2b03ae875c9
BLAKE2b-256 b3475fb20137c1acb6229f5ed5eeebeb3dbf2a895835b5edbd2585f8da110e58

See more details on using hashes here.

File details

Details for the file sourmash-4.9.4-py3-none-macosx_11_0_x86_64.whl.

File metadata

File hashes

Hashes for sourmash-4.9.4-py3-none-macosx_11_0_x86_64.whl
Algorithm Hash digest
SHA256 a300a3d5adc8b0ab5b46bcfa079a250e1a92fcba446a414b61714930b4d40360
MD5 00ff67f907af9b7c87f81ea999f77d95
BLAKE2b-256 6143cb744473b38349fcef526338e9895492c8d74c03c982f8d68d5583c6604e

See more details on using hashes here.

File details

Details for the file sourmash-4.9.4-py3-none-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for sourmash-4.9.4-py3-none-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 23bd26c5cb538d30289303d751f417b58a542b6511daf95d133167f10344e24e
MD5 bf95065458e82a6e3faafbf0d35fff8c
BLAKE2b-256 ef11e6f653244da8c690207efc76b54d420dba561fa5c8ac80c4b07829f3ee65

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page