
tools for comparing DNA sequences with MinHash sketches


sourmash


Compute MinHash signatures for nucleotide (DNA/RNA) and protein sequences.

Usage:

sourmash compute *.fq.gz
sourmash compare *.sig -o distances
sourmash plot distances
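To give a feel for what a MinHash signature is, here is a minimal pure-Python sketch of a bottom-k MinHash and a Jaccard estimate computed from two sketches. It is an illustration only: the function names, the use of MD5, and the default k and sketch size are assumptions made for this example, and it does not reproduce sourmash's own hashing or signature file format.

```python
import hashlib


def kmers(seq, k):
    """Yield every overlapping substring of length k."""
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]


def hash_kmer(kmer):
    """Map a k-mer to a 64-bit integer (MD5 is an arbitrary choice here)."""
    digest = hashlib.md5(kmer.encode("ascii")).digest()
    return int.from_bytes(digest[:8], "big")


def sketch(seq, k=5, num=20):
    """Bottom-k sketch: the `num` smallest distinct k-mer hashes, sorted."""
    return sorted({hash_kmer(km) for km in kmers(seq, k)})[:num]


def jaccard_estimate(a, b, num=20):
    """Estimate Jaccard similarity from two bottom-k sketches."""
    merged = sorted(set(a) | set(b))[:num]
    if not merged:
        return 0.0
    shared = set(a) & set(b)
    return sum(1 for h in merged if h in shared) / len(merged)


if __name__ == "__main__":
    # Two made-up sequences that share a long prefix.
    x = sketch("ACGTACGTTAGCCGATAGGCTTAACGGATC")
    y = sketch("ACGTACGTTAGCCGATAGGCTTAACCCTGA")
    print("estimated similarity:", jaccard_estimate(x, y))
```

The point of the technique is that only the small sketches need to be stored and compared, not the full sequences.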

We have demo notebooks on Binder that you can interact with.

Sourmash 1.0 is published in JOSS; please cite that paper if you use sourmash (doi: 10.21105/joss.00027).


The name is a riff on Mash, combined with @ctb's love of whiskey. (Sour mash is used in making whiskey.)

Primary authors: C. Titus Brown (@ctb) and Luiz C. Irber, Jr (@luizirber).

sourmash is a product of the Lab for Data-Intensive Biology at the UC Davis School of Veterinary Medicine.

Installation

We recommend using bioconda to install sourmash:

conda install sourmash

This will install the 2.0 pre-release series.

You can also use pip to install the pre-release like so:

pip install --pre sourmash

A quickstart tutorial is available.

Requirements

sourmash runs under both Python 2.7.x and Python 3.5+. The base requirements are screed and ijson, together with a C++ development environment and the CPython development headers and libraries (for the C++ extension).

The comparison code (sourmash compare) uses numpy, and the plotting code uses matplotlib and scipy, but most of the code is usable without these.

For search and gather you also need khmer version 2.1+.
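As a quick way to see which of the optional pieces are available in the current environment, the following sketch checks whether the modules named above can be imported. The grouping of modules by feature follows the text above, but the dictionary and function names are made up for this example and are not part of sourmash.

```python
import importlib.util

# Feature -> modules it needs, as described in the requirements text.
# This mapping is illustrative and is not read from sourmash itself.
OPTIONAL = {
    "compare": ["numpy"],
    "plot": ["matplotlib", "scipy"],
    "search and gather": ["khmer"],
}


def missing_modules(names):
    """Return the module names that cannot be found on the import path."""
    return [n for n in names if importlib.util.find_spec(n) is None]


def report():
    """Map each feature to the list of modules it is missing."""
    return {feature: missing_modules(mods) for feature, mods in OPTIONAL.items()}


if __name__ == "__main__":
    for feature, missing in report().items():
        status = "ok" if not missing else "missing " + ", ".join(missing)
        print(feature + ": " + status)
```

Note that this only checks that a module can be found; it does not check version constraints such as khmer 2.1+.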

Installation with conda

Bioconda is a channel for the conda package manager with a focus on bioinformatics software. After installing conda, you will need to add the bioconda channel as well as the other channels bioconda depends on. Once you have set up bioconda, you can install sourmash by running:

$ conda create -n sourmash_env sourmash python=3.6.4
$ source activate sourmash_env
$ sourmash compute -h

which will install the latest alpha release.

Support

Please ask questions and file issues on GitHub.

Development

Development happens on GitHub at dib-lab/sourmash.

After installation, sourmash is the main command-line entry point. You can also run it with python -m sourmash, or run pip install -e /path/to/repo for a developer install in a virtual environment.

The sourmash/ directory contains the library code.

Tests require py.test and can be run with make test.

Please see the developer notes for more information.


CTB Dec 2018


Download files

Download the file for your platform.

Source Distribution

sourmash-2.0.0a11.tar.gz (154.7 kB view details)

Uploaded Source

Built Distributions


sourmash-2.0.0a11-cp37-cp37m-manylinux1_x86_64.whl (13.3 MB view details)

Uploaded CPython 3.7m

sourmash-2.0.0a11-cp37-cp37m-macosx_10_11_intel.whl (13.0 MB view details)

Uploaded CPython 3.7m, macOS 10.11+ Intel (x86-64, i386)

sourmash-2.0.0a11-cp36-cp36m-manylinux1_x86_64.whl (13.3 MB view details)

Uploaded CPython 3.6m

sourmash-2.0.0a11-cp36-cp36m-macosx_10_11_intel.whl (13.0 MB view details)

Uploaded CPython 3.6m, macOS 10.11+ Intel (x86-64, i386)

sourmash-2.0.0a11-cp35-cp35m-manylinux1_x86_64.whl (13.3 MB view details)

Uploaded CPython 3.5m

sourmash-2.0.0a11-cp35-cp35m-macosx_10_11_intel.whl (13.0 MB view details)

Uploaded CPython 3.5m, macOS 10.11+ Intel (x86-64, i386)

sourmash-2.0.0a11-cp34-cp34m-manylinux1_x86_64.whl (13.3 MB view details)

Uploaded CPython 3.4m

sourmash-2.0.0a11-cp34-cp34m-macosx_10_11_intel.whl (13.0 MB view details)

Uploaded CPython 3.4m, macOS 10.11+ Intel (x86-64, i386)

sourmash-2.0.0a11-cp27-cp27mu-manylinux1_x86_64.whl (13.3 MB view details)

Uploaded CPython 2.7mu

sourmash-2.0.0a11-cp27-cp27m-manylinux1_x86_64.whl (13.3 MB view details)

Uploaded CPython 2.7m

sourmash-2.0.0a11-cp27-cp27m-macosx_10_11_intel.whl (13.0 MB view details)

Uploaded CPython 2.7m, macOS 10.11+ Intel (x86-64, i386)
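The wheel file names listed above follow the standard wheel naming scheme: distribution, version, an optional build tag, then Python tag, ABI tag, and platform tag. As a rough aid for picking the right file, here is a small sketch that splits a name into those parts. The `WheelName` tuple and `parse_wheel_name` function are hypothetical helpers written for this example, not part of sourmash or pip.

```python
from collections import namedtuple

WheelName = namedtuple(
    "WheelName", "distribution version build python abi platform"
)


def parse_wheel_name(filename):
    """Split a wheel file name into its dash-separated components."""
    if not filename.endswith(".whl"):
        raise ValueError("not a wheel file name: " + filename)
    parts = filename[: -len(".whl")].split("-")
    if len(parts) == 5:
        # No build tag present.
        dist, version, py, abi, plat = parts
        build = None
    elif len(parts) == 6:
        dist, version, build, py, abi, plat = parts
    else:
        raise ValueError("unexpected number of fields: " + filename)
    return WheelName(dist, version, build, py, abi, plat)


if __name__ == "__main__":
    print(parse_wheel_name("sourmash-2.0.0a11-cp37-cp37m-manylinux1_x86_64.whl"))
```

For example, the `cp36-cp36m-manylinux1_x86_64` suffix identifies the build for CPython 3.6 on 64-bit Linux.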

File details

Details for the file sourmash-2.0.0a11.tar.gz.

File metadata

  • Download URL: sourmash-2.0.0a11.tar.gz
  • Upload date:
  • Size: 154.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.6

File hashes

Hashes for sourmash-2.0.0a11.tar.gz
Algorithm Hash digest
SHA256 034cea32061d8e2bf7ec376dcfa0c699fa73ef4de288093c2a60260ffde647bc
MD5 365691b2a753c3b09cec8c1c0712b8d3
BLAKE2b-256 2fb7ab7dd734f71d3c378d7385c000db15282dcc0c9f16cf739ab27597d280bf

See more details on using hashes here.
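A downloaded file can be checked against the SHA256 digest listed above using only the Python standard library. This is a minimal sketch; the helper names are made up for this example, and the commented usage assumes the source tarball has already been downloaded into the current directory.

```python
import hashlib


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def matches(path, expected_hex):
    """True if the file's SHA256 digest equals the expected hex string."""
    return sha256_of(path) == expected_hex.strip().lower()


# Usage, assuming the tarball sits in the current directory:
# matches(
#     "sourmash-2.0.0a11.tar.gz",
#     "034cea32061d8e2bf7ec376dcfa0c699fa73ef4de288093c2a60260ffde647bc",
# )
```

Reading in chunks keeps memory use small even for the 13 MB wheel files.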

File details

Details for the file sourmash-2.0.0a11-cp37-cp37m-manylinux1_x86_64.whl.

File metadata

  • Download URL: sourmash-2.0.0a11-cp37-cp37m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 13.3 MB
  • Tags: CPython 3.7m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.6

File hashes

Hashes for sourmash-2.0.0a11-cp37-cp37m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 67cf5d1f576c6d7b95ac2dbe117e6a34373952290ea764d1addc93443e295d06
MD5 4702c6b9fee60c2ffb09dce4d672cdba
BLAKE2b-256 d9815b7e25fd01482334f70246c59d08e93a8c3bcd616045908065f242de41e2


File details

Details for the file sourmash-2.0.0a11-cp37-cp37m-macosx_10_11_intel.whl.

File metadata

  • Download URL: sourmash-2.0.0a11-cp37-cp37m-macosx_10_11_intel.whl
  • Upload date:
  • Size: 13.0 MB
  • Tags: CPython 3.7m, macOS 10.11+ Intel (x86-64, i386)
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.6

File hashes

Hashes for sourmash-2.0.0a11-cp37-cp37m-macosx_10_11_intel.whl
Algorithm Hash digest
SHA256 5376cde25792828486c8446d660833b1d93dc678ccf3e76b0e2ba4db2485c47b
MD5 5fff4e8f2e7bf30aa7afc0a85b9579df
BLAKE2b-256 f4df0046ea7837adce36d138963793fb3f8bb415faf88699f8f7f9df86f42ce7


File details

Details for the file sourmash-2.0.0a11-cp36-cp36m-manylinux1_x86_64.whl.

File metadata

  • Download URL: sourmash-2.0.0a11-cp36-cp36m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 13.3 MB
  • Tags: CPython 3.6m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.6

File hashes

Hashes for sourmash-2.0.0a11-cp36-cp36m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 7335856c59726a08409bccddf5f505cefb7d901ddcf9e64b3a0838cfd3817119
MD5 23a8ea684730868e701552bcd26ba824
BLAKE2b-256 e597ecf6a971f37a4caa0e1c193e5e17b7bb867a81a095d29f592c9c6f243c31


File details

Details for the file sourmash-2.0.0a11-cp36-cp36m-macosx_10_11_intel.whl.

File metadata

  • Download URL: sourmash-2.0.0a11-cp36-cp36m-macosx_10_11_intel.whl
  • Upload date:
  • Size: 13.0 MB
  • Tags: CPython 3.6m, macOS 10.11+ Intel (x86-64, i386)
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.6

File hashes

Hashes for sourmash-2.0.0a11-cp36-cp36m-macosx_10_11_intel.whl
Algorithm Hash digest
SHA256 8466da90f06eac00af2e410dab66abbcf22b29255169d590948f1aa6a8417ea9
MD5 1578ff07d3ecd47095f88f221531a42a
BLAKE2b-256 fb81a8d7afa2cd7b7b3c48bcbb14f8183fc555c1f9b6aeb150c4e15a69974054


File details

Details for the file sourmash-2.0.0a11-cp35-cp35m-manylinux1_x86_64.whl.

File metadata

  • Download URL: sourmash-2.0.0a11-cp35-cp35m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 13.3 MB
  • Tags: CPython 3.5m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.6

File hashes

Hashes for sourmash-2.0.0a11-cp35-cp35m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 3d54f899aef1d0fe08b677f2da58487a3fddbda08ade5fb92f0d75e7f4983331
MD5 dbb65f8ba1225eebcefd8a631d2d793e
BLAKE2b-256 dfd9b785ccbb2e829629458125a41597a4070bb433300b1c2722f81054ed5b44


File details

Details for the file sourmash-2.0.0a11-cp35-cp35m-macosx_10_11_intel.whl.

File metadata

  • Download URL: sourmash-2.0.0a11-cp35-cp35m-macosx_10_11_intel.whl
  • Upload date:
  • Size: 13.0 MB
  • Tags: CPython 3.5m, macOS 10.11+ Intel (x86-64, i386)
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.6

File hashes

Hashes for sourmash-2.0.0a11-cp35-cp35m-macosx_10_11_intel.whl
Algorithm Hash digest
SHA256 71bdd89dcaa4d53cfa701a0d2c7e9e4291583d03db0a865954fda432a74c457a
MD5 4f00e02fe3e0836551e86712111ccd08
BLAKE2b-256 1057e6ebdcca2cfd8f9a14d6ceac244e9403a26e6b30e732a760aeb65c9ffd9d


File details

Details for the file sourmash-2.0.0a11-cp34-cp34m-manylinux1_x86_64.whl.

File metadata

  • Download URL: sourmash-2.0.0a11-cp34-cp34m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 13.3 MB
  • Tags: CPython 3.4m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.6

File hashes

Hashes for sourmash-2.0.0a11-cp34-cp34m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 b66d6b2fc5a0dd541701eeb86896818447dd9922493bc9f78836f2b49b8c0298
MD5 ba59124b477a187d95f045c886684bc7
BLAKE2b-256 ffd1d61bc1306d8558a8b55c1e3c37dd4701331d133ca17c07a763119ccc55af


File details

Details for the file sourmash-2.0.0a11-cp34-cp34m-macosx_10_11_intel.whl.

File metadata

  • Download URL: sourmash-2.0.0a11-cp34-cp34m-macosx_10_11_intel.whl
  • Upload date:
  • Size: 13.0 MB
  • Tags: CPython 3.4m, macOS 10.11+ Intel (x86-64, i386)
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.6

File hashes

Hashes for sourmash-2.0.0a11-cp34-cp34m-macosx_10_11_intel.whl
Algorithm Hash digest
SHA256 cb571a4ee01f171001fcc285f766bde091c359baf2022c2dddf004c643c6ee10
MD5 e4c5bd6594b678556ee9e7a84483d8dd
BLAKE2b-256 3d2f28e3844d3e3db8c9ca9d46f75319ecc5739c20f290b35e1b65196ea3d365


File details

Details for the file sourmash-2.0.0a11-cp27-cp27mu-manylinux1_x86_64.whl.

File metadata

  • Download URL: sourmash-2.0.0a11-cp27-cp27mu-manylinux1_x86_64.whl
  • Upload date:
  • Size: 13.3 MB
  • Tags: CPython 2.7mu
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.6

File hashes

Hashes for sourmash-2.0.0a11-cp27-cp27mu-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 4108ae79be0bb9a3dbc4c906d87b827b393644c537750862c6a3ff4d289b6510
MD5 69f1822fe1c7d040c6a7d4995f5cc2f5
BLAKE2b-256 740b85fe14786a584b193e81f94b492105f32c97a31d2bc4d7a8b4f710eee001


File details

Details for the file sourmash-2.0.0a11-cp27-cp27m-manylinux1_x86_64.whl.

File metadata

  • Download URL: sourmash-2.0.0a11-cp27-cp27m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 13.3 MB
  • Tags: CPython 2.7m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.6

File hashes

Hashes for sourmash-2.0.0a11-cp27-cp27m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 89c38ef2c46342af761c4be13c877f1b99e473800f6a221256a486e42d1126d9
MD5 85167b23393e13ece80063903877b0a3
BLAKE2b-256 34f54f92c57c34c8dfeb6e187e3982698449b0c16e2f52191df8727ca2ad366c


File details

Details for the file sourmash-2.0.0a11-cp27-cp27m-macosx_10_11_intel.whl.

File metadata

  • Download URL: sourmash-2.0.0a11-cp27-cp27m-macosx_10_11_intel.whl
  • Upload date:
  • Size: 13.0 MB
  • Tags: CPython 2.7m, macOS 10.11+ Intel (x86-64, i386)
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.6

File hashes

Hashes for sourmash-2.0.0a11-cp27-cp27m-macosx_10_11_intel.whl
Algorithm Hash digest
SHA256 b0c4c1088c64878ac08ce1338be7f912d2af2f92bc8960c26b71e25853c486c2
MD5 df47d404f06d16ec8cb0e70a59d5e0e4
BLAKE2b-256 20bbb00df12a6ffaad9056f8f5ce50b364f6b28343034f63d43653ac232ea6af

