Calculates both scaffold and contig statistics (N50, L50, etc.) from a scaffold FASTA file.

Project description

assembly_stats

A Python library that calculates both scaffold and contig statistics (N50, L50, etc.) from a scaffold FASTA file. It breaks each scaffold into contigs wherever there is more than one N, then calculates statistics for both the scaffolds and the contigs.
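
The splitting and the N50/L50 calculation can be illustrated with a minimal sketch. This is not the library's actual code: the function names `split_scaffold` and `n50_l50` are hypothetical, and the sketch assumes that "more than one N" means a run of two or more consecutive N characters.

```python
import re


def split_scaffold(seq):
    """Split a scaffold into contigs at runs of two or more N.

    Assumption: a single isolated N does not break the scaffold.
    """
    return [contig for contig in re.split(r"[Nn]{2,}", seq) if contig]


def n50_l50(lengths):
    """Return (N50, L50) for a collection of sequence lengths.

    N50 is the length of the sequence at which the running total,
    taken from longest to shortest, first reaches half of the total
    length. L50 is the number of sequences needed to get there.
    """
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half:
            return length, count
    return 0, 0  # empty input


# Three toy scaffolds (illustrative sequences, not real data).
scaffolds = [
    "ACGT" * 5 + "NN" + "AC" * 3,   # splits into 20 bp and 6 bp
    "ACGTNACGT",                    # single N, stays one 9 bp contig
    "A" * 10 + "NNNN" + "C" * 4,    # splits into 10 bp and 4 bp
]
contigs = [c for s in scaffolds for c in split_scaffold(s)]

scaffold_lengths = [len(s) for s in scaffolds]
contig_lengths = [len(c) for c in contigs]

print("scaffold N50/L50:", n50_l50(scaffold_lengths))
print("contig N50/L50:", n50_l50(contig_lengths))
```

Note that gap characters count toward scaffold lengths in this sketch but not toward contig lengths, because the N runs are discarded when splitting.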

This is a rewrite of fasta_metadata_parser, written to speed up the old implementation and, most importantly, to learn how to install Python scripts on the Smithsonian HPC.

Installation

pip install assembly_stats

Usage

  $ assembly_stats -h

    usage: assembly_stats [-h] filename

    Calculate statistics about genome assemblies.

    positional arguments:
      filename    Genome file in FASTA format.

    optional arguments:
      -h, --help  show this help message and exit

Once the statistics for the genome assembly have been calculated, they are printed in JSON format.
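
Because the output is JSON, it can be consumed from another Python script. The following is a rough sketch, assuming the JSON is written to standard output and nothing else is printed alongside it. The helper name `run_assembly_stats` is hypothetical, and no particular key names in the output are assumed.

```python
import json
import subprocess


def run_assembly_stats(fasta_path):
    """Run the assembly_stats command and parse its JSON output.

    Assumption: the tool prints only JSON to stdout.
    """
    result = subprocess.run(
        ["assembly_stats", fasta_path],
        capture_output=True,  # collect stdout and stderr
        text=True,            # decode output as str
        check=True,           # raise CalledProcessError on failure
    )
    return json.loads(result.stdout)
```

Calling `run_assembly_stats("genome.fasta")` would return the parsed statistics as ordinary Python objects, which can then be inspected or written elsewhere.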

Next steps

  • Add the ability to save NumPy sequence-length arrays for further visualization, since generating them is what takes the most time.
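
One possible way to approach this, shown only as a sketch and not as a feature of the current release, is to write the length arrays with `numpy.save` and read them back with `numpy.load`. The helper names `save_lengths` and `load_lengths` and the file name are hypothetical.

```python
import os
import tempfile

import numpy as np


def save_lengths(lengths, path):
    """Write sequence lengths to a .npy file as 64-bit integers."""
    np.save(path, np.asarray(lengths, dtype=np.int64))


def load_lengths(path):
    """Read a length array previously written by save_lengths."""
    return np.load(path)


# Round trip through a temporary directory with toy lengths.
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "scaffold_lengths.npy")
    save_lengths([28, 18, 9], path)
    restored = load_lengths(path)

print(restored)
```

Saving the arrays once would let plots be regenerated later without re-reading the FASTA file.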

Download files

Source Distribution

assembly_stats-0.1.1.tar.gz (3.1 kB view details)

Uploaded Source

Built Distribution

assembly_stats-0.1.1-py3-none-any.whl (5.0 kB view details)

Uploaded Python 3

File details

Details for the file assembly_stats-0.1.1.tar.gz.

File metadata

  • Download URL: assembly_stats-0.1.1.tar.gz
  • Size: 3.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.4

File hashes

Hashes for assembly_stats-0.1.1.tar.gz
Algorithm Hash digest
SHA256 3312a5c2ce15ede96ca203ab5858f5780ee0079e60c8789e9c84cd015cfaa24d
MD5 003bae78e6eb64acdf3b82a78e0acc27
BLAKE2b-256 2b02675bf5f19deab0b7dd7615812207f24e47519e3e4dff4a420071d2ba8229

File details

Details for the file assembly_stats-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: assembly_stats-0.1.1-py3-none-any.whl
  • Size: 5.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.4

File hashes

Hashes for assembly_stats-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 6abe98543052b511c87b63315a8d17bdda47a28b4ae48ac7e2f4180519ef4c01
MD5 426aaf47b3dda8eabc645a0a5fe05bd9
BLAKE2b-256 82c1c1faa3b6535678710e4120c5f471b6d9f74834f700dbf22d27dbb36665de
