A package for computing variation numbers

Variation Number

A package for calculating the variation number of nucleotide or protein sequences using sequence orthologs.

The Characteristic Attribute Organization System (CAOS) discovers rules associated with a given phylogenetic tree. A pure (Pu) rule, or character attribute (CA), is a state that exists in all elements of a clade but not in the alternate clade; a private (Pr) CA is present in some members of a clade but absent from the alternate clade. The variation number (VN) of a position is the number of times that position occurs as a CA across all clades of the tree.

The method is described in the publication:
Lai, J., & Sarkar, I. N. (2021). A Phylogenetic Approach to Analyze the Conservativeness of BRCA1 and BRCA2 Mutations. AMIA ... Annual Symposium proceedings. AMIA Symposium, 2020, 677–686. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8075528/
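
The definitions of pure and private CAs and of the variation number can be illustrated on a toy example. The sketch below is only one literal reading of those definitions, not the package's implementation or the published CAOS algorithm. The nested-tuple tree, the four-sequence alignment, and the helper names (leaves, classify, variation_numbers) are all hypothetical, and a strictly binary tree is assumed.

```python
def leaves(node):
    """Return the leaf names under a nested-tuple tree node."""
    if isinstance(node, str):
        return [node]
    return [name for child in node for name in leaves(child)]


def classify(states_in, states_out):
    """Classify one clade's states at a position against the alternate clade.

    Returns "pure" if every member shares a single state that the alternate
    clade lacks, "private" if only some members carry a state that the
    alternate clade lacks, and None if the clade has no distinguishing state.
    """
    unique = set(states_in) - set(states_out)
    if not unique:
        return None
    return "pure" if len(set(states_in)) == 1 else "private"


def variation_numbers(tree, alignment):
    """Count, per alignment column, the nodes at which the column is a CA."""
    length = len(next(iter(alignment.values())))
    counts = [0] * length
    stack = [tree]
    while stack:
        node = stack.pop()
        if isinstance(node, str):
            continue  # a leaf defines no split
        left, right = node  # assumption: strictly binary tree
        stack.extend(node)
        left_names, right_names = leaves(left), leaves(right)
        for pos in range(length):
            l_states = [alignment[name][pos] for name in left_names]
            r_states = [alignment[name][pos] for name in right_names]
            if classify(l_states, r_states) or classify(r_states, l_states):
                counts[pos] += 1
    return counts


# Hypothetical toy data: four aligned sequences of length three.
tree = (("a", "b"), ("c", "d"))
alignment = {"a": "MKV", "b": "MKL", "c": "MRL", "d": "MRL"}

print(variation_numbers(tree, alignment))  # [0, 1, 2]
```

In this toy case the first column is fully conserved (VN 0), the second is a pure CA at the root only (VN 1), and the third is a private CA at the root and a pure CA at the split between a and b (VN 2).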

Features

  • Download orthologs
  • Build phylogenetic trees
  • Generate variation numbers

Required python packages

Python packages (most of which can be installed using pip) needed to run variation_number include:

Required external packages

To run vn.py, install the command-line versions of:

  1. Clustal Omega: http://www.clustal.org/omega/
  2. MAFFT: https://mafft.cbrc.jp/alignment/software/
  3. PAUP: http://phylosolutions.com/paup-test/
  4. FastTree: http://www.microbesonline.org/fasttree/
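
Before running an analysis, it can help to confirm that the external tools listed above are reachable from the command line. The sketch below uses the standard-library shutil.which for that. The executable names are assumptions (they vary between installs and platforms, and PAUP in particular is often installed under a versioned name), so adjust them to match the local setup.

```python
import shutil

# Assumed executable names; adjust to whatever the local installation provides.
TOOLS = {
    "Clustal Omega": "clustalo",
    "MAFFT": "mafft",
    "PAUP": "paup",
    "FastTree": "FastTree",
}


def find_tools(tools=TOOLS):
    """Map each tool label to its resolved path, or None if it is not on PATH."""
    return {label: shutil.which(exe) for label, exe in tools.items()}


for label, path in find_tools().items():
    print("{:<14} {}".format(label, path or "not found on PATH"))
```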

Installation using pip

variation_number (0.2.5) is published on PyPI. Use the following command to install it with pip:

$ pip install variation-number
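
To confirm which version ended up in the current environment, the installed distribution can be queried with the standard-library importlib.metadata module (Python 3.8 or later). This is a minimal sketch; installed_version is a hypothetical helper, not part of the package.

```python
from importlib import metadata


def installed_version(dist_name):
    """Return the installed version string of a distribution, or None if absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


# Prints the installed version, or None if the package is not installed.
print(installed_version("variation-number"))
```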

Usage

import os

import variation_number as vn

gene = 'BRCA1'
seqtype = 'protein'
outputDir = '{}/output'.format(os.getcwd())

# Download orthologs from the NCBI orthologs database (optional; a user-provided sequence file can be used instead)
acc = vn.getFasta(gene, outputDir, seqtype, refseqID=None)

# Calculate variation numbers, aligning with Clustal Omega and building the tree with PAUP
# The input file and output directory are passed positionally (assumed to be the first two parameters)
vn.processVN('{}/{}'.format(outputDir, gene), outputDir, reindex=False, accession_full=acc, seqType=seqtype, aligned=False, alignTool='clustal', treeTool='paup')
