Skip to main content

Parsing library for extracting data frames of genomic features from GTF files

Project description

Coverage Status PyPI

gtfparse

Parsing tools for GTF (gene transfer format) files.

Example usage

Parsing all rows of a GTF file into a Pandas DataFrame

from gtfparse import read_gtf

# returns GTF with essential columns such as "feature", "seqname", "start", "end"
# alongside the names of any optional keys which appeared in the attribute column
df = read_gtf("gene_annotations.gtf")

# filter DataFrame to gene entries on chrY
df_genes = df[df["feature"] == "gene"]
df_genes_chrY = df_genes[df_genes["seqname"] == "Y"]

Getting gene FPKM values from a StringTie GTF file

from gtfparse import read_gtf

df = read_gtf(
    "Transcripts.gtf",
    column_converters={"FPKM": float})

gene_fpkms = {
    gene_name: fpkm
    for (gene_name, fpkm, feature)
    in zip(df["seqname"], df["FPKM"], df["feature"])
    if feature == "gene"
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gtfparse-2.4.1.tar.gz (17.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

gtfparse-2.4.1-py3-none-any.whl (15.4 kB view details)

Uploaded Python 3

File details

Details for the file gtfparse-2.4.1.tar.gz.

File metadata

  • Download URL: gtfparse-2.4.1.tar.gz
  • Upload date:
  • Size: 17.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.10

File hashes

Hashes for gtfparse-2.4.1.tar.gz
Algorithm Hash digest
SHA256 dbb6dd88bc5535b1d6ca8fab3154f3408c9b67d7215bbc1762d8f855919cc8a5
MD5 7b3cf45c40178c530eff3b3cbf49df61
BLAKE2b-256 28dbe11e3cab72334d11d83b4f60c4d816eaf2dd43d4ec29b9563f75aaa3d2a2

See more details on using hashes here.

File details

Details for the file gtfparse-2.4.1-py3-none-any.whl.

File metadata

  • Download URL: gtfparse-2.4.1-py3-none-any.whl
  • Upload date:
  • Size: 15.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.10

File hashes

Hashes for gtfparse-2.4.1-py3-none-any.whl
Algorithm Hash digest
SHA256 31ebc67d33264d0e37b9a67959614099c99e2a63df042f26ec5d81c72c11d249
MD5 865e9f248f602bb68016a99e2a3a1986
BLAKE2b-256 a597ce29851eb198dad398c9657f3ad3e33b9f368775fd62784d03841835f3fc

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page