Convert gVCF into BED
Project description
This is a small tool to convert a gVCF file to BED. This is useful for extracting regions that pass a certain genotype quality threshold.
Installation
gvcf2bed is now available through pypi with: pip install gvcf2bed
Requirements
Python 3.4+
pyvcf
cyvcf2
For developers
pytest
pytest-cov
Changelog
0.3.1
Fix a bug for variants where GQ is not defined.
0.3
Use cyvcf2 by default in tool. This results in a speed-up of approximately 8-10x. Existing API has not changed, and will still work with pyvcf.
Add separate filter for non-variants, as GQ scores may have a different distribution on non-variant records than on variant records.
Usage
usage: gvcf2bed [-h] -I INPUT -O OUTPUT [-s SAMPLE] [-q QUALITY]
[-nq NON_VARIANT_QUALITY] [-b]
Create a BED file from a gVCF. Regions are based on a minimum genotype
quality. The gVCF file must contain a GQ field in its FORMAT fields. GQ scores
of non-variants records have a different distribution from the GQ score
distribution of variant records. Hence, an option is provided to set a
different threshold for non-variant positions.
optional arguments:
-h, --help show this help message and exit
-I INPUT, --input INPUT
Input gVCF
-O OUTPUT, --output OUTPUT
Output bed file
-s SAMPLE, --sample SAMPLE
Sample name in VCF file to use. Will default to first
sample (alphabetically) if not supplied
-q QUALITY, --quality QUALITY
Minimum genotype quality (default 20)
-nq NON_VARIANT_QUALITY, --non-variant-quality NON_VARIANT_QUALITY
Minimum genotype quality for non-variant records
(default 20)
-b, --bedgraph Output in bedgraph mode
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file gvcf2bed-0.3.1.tar.gz.
File metadata
- Download URL: gvcf2bed-0.3.1.tar.gz
- Upload date:
- Size: 5.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.6.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b6e11d3561bc362ac19bbb42c1e37a9395dc43dcbcb6c444f99db29ae12fa7c2
|
|
| MD5 |
61f8398bfa03e7fc787f3907557f1de3
|
|
| BLAKE2b-256 |
834b63bb9a61a31cfc3816643ab46e44d896867dda2e57bce6c8deef1ffdfcbc
|
File details
Details for the file gvcf2bed-0.3.1-py3-none-any.whl.
File metadata
- Download URL: gvcf2bed-0.3.1-py3-none-any.whl
- Upload date:
- Size: 5.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.6.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9a1f8b1fad0bb85a0fdba6d23a17f338bee73ce8aa34348cf917620947025d9d
|
|
| MD5 |
e603ce5bc5a95d99443f8deb83de5b3f
|
|
| BLAKE2b-256 |
1a8927173d160b5fafe2195c0d1329018d2cf81a7e3c2c03e6ad47a340e458f7
|