NIPT analysis pipeline
Project description
FluFFyPipe
NIPT analysis pipeline, using WisecondorX for detecting aneuplodies and large CNVs, AMYCNE for FFY and PREFACE for FF prediction (optional). FluFFYPipe produces a variety of output files, as well as a per batch csv summary.
Run FluFFyPipe
Run NIPT analysis, using a previously comnputed reference:
fluffy --sample <samplesheet> --project <input_folder> --out <output_folder> analyse
Run NIPT analysis, using an internally computed reference (i.e the reference is built using all samples listed in samplesheet):
fluffy --sample <samplesheet> --project <input_folder> --out <output_folder> analyse --batch-ref
optionally, skip preface:
fluffy --sample <samplesheet> --project <input_folder> --out <output_folder> --skip_preface analyse
All output will be written to the output folder, this output includes:
bam files
wisecondorX output
tiddit coverage summary
Fetal fraction estimation
as well as a summary csv and multiqc html (per batch)
the input folder is a project folder containing one folder per sample, each of these subfolders contain the fastq file(s). The samplesheet contains at least a "sampleID" column, the sampleID should match the subfolders in the input folder. The samplesheet may contain other columns, such as flowcell and index folder: such columns will be printed to the summary csv. If the samplesheet contains a SampleName column, fluffy will name the output according to SampleName
Create a WisecondorX reference
fluffy --sample <samplesheet> --project <input_folder> --out <output_folder> reference
samplesheet should contain atleast a "sampleID" column. All samples in the samplesheet will be used to construct the reference, visit the WisecondorX manual for more information.
Troubleshooting and rerun
There are three statuses of the fluffy pipeline: running, complete, and failed
The status of a fluffy run is found in the
<output_folder>/analysis_status.json
The status of all jobs are listed in
<output_folder>/sacct/fluffy_<date>.log.status
Where is the timepoint when the jobs were submitted Use grep to find the failed jobs:
grep -v COMPLETE <output_folder>/sacct/fluffy_<date>.log.status
The output logs are stored in:
<output_folder>/logs
Before continuing, you may want to generate the summary csv for all completed cases:
bash <output_folder>/scripts/summarizebatch-<hash>
where is a randomly generated string.
use the rerun module to rerun failed fluffy analyses:
fluffy --sample <samplesheet> --project <input_folder> --out <output_folder> --skip_preface rerun
Install FluFFyPipe
FluFFyPipe requires python 3, slurm, slurmpy, and singularity, python-coloredlogs.
fluffy may be installed using pip:
pip install fluffy-cg
alternatively, fluffy is cloned and installed from github: git clone https://github.com/Clinical-Genomics/fluffy cd fluffy pip install -e .
Next download the FluFFyPipe singularity container
singularity pull --name FluFFyPipe.sif shub://J35P312/FluFFyPipe
copy the example config (found in example_config), and edit the variables. You will need to download/create the following files:
Reference fasta (indexed using bwa)
WisecondorX reference files (created using the reference mode)
PREFACE model file (optional)
blacklist bed file (used by wisecondorX)
FluFFyPipe singularity collection (singularity pull --name FluFFyPipe.sif shub://J35P312/FluFFyPipe)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for cg_fluffy-0.8.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | a0ea21678a921ae2e2727e3df8295ff96dd63cfd2eb30a26682b4c9eaf527635 |
|
MD5 | 4f81794abd92e8a5cb217d0fe310f3b1 |
|
BLAKE2b-256 | 8ca3a8cd83af9ec05bf8e12bd8fe6bab34f35da1b0d8aabdaec753ec8974f46a |