Skip to main content

Document for pMTnet Omni

Project description

pMTnet Omni: pan-MHC and cross-Species Prediction of T Cell Receptor-Antigen Binding :microscope:

Logo

forthebadge forthebadge

Package Documentation Code Coverage
pMTnet Omni codecov
pMTnet Omni Document Documentation Status codecov

pMTnet Omni is a deep learning algorithm for affinity prediction based on TCR Va, Vb, CDR3a, CDR3b sequences, peptide sequence, and MHC allele types. The predictions can be made for human and mouse alleles, and for both CD8 T cells/MHC class I and CD4 T cells/MHC class II.

Please refer to our paper for more details: pMTnet Omni paper link here

We host the online tool on DBAI, where you can find all the members of the pMTnet family, including pMTnet V1.

We have also built a detailed online documentation where we guide you step-by-step on how to format your data so it can be accpted by our algorithm.

NOTE: This is the documentation for the data curation supporting tool for pMTnet Omni. Use this BEFORE you upload your dataset to DBAI.

Model Overview

Model Overview

Dependencies

  • numpy==1.22.4
  • pandas==1.5.2
  • tqdm==4.64.1
  • torch==1.13.1
  • fair-esm==2.0.0

Enviroment Setup

conda env create -f pMTnet_Omni_Document_env.yml

Installation

conda activate pMTnet_Omni_Document
pip install pMTnet_Omni_Document

Quick Start Guide

  1. Prepare your dataset so that it looks somewhat like the following: Sample df Along with the main program, we also published 5 datasets under the ./validation_data folder. Feel free to use those datasets to check if you TCR namings, Amino Acid sequences, and MHC namings conform with our standard.

NOTE: When both TCR names (resp. MHC) and the TCR sequences (resp. MHC sequences) are provided, we will disregard the sequences. If the names can NOT be found in our reference database, the record WILL be dropped.

NOTE: On the other hand, if the names are NOT provided, we will use the sequences with minimal curation.

  1. Say your dataset is under ./df.csv. In your terminal, run
conda activate pMTnet_Omni_Document

python -m pMTnet_Omni_Document --file_path ./df.csv --output_folder_path ./
  1. Go to our website and upload your data including the .json file.

  2. An example output would look like this: Sample output

For a more in-depth explanation on input format, check out our online documentation.

CITATION

We have uploaded our article to bioRxiv. To cite

@article {Han2023.12.01.569599,
	author = {Yi Han and Yuqiu Yang and Yanhua Tian and Farjana J. Fattah and Mitchell S. von Itzstein and Minying Zhang and Xiongbin Kang and Donghan M. Yang and Jialiang Liu and Yaming Xue and Chaoying Liang and Indu Raman and Chengsong Zhu and Olivia Xiao and Yifei Hu and Jonathan E. Dowell and Jade Homsi and Sawsan Rashdan and Shengjie Yang and Mary E. Gwin and David Hsiehchen and Yvonne Gloria-McCutchen and Ke Pan and Fangjiang Wu and Don Gibbons and Xinlei Wang and Cassian Yee and Junzhou Huang and Alexandre Reuben and Chao Cheng and Jianjun Zhang and David E. Gerber and Tao Wang},
	title = {pan-MHC and cross-Species Prediction of T Cell Receptor-Antigen Binding},
	elocation-id = {2023.12.01.569599},
	year = {2023},
	doi = {10.1101/2023.12.01.569599},
	publisher = {Cold Spring Harbor Laboratory},
	URL = {https://www.biorxiv.org/content/early/2023/12/12/2023.12.01.569599},
	eprint = {https://www.biorxiv.org/content/early/2023/12/12/2023.12.01.569599.full.pdf},
	journal = {bioRxiv}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pMTnet_Omni_Document-0.0.16.tar.gz (86.8 kB view hashes)

Uploaded Source

Built Distribution

pMTnet_Omni_Document-0.0.16-py3-none-any.whl (85.4 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page