BiGAnts - a package for network-constrained biclustering of omics data

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

BiGAnts: network-constrained biclustering of patients and multi-omics data

PyPI package for conjoint clustering of networks and omics data

An application example is given in the file script_main.py in the project's GitHub.

To install the package please run: pip install bigants

Data input

The algorithm needs as an input one CSV matrix with gene expression/methylation/any other numerical data and one TSV file with a network.

Numerical data

Numerical data is accepted in the following format:

genes as rows.
patients as columns.
first column - genes IDs (can be any IDs).

For instance:

	Unnamed: 0	GSM748056	GSM748059	...	GSM748278	GSM748279	GSM1465989
0	1454	0.053769	0.117412	...	-0.392363	-1.870838	-1.432554
1	201931	-0.618279	0.278637	...	0.803541	-0.514947	2.361925
2	8761	0.215820	-0.343865	...	0.700430	0.073281	-0.977656
3	2703	-0.504701	1.295049	...	1.861972	0.601808	0.191013
4	26207	-0.626415	-0.646977	...	2.331724	2.339122	-0.100924

There are 2 examples of gene expression datasets that can be placed in the "data" folder

GSE30219 - a Non-Small Cell Lung Cancer dataset from GEO for patients with either adenocarcinoma or squamous cell carcinoma.
TCGA pan-cancer dataset with patients that have luminal or basal breast cancer. Both can be found here

Network

An interaction network should be present as a TSV table with two columns that represent two interacting genes. Without a header!

For instance:

	6416	2318
0	6416	5371
1	6416	351
2	6416	409
3	6416	5932
4	6416	1956

In the data folder (on the GitHub page of the project) there is an example of a PPI network from Bioigrid with experimentally validated interactions.

Functions

bigants.data_preprocessing(path_expr, path_net, log2 = False, size = 2000)

Parameters:

path_to_expr: string, path to the numerical data
path_to_net: string, path to the network file
log2: bool, (default = False), indicates if log2 transformation should be applied to the data
size: int, optional (default = 2000) determines the number of genes that should be pre-selected by variance for the analysis. Shouldn't be higher than 5000.

Returns:

GE: pandas data frame, processed expression data
G: networkX graph, processed network data
labels: dict, for mapping between real genes/patients IDs and the internal ones
rev_labels: dict, additional dictionary for mapping between real genes/patients IDs and the internal ones

bigants.BiGAnts(GE,G,L_g_min,L_g_max) creates a model for the given data:

Parameters:

GE: pandas dataframe, processed expression data
G: networkX graph, processed network data
L_g_min: int, minimal solution subnetwork size
L_g_max: int, maximal solution subnetwork size

Methods:

bigants.BiGAnts.run(self, n_proc = 1, K = 20, evaporation = 0.5, show_plot = False)

K: int, default = 20, number of ants. Fewer ants - less space exploration. Usually set between 20 and 50
n_proc: int, default = 1, number of processes that should be used
evaporation, float, default = 0.5, the rate at which pheromone evaporates
show_plot: bool, default = False, set true if convergence plots should be during the analysis

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

2.2.2

Jan 21, 2020

2.2.1

Dec 19, 2019

2.2.0

Dec 10, 2019

0.2.1

Dec 10, 2019

0.1.14

Nov 28, 2019

0.1.13

Nov 28, 2019

0.1.12

Nov 25, 2019

0.1.11

Nov 10, 2019

0.1.10

Nov 5, 2019

0.1.9

Nov 5, 2019

0.1.8

Nov 4, 2019

0.1.6

Nov 4, 2019

0.1.4

Nov 4, 2019

0.1.3

Nov 4, 2019

0.1.2

Oct 31, 2019

0.1.1

Oct 31, 2019

0.1.0

Oct 28, 2019

0.0.11

Oct 28, 2019

0.0.10

Oct 28, 2019

This version

0.0.9

Oct 28, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bigants-0.0.9.tar.gz (12.6 kB view hashes)

Uploaded Oct 28, 2019 Source

Built Distribution

bigants-0.0.9-py3.7.egg (21.9 kB view hashes)

Uploaded Oct 28, 2019 Source

Hashes for bigants-0.0.9.tar.gz

Hashes for bigants-0.0.9.tar.gz
Algorithm	Hash digest
SHA256	`a272363c2f002961b3c8f6bd49fe3710343ba554a3cc09f59feb2372381416bf`
MD5	`b9a1a1c4504029f4c5613a0f7bcbdd55`
BLAKE2b-256	`45d41b9895a6c5ed7940ee2cb07d0633858013232219d17444ef784fe1db118a`

Hashes for bigants-0.0.9-py3.7.egg

Hashes for bigants-0.0.9-py3.7.egg
Algorithm	Hash digest
SHA256	`76df92f87020305669ddc178d05c194eddb7387e287634606d5a3f092cf83e6a`
MD5	`8f43cae52835d2410cd3ecd2e7d62fa5`
BLAKE2b-256	`a1c16214be5529c7e8ab06ea2e38c7e2f732f588b4ce3cfbac28c68f0d3649c0`