No project description provided
Project description
Biocarta
Creating Cartographic Representations of Biological Data
Installation
pip install biocarta
Example code
if __name__ == '__main__' :
from biocarta.quantification import full_mapping
#
adf = pd.read_csv('analytes.tsv',sep='\t',index_col=0)
#
# WE DO NOT WANT TO KEEP POTENTIALLY BAD ENTRIES
adf = adf.iloc[ np.inf != np.abs( 1.0/np.std(adf.values,1) ) ,
np.inf != np.abs( 1.0/np.std(adf.values,0) ) ].copy()
#
# READING IN SAMPLE INFORMATION
# THIS IS NEEDED FOR THE ALIGNED PCA TO WORK
jdf = pd.read_csv('journal.tsv',sep='\t',index_col=0)
jdf = jdf.loc[:,adf.columns.values]
#
alignment_label , sample_label = 'Disease' , None
add_labels = ['Cell-line']
#
cmd = 'max'
# WRITE FILES AND MAKE NOISE
bVerbose = True
# CREATE AN OPTIMIZED REPRESENTATION
bExtreme = True
# WE MIGHT WANT SOME SPECIFIC INTERSECTIONS OF THE HIERARCHY
n_clusters = [20,40,60,80,100]
# USE ALL INFORMATION
n_components = None
umap_dimension = 2
n_neighbors = 20
local_connectivity = 20.
transform_seed = 42
#
print ( adf , jdf )
#
# distance_type = 'correlation,spearman,absolute' # DONT USE THIS
distance_type = 'covariation' # BECOMES CO-EXPRESSION BASED
#
results = full_mapping ( adf , jdf ,
bVerbose = bVerbose ,
bExtreme = bExtreme ,
n_clusters = n_clusters ,
n_components = n_components ,
distance_type = distance_type ,
umap_dimension = umap_dimension ,
umap_n_neighbors = n_neighbors ,
umap_local_connectivity = local_connectivity ,
umap_seed = transform_seed ,
hierarchy_cmd = cmd ,
add_labels = add_labels ,
alignment_label = alignment_label ,
sample_label = None )
#
map_analytes = results[0]
map_samples = results[1]
hierarchy_analytes = results[2]
hierarchy_samples = results[3]
or just call it using the default values:
import pandas as pd
import numpy as np
if __name__ == '__main__' :
from biocarta.quantification import full_mapping
#
adf = pd.read_csv('analytes.tsv',sep='\t',index_col=0)
#
adf = adf.iloc[ np.inf != np.abs( 1.0/np.std(adf.values,1) ) ,
np.inf != np.abs( 1.0/np.std(adf.values,0) ) ].copy()
jdf = pd.read_csv('journal.tsv',sep='\t',index_col=0)
jdf = jdf.loc[:,adf.columns.values]
#
alignment_label , sample_label = 'Disease' , None
add_labels = ['Cell-line']
#
results = full_mapping ( adf , jdf ,
bVerbose = True ,
n_clusters = [40,80,120] ,
add_labels = add_labels ,
alignment_label = alignment_label )
#
map_analytes = results[0]
map_samples = results[1]
hierarchy_analytes = results[2]
hierarchy_samples = results[3]
and plotting the information of the map analytes yields : Cancer Disease Example
You can also run an alternative algorithm where the UMAP coordinates are employed directly for clustering by setting
results = full_mapping ( adf , jdf ,
bVerbose = True ,
bUseUmap = True ,
n_clusters = [40,80,120] ,
add_labels = add_labels ,
alignment_label = alignment_label )
with the following results.
Download the zip and open the html index:
chromium index.html
Other generated solutions
The clustering visualisations were created using the Biocarta and hvplot :
What groupings corresponds to biomarker variance that describe them? Here are two visualisations of that:
Diseases : cancers
Tissues : tissues
Single Cells: single cells
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file biocarta-0.2.20.tar.gz.
File metadata
- Download URL: biocarta-0.2.20.tar.gz
- Upload date:
- Size: 15.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.8.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1454589020cc581e190e6b257d4b54108cd77a5b4f5cdae507da29ffce8b2a46
|
|
| MD5 |
7cd98464d93c4b92d7a0a5a93e339633
|
|
| BLAKE2b-256 |
4d7c623969ed11abc12fed1ad91f30155ab23dbef39167203ffcfa8664697750
|
File details
Details for the file biocarta-0.2.20-py3-none-any.whl.
File metadata
- Download URL: biocarta-0.2.20-py3-none-any.whl
- Upload date:
- Size: 17.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.8.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
819daa31d4a80007b98783babf7bc541c3749c97f62ef1b384366309f8aed357
|
|
| MD5 |
792ffa490217974c3a6d470c7bb0928f
|
|
| BLAKE2b-256 |
0e66e935d04683b1cf28b91f632c05f10a739df792d2606cb4d7a58d6a0ad6db
|