Skip to main content

torch-adata

Project description

torch-adata

Create pytorch Datasets from AnnData

Example use of the base class

The base class, AnnDataset is a subclass of the widely-used torch.utils.data.Dataset. The outputs of all AnnDataset classes and subclasses are designed to be directly compatible with the torch.utils.data.DataLoader module.

import anndata as a
import torch_adata as ta

adata = a.read_h5ad("/path/to/data.h5ad")
dataset = ta.AnnDataset(adata)

Returns data (X as a torch.Tensor) and the pandas.DataFrame; adata.obs.

# create a dummy index
idx = np.random.choice(range(dataset.__len__()), 5)
X, obs = dataset.__getitem__(idx)

Specialized classes

GroupedAnnDataset

A subclass of the base class, AnnDataset.

import anndata as a
import torch_adata as ta

adata = a.read_h5ad("/path/to/data.h5ad")
dataset = ta.GroupedAnnDataset(adata, groupby="batch")

Returns data as a dictionary of data with values as torch.Tensor and keys as each groupby category and the sampled adata.obs is again returned as a pandas.DataFrame.

# create a dummy index
idx = np.random.choice(range(dataset.__len__()), 5)
X_dict, obs = dataset.__getitem__(idx)

TimeResolvedAnnDataset

A subclass of the class, GroupedAnnDataset.

import anndata as a
import torch_adata as ta

adata = a.read_h5ad("/path/to/data.h5ad")
dataset = ta.TimeResolvedAnnDataset(adata, time_key="Time point")

Returns the initial datapoint, X0 as a torch.Tensor, the entire sample of the dataset as a dictionary of data with values as torch.Tensor and keys as each timepoint indicated by the time_key. Sampled adata.obs is again returned as a pandas.DataFrame.

# create a dummy index
idx = np.random.choice(range(dataset.__len__()), 5)
X0, X_dict, t, obs = dataset.__getitem__(idx)

Installation

Install from PYPI:

pip install torch-adata

Install the developer version:

git clone https://github.com/mvinyard/torch-adata.git; cd torch-adata;
pip install -e .

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

torch-adata-0.0.1.tar.gz (3.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

torch_adata-0.0.1-py3-none-any.whl (5.1 kB view details)

Uploaded Python 3

File details

Details for the file torch-adata-0.0.1.tar.gz.

File metadata

  • Download URL: torch-adata-0.0.1.tar.gz
  • Upload date:
  • Size: 3.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.10.5

File hashes

Hashes for torch-adata-0.0.1.tar.gz
Algorithm Hash digest
SHA256 6952644c38f990f9840ead2193e71b0c360caf992acb5a7986719ceac07d5adc
MD5 9aa29b7af4e17cc759d34059ea12237d
BLAKE2b-256 ce4d38d675c464c46d51a98c51686bcc9893d5bffbcb105c8c2955fd7dc09592

See more details on using hashes here.

File details

Details for the file torch_adata-0.0.1-py3-none-any.whl.

File metadata

  • Download URL: torch_adata-0.0.1-py3-none-any.whl
  • Upload date:
  • Size: 5.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.10.5

File hashes

Hashes for torch_adata-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 82722f9fdc6970d7eabebd5f70a1190af1253ad7ff69932f6ad74cf9e44698e4
MD5 79cf3bd2dca3ec2b2a54816bbee5a7c3
BLAKE2b-256 f783633b700543471012eae81a20d8d557532c86546fa26fb4c007bc9bc0b3f1

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page