ImagingDataCommons index to query and download data.
Project description
idc-index-data
About
idc-index-data is a Python package providing index files to query and download
data hosted by the
NCI Imaging Data Commons (IDC).
The PyPI package bundles a core set of index files (Parquet, JSON schemas, SQL queries). Supplementary indices that are too large for PyPI distribution are published as release artifacts on GitHub and uploaded to a public Google Cloud Storage bucket on each release.
Index files
The package provides metadata for the following indices via the INDEX_METADATA
dictionary:
| Index | In PyPI package | Description |
|---|---|---|
idc_index |
yes | Core IDC DICOM study-level index |
prior_versions_index |
yes | Historical version tracking |
collections_index |
yes | Collection-level metadata |
analysis_results_index |
yes | Analysis results metadata |
clinical_index |
- | Clinical data (large) |
sm_index |
- | Slide microscopy index (large) |
sm_instance_index |
- | Slide microscopy instance-level index (large) |
seg_index |
- | Segmentation index |
ann_index |
- | Annotation index |
ann_group_index |
- | Annotation group index |
contrast_index |
- | Contrast agent index |
Additionally, a gdc_idc_mapping.parquet file mapping IDC patients to the
Genomic Data Commons (GDC) is generated and published
alongside the index files (not included in the PyPI package).
All index files (including supplementary ones) are available from:
- GitHub Releases: attached as release assets
- Google Cloud Storage:
gs://idc-index-data-artifacts/<version>/(publicly readable, e.g.https://storage.googleapis.com/idc-index-data-artifacts/23.5.0/idc_index.parquet)
Usage
This package is intended to be used by the idc-index Python package.
import idc_index_data
# Access core index file paths
idc_index_data.IDC_INDEX_PARQUET_FILEPATH
idc_index_data.PRIOR_VERSIONS_INDEX_PARQUET_FILEPATH
# Access unified metadata for all indices
idc_index_data.INDEX_METADATA["idc_index"]["parquet_filepath"]
idc_index_data.INDEX_METADATA["idc_index"]["schema"] # pre-loaded dict
idc_index_data.INDEX_METADATA["idc_index"]["sql"] # pre-loaded string
Acknowledgment
This software is maintained by the IDC team, which has been funded in whole or in part with Federal funds from the NCI, NIH, under task order no. HHSN26110071 under contract no. HHSN261201500003l.
If this package helped your research, we would appreciate if you could cite IDC paper below.
Fedorov, A., Longabaugh, W. J. R., Pot, D., Clunie, D. A., Pieper, S. D., Gibbs, D. L., Bridge, C., Herrmann, M. D., Homeyer, A., Lewis, R., Aerts, H. J. W., Krishnaswamy, D., Thiriveedhi, V. K., Ciausu, C., Schacherer, D. P., Bontempi, D., Pihl, T., Wagner, U., Farahani, K., Kim, E. & Kikinis, R. National Cancer Institute Imaging Data Commons: Toward Transparency, Reproducibility, and Scalability in Imaging Artificial Intelligence. RadioGraphics (2023). https://doi.org/10.1148/rg.230180
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file idc_index_data-23.8.0.tar.gz.
File metadata
- Download URL: idc_index_data-23.8.0.tar.gz
- Upload date:
- Size: 85.5 MB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5da1a24862603c5a2a79e83be9f2a37a2dd3fbc82754d1b296bfae4c2756cb5e
|
|
| MD5 |
ff25722b0b0345c408699623ed2301ad
|
|
| BLAKE2b-256 |
d1986d7d2dbcec527a49f4e3672a92337947aa6c36b4469424160e5f0a98c9c6
|
Provenance
The following attestation bundles were made for idc_index_data-23.8.0.tar.gz:
Publisher:
cd.yml on ImagingDataCommons/idc-index-data
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
idc_index_data-23.8.0.tar.gz -
Subject digest:
5da1a24862603c5a2a79e83be9f2a37a2dd3fbc82754d1b296bfae4c2756cb5e - Sigstore transparency entry: 1058145639
- Sigstore integration time:
-
Permalink:
ImagingDataCommons/idc-index-data@31052373b857aeb6a58c7c917b1caeffa51810c7 -
Branch / Tag:
refs/tags/23.8.0 - Owner: https://github.com/ImagingDataCommons
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
cd.yml@31052373b857aeb6a58c7c917b1caeffa51810c7 -
Trigger Event:
release
-
Statement type:
File details
Details for the file idc_index_data-23.8.0-py3-none-any.whl.
File metadata
- Download URL: idc_index_data-23.8.0-py3-none-any.whl
- Upload date:
- Size: 85.5 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3f4c5041ffbef90ff2b58dec8d624d721735bb0978fc46835f9fcbeff4b0667e
|
|
| MD5 |
63c7dd247c22b0957cbf23cd69b9d284
|
|
| BLAKE2b-256 |
4eae24f0709e7ebf4cf34498c2033b40ee432a30eb99b94867f07cc81f5def65
|
Provenance
The following attestation bundles were made for idc_index_data-23.8.0-py3-none-any.whl:
Publisher:
cd.yml on ImagingDataCommons/idc-index-data
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
idc_index_data-23.8.0-py3-none-any.whl -
Subject digest:
3f4c5041ffbef90ff2b58dec8d624d721735bb0978fc46835f9fcbeff4b0667e - Sigstore transparency entry: 1058145656
- Sigstore integration time:
-
Permalink:
ImagingDataCommons/idc-index-data@31052373b857aeb6a58c7c917b1caeffa51810c7 -
Branch / Tag:
refs/tags/23.8.0 - Owner: https://github.com/ImagingDataCommons
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
cd.yml@31052373b857aeb6a58c7c917b1caeffa51810c7 -
Trigger Event:
release
-
Statement type: