Skip to main content

Common loaders for MIR datasets.

Project description

mirdata

common loaders for Music Information Retrieval (MIR) datasets. Find the API documentation here.

CircleCI codecov Documentation Status GitHub

This library provides tools for working with common MIR datasets, including tools for:

  • downloading datasets to a common location and format
  • validating that the files for a dataset are all present
  • loading annotation files to a common format, consistent with the format required by mir_eval
  • parsing track level metadata for detailed evaluations

Installation

To install, simply run:

pip install mirdata

Quick example

import mirdata

orchset = mirdata.initialize('orchset')
orchset.download()  # download the dataset
orchset.validate()  # validate that all the expected files are there

example_track = orchset.choice_track()  # choose a random example track
print(example_track)  # see the available data

See the documentation for more examples and the API reference.

Currently supported datasets

Supported datasets include AcousticBrainz, DALI, Guitarset, MAESTRO, TinySOL, among many others.

For the complete list of supported datasets, see the documentation

Citing

There are two ways of citing mirdata:

If you are using the library for your work, please cite the version you used as indexed at Zenodo:

DOI

If you refer to mirdata's design principles, motivation etc., please cite the following paper:

DOI

"mirdata: Software for Reproducible Usage of Datasets"
Rachel M. Bittner, Magdalena Fuentes, David Rubinstein, Andreas Jansson, Keunwoo Choi, and Thor Kell
in International Society for Music Information Retrieval (ISMIR) Conference, 2019
@inproceedings{
  bittner_fuentes_2019,
  title={mirdata: Software for Reproducible Usage of Datasets},
  author={Bittner, Rachel M and Fuentes, Magdalena and Rubinstein, David and Jansson, Andreas and Choi, Keunwoo and Kell, Thor},
  booktitle={International Society for Music Information Retrieval (ISMIR) Conference},
  year={2019}
}

When working with datasets, please cite the version of mirdata that you are using (given by the DOI above) AND include the reference of the dataset, which can be found in the respective dataset loader using the cite() method.

Contributing a new dataset loader

We welcome contributions to this library, especially new datasets. Please see contributing for guidelines.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mirdata-0.3.0.tar.gz (6.0 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mirdata-0.3.0-py3-none-any.whl (6.1 MB view details)

Uploaded Python 3

File details

Details for the file mirdata-0.3.0.tar.gz.

File metadata

  • Download URL: mirdata-0.3.0.tar.gz
  • Upload date:
  • Size: 6.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.6.1 requests/2.25.1 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.56.0 CPython/3.9.1

File hashes

Hashes for mirdata-0.3.0.tar.gz
Algorithm Hash digest
SHA256 2874241b47366d2bf4b3ad5340ec7db8c8df5df912b6034cb7d5bf413bb34cbc
MD5 41c92ca5a236d335edcfb5551c8b98f0
BLAKE2b-256 52d3f441dc6b9b083c254367ab3237f7f6e36df12dc8d77f741b05477a8e298f

See more details on using hashes here.

File details

Details for the file mirdata-0.3.0-py3-none-any.whl.

File metadata

  • Download URL: mirdata-0.3.0-py3-none-any.whl
  • Upload date:
  • Size: 6.1 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.6.1 requests/2.25.1 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.56.0 CPython/3.9.1

File hashes

Hashes for mirdata-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 4736b3a209422ab932846f3958eca5f89463e126009981d685d1c26c92ba53c8
MD5 ec3d5ba34c610fbeb1bc03e8bbbaa627
BLAKE2b-256 ad20989b7d3113920bea268d2de6912772faea08ae4009a92b7aa1ce080f8353

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page