A small package to generate features from acoustic data

Project description

B2AI Prep

A simple Python package to prepare acoustic data for the Bridge2AI voice project.

Caution: this package is still under development and may change rapidly over the next few weeks.

Installation

Requires a Python >= 3.10 environment

pip install b2aiprep

Usage

The following commands are available through the CLI; list them with:

b2aiprep-cli --help

Convert an audio file to features:

The simplest form takes an audio file, a subject id, and a task name.

b2aiprep-cli convert test_audio.wav s1 mpt

This saves a PyTorch .pt file containing a dictionary of features, which can be loaded with torch.load(). The file name follows a simple convention: sub-<subject_id>_task-<task_name>_md5-<checksum>_features.pt
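The naming convention above can be parsed back into its parts, which helps when collecting many feature files. The following is a minimal sketch, not part of b2aiprep: the helper name parse_feature_filename and the FeatureFileInfo tuple are hypothetical, and it assumes the checksum is a hexadecimal string and that the subject id and task name do not themselves contain "_task-" or "_md5-".

```python
import re
from typing import NamedTuple

# Pattern for: sub-<subject_id>_task-<task_name>_md5-<checksum>_features.pt
# Assumption: the checksum is written as hexadecimal digits.
_FEATURE_NAME = re.compile(
    r"^sub-(?P<subject_id>.+?)"
    r"_task-(?P<task_name>.+?)"
    r"_md5-(?P<checksum>[0-9a-fA-F]+)"
    r"_features\.pt$"
)


class FeatureFileInfo(NamedTuple):
    """Fields recovered from a feature file name (hypothetical helper type)."""

    subject_id: str
    task_name: str
    checksum: str


def parse_feature_filename(name: str) -> FeatureFileInfo:
    """Split a feature file name into subject id, task name, and checksum."""
    match = _FEATURE_NAME.match(name)
    if match is None:
        raise ValueError(f"not a feature file name: {name!r}")
    return FeatureFileInfo(**match.groupdict())
```

Once a file has been identified this way, its contents can be read with torch.load(path), which returns the saved dictionary of features.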

Batch process audio files

This requires a CSV file, where each line is of the form: path/to/audio.wav,subject_id,task_name

b2aiprep-cli batchconvert filelist.csv --plugin cf n_procs=2 --outdir out

The above command uses pydra under the hood to process the audio files in parallel. All outputs are currently stored in a single directory, specified by the --outdir flag.
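The CSV file expected by batchconvert can be generated with a short script. This is a minimal sketch using only the standard library: the function name write_filelist is hypothetical, and it assumes the file has no header row, since each line is described as path/to/audio.wav,subject_id,task_name.

```python
import csv
from pathlib import Path


def write_filelist(rows, csv_path):
    """Write (audio_path, subject_id, task_name) tuples, one per CSV line.

    Assumption: no header row is written, matching the line format
    path/to/audio.wav,subject_id,task_name described above.
    """
    csv_path = Path(csv_path)
    with csv_path.open("w", newline="") as handle:
        writer = csv.writer(handle, lineterminator="\n")
        for audio_path, subject_id, task_name in rows:
            writer.writerow([str(audio_path), subject_id, task_name])
    return csv_path
```

For example, write_filelist([("test_audio.wav", "s1", "mpt")], "filelist.csv") produces a one-line file that can be passed to the batchconvert command.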

Verify if two audio files are from the same speaker

b2aiprep-cli test_audio1.wav test_audio2.wav --model 'speechbrain/spkrec-ecapa-voxceleb'

This will use the speechbrain speaker recognition model to verify that the two audio files are from the same speaker.

There is a notebook in the docs directory that can be used to interact with the library programmatically.

Release history

0.14.2 (May 9, 2024)
0.14.1 (Apr 23, 2024)
0.14.0 (Apr 19, 2024)
0.13.0 (Apr 18, 2024)
0.12.1 (Apr 18, 2024)
0.12.0 (Apr 17, 2024)
0.11.0 (Apr 12, 2024)
0.10.0 (Apr 10, 2024)
0.9.0 (Apr 8, 2024)
0.8.1 (Apr 8, 2024)
0.8.0 (Apr 1, 2024)
0.7.1 (Mar 29, 2024)
0.7.0 (Mar 28, 2024)
0.5.0 (Mar 28, 2024)
0.4.0 (Mar 27, 2024)
0.3.0 (Mar 18, 2024, this version)
0.2.0 (Mar 15, 2024)
0.1.2 (Mar 15, 2024)
0.1.1 (Mar 15, 2024)
0.1.0 (Mar 15, 2024)
0.0.1 (Mar 14, 2024)

Download files

Source Distribution

b2aiprep-0.3.0.tar.gz (32.6 kB)

Uploaded Mar 18, 2024 Source

Built Distribution

b2aiprep-0.3.0-py3-none-any.whl (13.2 kB)

Uploaded Mar 18, 2024 Python 3

Hashes for b2aiprep-0.3.0.tar.gz
Algorithm	Hash digest
SHA256	`459f019213de4060b5f37f21c83d006c69b973e952e9d26b524ff9f67a431b3c`
MD5	`29ea2e2f8141224980ae4d11d0bd5572`
BLAKE2b-256	`9392cc71ac420c4b079b888df6cd6761bca388b46af007402172a7ffc7d84ff6`

Hashes for b2aiprep-0.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1f56978eb88b55bb5ec0fb7a6432410705c12ccdae8c7e67fbe5c854af200dc8`
MD5	`8361301ce84c5ffe0ba93ad2a827df10`
BLAKE2b-256	`5ecc4396e2c6d0766f809d7b7f3df7fbeea95cb278ea78a9b74233172f9b4cd3`