Skip to main content

A small package to generate features from acoustic

Project description

B2AI Prep

A simple Python package to prepare acoustic data for the Bridge2AI voice project.

Caution: this package is still under development and may change rapidly over the next few weeks.

Installation

Requires a Python >= 3.10 environment

pip install b2aiprep`

Usage

Two commands are available through the CLI:

b2aiprep-cli --help
  1. Convert an audio file to features:

The simplest form takes an audio file, a subject id, and a task name.

b2aiprep-cli convert test_audio.wav s1 mpt

It will save a pytorch .pt file with a dictionary of features. This can be loaded by torch.load(). The file is named following a simple convention: sub-<subject_id>_task-<task_name>_md5-<checksum>_features.pt

  1. Verify if two audio files are from the same speaker
b2aiprep-cli test_audio1.wav test_audio2.wav --model 'speechbrain/spkrec-ecapa-voxceleb'

This will use the speechbrain speaker recognition model to verify that the two audio files are from the same speaker.

There is a notebook in the docs directory that can be used to interact with the library programmatically.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

b2aiprep-0.1.1.tar.gz (31.7 kB view hashes)

Uploaded Source

Built Distribution

b2aiprep-0.1.1-py3-none-any.whl (12.3 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page