A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

These details have not been verified by PyPI

Project links

Homepage

Project description

Audiomentations

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Setup

Python version support

pip install audiomentations

Usage example

from audiomentations import Compose, AddGaussianNoise, TimeStretch, PitchShift, Shift
import numpy as np

SAMPLE_RATE = 16000

augmenter = Compose([
    AddGaussianNoise(min_amplitude=0.001, max_amplitude=0.015, p=0.5),
    TimeStretch(min_rate=0.8, max_rate=1.25, p=0.5),
    PitchShift(min_semitones=-4, max_semitones=4, p=0.5),
    Shift(min_fraction=-0.5, max_fraction=0.5, p=0.5),
])

samples = np.zeros((20,), dtype=np.float32)
samples = augmenter(samples=samples, sample_rate=SAMPLE_RATE)

Go to audiomentations/augmentations/transforms.py to see which transforms you can apply.

Version history

v0.10.1 (2020-07-27)

Improve the performance of AddBackgroundNoise and AddShortNoises by optimizing the implementation of calculate_rms.
Improve compatibility of output files written by the demo script. Thanks to xwJohn.
Fix division by zero bug in Normalize. Thanks to ZFTurbo.

v0.10.0 (2020-05-05)

Breaking change: AddImpulseResponse, AddBackgroundNoise and AddShortNoises now include subfolders when searching for files. This is useful when your sound files are organized in subfolders.
AddImpulseResponse, AddBackgroundNoise and AddShortNoises now support aiff files in addition to flac, mp3, ogg and wav
Fix filter instability bug in FrequencyMask. Thanks to kvilouras.

v0.9.0 (2020-02-20)

Disregard non-audio files when looking for impulse response files
Remember randomized/chosen effect parameters. This allows for freezing the parameters and applying the same effect to multiple sounds. Use transform.freeze_parameters() and transform.unfreeze_parameters() for this.
Fix a bug in ClippingDistortion where the min_percentile_threshold was not respected as expected.
Implement transform.serialize_parameters(). Useful for when you want to store metadata on how a sound was perturbed.
Switch to a faster convolve implementation. This makes AddImpulseResponse significantly faster.
Add a rollover parameter to Shift. This allows for introducing silence instead of a wrapped part of the sound.
Expand supported range of librosa versions
Add support for flac in AddImpulseResponse
Implement AddBackgroundNoise transform. Useful for when you want to add background noise to all of your sound. You need to give it a folder of background noises to choose from.
Implement AddShortNoises. Useful for when you want to add (bursts of) short noise sounds to your input audio.
Improve handling of empty input

v0.8.0 (2020-01-28)

Add shuffle parameter in Composer
Add Resample transformation
Add ClippingDistortion transformation
Add SmoothFadeTimeMask as alternative to TimeMask

Thanks to askskro

v0.7.0 (2020-01-14)

Add new transforms:

AddImpulseResponse
FrequencyMask
TimeMask
AddGaussianSNR

Thanks to karpnv

v0.6.0 (2019-05-27)

Implement peak normalization

v0.5.0 (2019-02-23)

Implement Shift transform
Ensure p is within bounds

v0.4.0 (2019-02-19)

Implement PitchShift transform
Fix output dtype of AddGaussianNoise

v0.3.0 (2019-02-19)

Implement leave_length_unchanged in TimeStretch

v0.2.0 (2019-02-18)

Add TimeStretch transform
Parametrize AddGaussianNoise

v0.1.0 (2019-02-15)

Initial release. Includes only one transform: AddGaussianNoise

Development

Install the dependencies specified in requirements.txt

Code style

Format the code with black

Run tests and measure code coverage

pytest

Generate demo sounds for empirical evaluation

python -m demo.demo

Alternatives

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.43.1

Sep 13, 2025

0.43.0

Sep 9, 2025

0.42.0

Jul 4, 2025

0.41.0

May 5, 2025

0.40.0

Mar 20, 2025

0.39.0

Feb 12, 2025

0.38.0

Dec 6, 2024

0.37.0

Sep 3, 2024

0.36.1

Aug 20, 2024

0.36.0

Jun 10, 2024

0.35.0

Mar 15, 2024

0.34.1

Nov 24, 2023

0.33.0

Aug 30, 2023

0.32.0

Aug 15, 2023

0.31.0

Jun 21, 2023

0.30.0

May 2, 2023

0.29.0

Mar 15, 2023

0.28.0

Jan 12, 2023

0.27.0

Sep 13, 2022

0.26.0

Aug 19, 2022

0.25.1

Jun 15, 2022

0.25.0

May 30, 2022

0.24.0

Mar 18, 2022

0.23.0

Mar 7, 2022

0.22.0

Feb 18, 2022

0.21.0

Feb 10, 2022

0.20.0

Nov 18, 2021

0.19.0

Oct 18, 2021

0.18.0

Aug 5, 2021

0.17.0

Jun 25, 2021

0.16.0

Feb 11, 2021

0.15.0

Dec 10, 2020

0.14.0

Dec 6, 2020

0.13.0

Nov 10, 2020

0.12.1

Sep 28, 2020

0.12.0

Sep 23, 2020

0.11.0

Aug 27, 2020

This version

0.10.1

Jul 27, 2020

0.10.0

May 5, 2020

0.9.0

Feb 20, 2020

0.8.0

Jan 28, 2020

0.7.0

Jun 14, 2019

0.6.0

May 27, 2019

0.5.0

Feb 23, 2019

0.4.0

Feb 19, 2019

0.3.0

Feb 19, 2019

0.2.0

Feb 18, 2019

0.1.0

Feb 15, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

audiomentations-0.10.1.tar.gz (12.9 kB view details)

Uploaded Jul 27, 2020 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

audiomentations-0.10.1-py3-none-any.whl (13.2 kB view details)

Uploaded Jul 27, 2020 Python 3

File details

Details for the file audiomentations-0.10.1.tar.gz.

File metadata

Download URL: audiomentations-0.10.1.tar.gz
Upload date: Jul 27, 2020
Size: 12.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.2.0.post20200714 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.7.3

File hashes

Hashes for audiomentations-0.10.1.tar.gz
Algorithm	Hash digest
SHA256	`73b0b73aadee0f8ec0ec228a83cb9d79166d7e5fcf74493bcf3be0089e6cd013`
MD5	`987ecbda075d82d5cabe9dcf8a1629a8`
BLAKE2b-256	`135267b15061bed95409c163bf27326d6c1740be0d1026694e3e77b00bfa0fd8`

See more details on using hashes here.

File details

Details for the file audiomentations-0.10.1-py3-none-any.whl.

File metadata

Download URL: audiomentations-0.10.1-py3-none-any.whl
Upload date: Jul 27, 2020
Size: 13.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.2.0.post20200714 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.7.3

File hashes

Hashes for audiomentations-0.10.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`623a4dffe92ebd8bd07384f50d10a20cd0db79e1a5230875c538b7bb2dcf4ea5`
MD5	`229d8215deabea99623b0d4b851400d4`
BLAKE2b-256	`312edc06d6fe0dedc561fbfe2f6ed3c3c68ac760db6a8daee05806515478234c`

See more details on using hashes here.

audiomentations 0.10.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Audiomentations

Setup

Usage example

Version history

v0.10.1 (2020-07-27)

v0.10.0 (2020-05-05)

v0.9.0 (2020-02-20)

v0.8.0 (2020-01-28)

v0.7.0 (2020-01-14)

v0.6.0 (2019-05-27)

v0.5.0 (2019-02-23)

v0.4.0 (2019-02-19)

v0.3.0 (2019-02-19)

v0.2.0 (2019-02-18)

v0.1.0 (2019-02-15)

Development

Code style

Run tests and measure code coverage

Generate demo sounds for empirical evaluation

Alternatives

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes