Tool to introduce controlled degradations to audio

Project description

audio_degrader

Version: 1.2.3 (the version described here; the latest release is 1.3.1)

An audio degradation toolbox in Python, with a command-line tool. It applies controlled degradations to audio files.

Installation

pip install audio_degrader

The program depends on sox, ffmpeg and rubberband, so you may need to install them as well. Homebrew (brew) is recommended on macOS and apt-get on Linux; on Linux, the rubberband package is named rubberband-cli.

Usage of python package

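import audio_degrader as ad

# Open the input file; the second argument is the directory for temporary files
audio_file = ad.AudioFile('input.wav', './tmp_dir')

# Print the usage help of every available degradation
for d in ad.ALL_DEGRADATIONS.values():
    print(ad.DegradationUsageDocGenerator.get_degradation_help(d))

# Parse a chain of degradations given as 'name,param1,param2,...' strings
degradations = ad.ParametersParser.parse_degradations_args([
    'normalize',
    'gain,6',
    'dr_compression,3',
    'equalize,500,10,30'])

# Apply the degradations in order
for d in degradations:
    audio_file.apply_degradation(d)

# Write the result and remove the temporary files
audio_file.to_wav('output.wav')
audio_file.delete_tmp_files()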
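
The degradation strings passed to parse_degradations_args above follow a 'name,param1,param2' pattern. The following is a minimal sketch of a helper that builds such strings from Python values. degradation_arg is a hypothetical name, not part of audio_degrader, and it does not check that the name or the number of parameters is valid.

```python
def degradation_arg(name, *params):
    """Join a degradation name and its parameters into 'name,p1,p2'.

    Hypothetical helper for illustration; not part of audio_degrader.
    """
    parts = [name] + [str(p) for p in params]
    return ",".join(parts)


# Rebuild the chain used in the example above
chain = [
    degradation_arg("normalize"),
    degradation_arg("gain", 6),
    degradation_arg("dr_compression", 3),
    degradation_arg("equalize", 500, 10, 30),
]
print(chain)
```

The resulting list has the same form as the one passed to ad.ParametersParser.parse_degradations_args in the example above.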

Usage of command-line tool

The script audio_degrader is installed along with the python package.

# e.g. mix with restaurant08.wav at SNR = 10 dB, then amplify by 6 dB, then compress the dynamic range
$ audio_degrader -i input.mp3 -d mix,https://github.com/hagenw/audio-degradation-toolbox/raw/master/AudioDegradationToolbox/degradationData/PubSounds/restaurant08.wav,10 gain,6 dr_compression,3 -o out.wav

# for more details:
$ audio_degrader --help
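
The gain and SNR values in the command above are given in decibels. As a rough illustration of what those numbers mean, the sketch below converts a gain in dB to a linear amplitude factor and computes how much a noise signal would need to be scaled to reach a target SNR. It assumes the conventional RMS-based definition of SNR; whether audio_degrader computes its mix in exactly this way is an assumption, and all function names here are hypothetical.

```python
import math


def db_to_amplitude(gain_db):
    """Convert a gain in dB to a linear amplitude factor."""
    return 10.0 ** (gain_db / 20.0)


def rms(samples):
    """Root mean square of a non-empty sequence of samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def noise_scale_for_snr(signal, noise, snr_db):
    """Factor to apply to the noise so that RMS(signal) / RMS(scaled noise)
    corresponds to snr_db. Assumes the noise is not silent (RMS > 0)."""
    return rms(signal) / (rms(noise) * db_to_amplitude(snr_db))


# Toy signals standing in for real audio
signal = [0.5, -0.5, 0.5, -0.5]
noise = [0.1, -0.1, 0.1, -0.1]

scale = noise_scale_for_snr(signal, noise, 10)
scaled_noise = [scale * n for n in noise]
mixed = [s + n for s, n in zip(signal, scaled_noise)]

achieved_snr_db = 20.0 * math.log10(rms(signal) / rms(scaled_noise))
print(db_to_amplitude(6), achieved_snr_db)
```

Under these definitions, a gain of 6 dB roughly doubles the amplitude, and a lower SNR value means louder noise relative to the input.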

A small set of sounds and impulse responses is installed along with the script. They can be listed with:

$ audio_degrader -l

# these relative paths can be used directly in the script too:
$ audio_degrader -i input.mp3 -d mix,sounds/applause.wav,-3 gain,6 -o out.wav
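
The same command can also be launched from a Python script. The sketch below only relies on the -i, -d and -o options shown above; build_command and run_degrader are hypothetical helper names, and the audio_degrader script is assumed to be on the PATH.

```python
import subprocess


def build_command(input_path, degradations, output_path):
    """Return the audio_degrader argument list for a chain of degradations."""
    return (["audio_degrader", "-i", input_path, "-d"]
            + list(degradations)
            + ["-o", output_path])


def run_degrader(input_path, degradations, output_path):
    """Run the command; raises CalledProcessError on a non-zero exit status."""
    subprocess.check_call(build_command(input_path, degradations, output_path))


cmd = build_command("input.mp3",
                    ["mix,sounds/applause.wav,-3", "gain,6"],
                    "out.wav")
print(" ".join(cmd))
```

Passing the arguments as a list avoids shell quoting problems when paths or URLs contain special characters.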

Applications

Evaluate Music Information Retrieval systems under different degrees of degradation
Prepare augmented data for training of machine learning systems

It is similar to the Audio Degradation Toolbox for Matlab by Sebastian Ewert and Matthias Mauch.
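
For the evaluation use case listed above, one possible approach is to generate one degraded copy of the input per noise level. The sketch below builds the degradation chains and prints the corresponding commands for a sweep over SNR values. snr_sweep, the chosen SNR values and the output file naming are illustrative assumptions; only the mix and normalize degradations and the installed sounds/applause.wav resource come from this document.

```python
def snr_sweep(noise_path, snrs_db):
    """Return one degradation chain (a list of strings) per SNR value."""
    return [["mix,{},{}".format(noise_path, snr), "normalize"]
            for snr in snrs_db]


snrs = [20, 10, 0, -3]
chains = snr_sweep("sounds/applause.wav", snrs)

# Print one command per SNR, each writing to its own output file
commands = []
for snr, chain in zip(snrs, chains):
    out_path = "out_snr{}.wav".format(snr)
    command = ["audio_degrader", "-i", "input.wav", "-d"] + chain + ["-o", out_path]
    commands.append(" ".join(command))
    print(commands[-1])
```

Each output file can then be fed to the system under evaluation, so that its results can be compared across noise levels.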

Some examples

# Mix input with a sound / noise (e.g. using installed resources)
$ audio_degrader -i input.wav -d mix,sounds/applause.wav,-3 -o out.wav


# Instead of paths, we can also use URLs
$ audio_degrader -i input.wav -d mix,https://www.pacdv.com/sounds/ambience_sounds/airport-security-1.mp3,-3 -o out.wav


# Microphone recording style
$ audio_degrader -i input.wav -d gain,-15 mix,sounds/ambience-pub.wav,18 convolution,impulse_responses/ir_smartphone_mic_mono.wav,0.8 dr_compression,2 equalize,50,100,-6 normalize -o out.wav


# Resample and normalize
$ audio_degrader -i input.mp3 -d resample,8000 normalize -o out.wav


# Convolution (again impulse responses can be resources, full paths or URLs)
$ audio_degrader -i input.wav -d convolution,impulse_responses/ir_classroom_mono.wav,0.7 -o out.wav
$ audio_degrader -i input.wav -d convolution,http://www.cksde.com/sounds/month_ir/FLANGERSPACE%20E001%20M2S.wav,0.7 -o out.wav

Audio formats

Input

audio_degrader relies on ffmpeg for audio reading, so it can read any format that ffmpeg supports (including the audio track of video files).

Output

The output format is always stereo WAV encoded as pcm_f32le (32-bit float, little-endian). The sample rate is taken from the original audio file.
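
To check these properties on a file, the WAV header can be inspected directly. The sketch below parses the 'fmt ' chunk of a RIFF/WAVE byte string with the standard struct module. read_wav_format is a hypothetical helper, and the header in the example is synthetic rather than taken from an actual audio_degrader output. Format tag 3 denotes IEEE float samples; some writers use the extensible tag 0xFFFE instead, which this sketch does not decode further.

```python
import struct


def read_wav_format(data):
    """Parse the 'fmt ' chunk of a RIFF/WAVE byte string.

    Returns (format_tag, channels, sample_rate, bits_per_sample).
    """
    if data[0:4] != b"RIFF" or data[8:12] != b"WAVE":
        raise ValueError("not a RIFF/WAVE file")
    pos = 12
    while pos + 8 <= len(data):
        chunk_id = data[pos:pos + 4]
        (size,) = struct.unpack("<I", data[pos + 4:pos + 8])
        if chunk_id == b"fmt ":
            tag, channels, rate, _byte_rate, _align, bits = struct.unpack(
                "<HHIIHH", data[pos + 8:pos + 24])
            return tag, channels, rate, bits
        pos += 8 + size + (size & 1)  # chunks are padded to an even size
    raise ValueError("no 'fmt ' chunk found")


# Synthetic header: IEEE float (tag 3), stereo, 44100 Hz, 32 bits, no audio data
fmt = struct.pack("<HHIIHH", 3, 2, 44100, 44100 * 8, 8, 32)
header = (b"RIFF" + struct.pack("<I", 4 + 8 + len(fmt)) + b"WAVE"
          + b"fmt " + struct.pack("<I", len(fmt)) + fmt)

info = read_wav_format(header)
print(info)
```

For a real file, the first few kilobytes read in binary mode are normally enough, e.g. read_wav_format(open('out.wav', 'rb').read(4096)).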

This output WAV file can easily be converted into another format with ffmpeg, e.g.:

$ ffmpeg -i out.wav -b:a 320k out.mp3
$ ffmpeg -i out.wav -ac 2 -ar 44100 -acodec pcm_s16le out_formatted.wav

Release history

1.3.1 (Jan 31, 2021)
1.3.0 (Jan 31, 2021)
1.2.3 (Apr 24, 2019), this version
1.2.2 (Apr 7, 2019)
1.2.1 (Apr 7, 2019)
1.2 (Apr 7, 2019)
1.1.1 (Aug 27, 2018)
1.1 (Aug 26, 2018)
1.0.5 (May 7, 2018)
1.0.4 (May 6, 2018)
1.0.3 (May 6, 2018)
1.0.2 (May 6, 2018)
1.0.1 (May 6, 2018)

Download files

Source Distribution

audio_degrader-1.2.3.tar.gz (24.3 kB), uploaded Apr 24, 2019

Built Distribution

audio_degrader-1.2.3-py2-none-any.whl (19.0 MB), Python 2, uploaded Apr 24, 2019

Hashes for audio_degrader-1.2.3.tar.gz

Algorithm	Hash digest
SHA256	`3a4487e38ca9dc7cd7b67330ef9a7924f7f58bfa54d8bc9aa42673fbd3a0ee5b`
MD5	`a07251b0b9b10a35fdbcd73d395f69a4`
BLAKE2b-256	`da8519bc7ebaf57033575c92647efbb95f17e5e4ed944b11620f2938c7079d94`

Hashes for audio_degrader-1.2.3-py2-none-any.whl

Algorithm	Hash digest
SHA256	`8c1e1358c1d0bba585d159e469fcfa96729da3c05f8eecd2619eb7c68a9e775c`
MD5	`1bfa9219cf9094226d49b5214883cfd1`
BLAKE2b-256	`41dced544b854a9a441d7c1ecd0dbbf748cd638ca5e10f7bde8e7a785d6601b6`