Skip to main content

🌊 Memory map WAVE or raw audio files 🌊

Project description

WaveMap logo

Manipulate huge WAVE or RAW files as numpy matrices - even if they are too large to fit into memory.

Memory mapping is a technique where files on disk are directly mapped to locations in memory and use the same logic as swap space does.

Samples from a WAVE or RAW audio file are directly memory mapped to entries in a numpy array, letting you manipulate very large audio files as if they all fit into memory at one time, and even directly change samples on disk.

Typical usage:

import wavemap

wm = wavemap('test.wav', 'r+')  # r+ means read/write
# Now you have a numpy matrix that you can use like any other

wm /= 2
# Each sample in the file is scaled by half.

API

wavemap()

wavemap(
     filename: str,
     mode: str='r',
     order: Union[str, NoneType]=None,
     always_2d: bool=False,
     dtype: Union[numpy.dtype, NoneType]=None,
     shape: Union[NoneType, int, tuple]=None,
     sample_rate: int=0,
     roffset: int=0,
     warn: Union[Callable, NoneType]='<function warn:  print to stderr>',
)

(wavemap/__init__.py, 56-121)

Memory map a WAVE file to a numpy array

Return an instance of ReadMap or WriteMap, depending on mode.

ARGUMENTS
filename

The name of the file being mapped

mode

The file is opened in this mode. Must be one of 'r', 'r+', 'c', 'w+'

In mode 'r', the default, the file is opened read-only and the numpy.darray is immutable.

In mode 'r+', the file is opened read-write and changes to the numpy.darray are automatically applied to the file.

In mode 'c', “copy-on-write”, the file is opened read-only, but the numpy.darray is not immutable: changes to the array are instead stored in memory.

In mode 'w+', “write”, the file is opened for write, and overwrites whatever else is there.

order

Samples usually get laid out in into a numpy.darray with`` shape=(N, C)`` where N is the number of audio frames, and C is the number of channels.

This is called column major order, but this can be toggled by setting the order parameter to F for Fortan or row-major row.

See https://stackoverflow.com/questions/27266338/

always_2d

If False, the default, mono WAVE files with only one channel get special treatment and are mapped to a one-dimensional vector with size=(N,).

If True, mono WAVE files are treated the same as any other file and are mapped to a two-dimensional matrix with size=(N, 1).

dtype

The numpy datatype of the samples in the file.

shape

The shape of the resulting numpy.darray. Can be a tuple, or a positive integer, or None.

sample_rate

The sample rate in Hz (cycles per second)

roffset

How many bytes in the file after the WAV data

warn

Programmers are sloppy so quite a lot of real-world WAVE files have recoverable errors in their format. warn is the function used to report those recoverable errors. By default, it’s set to print to sys.stderr but setting it to None disables errors entirely, or you can pass your own callback in

Class wavemap.RawMap

(wavemap/raw.py, 14-67)

“Memory map raw audio data from a disk file into a numpy matrix

wavemap.RawMap.__new__()

wavemap.RawMap.__new__(
     cls,
     filename: str,
     dtype: numpy.dtype,
     shape: Union[tuple, int, NoneType]=None,
     mode: str='r',
     offset: int=0,
     roffset: int=0,
     order: Union[str, NoneType]=None,
     always_2d: bool=False,
     warn: Union[Callable, NoneType]='<function warn:  print to stderr>',
)

(wavemap/raw.py, 17-67)

Memory map raw audio data from a disk file into a numpy matrix

ARGUMENTS
cls

Think of this as self. (This is because you need to implement __new__ and not __init__ when deriving from np.darray.)

filename

The name of the file being mapped

dtype

The numpy datatype of the samples in the file.

shape

The shape of the resulting numpy.darray. Can be a tuple, or a positive integer, or None.

mode

The file is opened in this mode. Must be one of 'r', 'r+', 'c', 'w+'

In mode 'r', the default, the file is opened read-only and the numpy.darray is immutable.

In mode 'r+', the file is opened read-write and changes to the numpy.darray are automatically applied to the file.

In mode 'c', “copy-on-write”, the file is opened read-only, but the numpy.darray is not immutable: changes to the array are instead stored in memory.

In mode 'w+', “write”, the file is opened for write, and overwrites whatever else is there.

offset

How many bytes in the file before the WAV data

roffset

How many bytes in the file after the WAV data

order

Samples usually get laid out in into a numpy.darray with`` shape=(N, C)`` where N is the number of audio frames, and C is the number of channels.

This is called column major order, but this can be toggled by setting the order parameter to F for Fortan or row-major row.

See https://stackoverflow.com/questions/27266338/

always_2d

If False, the default, mono WAVE files with only one channel get special treatment and are mapped to a one-dimensional vector with size=(N,).

If True, mono WAVE files are treated the same as any other file and are mapped to a two-dimensional matrix with size=(N, 1).

warn

Programmers are sloppy so quite a lot of real-world WAVE files have recoverable errors in their format. warn is the function used to report those recoverable errors. By default, it’s set to print to sys.stderr but setting it to None disables errors entirely, or you can pass your own callback in

Class wavemap.ReadMap

(wavemap/read.py, 18-84)

Memory-map an existing WAVE file into a numpy vector or matrix

wavemap.ReadMap.__new__()

wavemap.ReadMap.__new__(
     cls: Type,
     filename: str,
     mode: str='r',
     order: Union[str, NoneType]=None,
     always_2d: bool=False,
     warn: Union[Callable, NoneType]='<function warn:  print to stderr>',
)

(wavemap/read.py, 21-84)

Memory-map an existing WAVE file into a numpy matrix.

ARGUMENTS
cls

Think of this as self. (This is because you need to implement __new__ and not __init__ when deriving from np.darray.)

filename

The name of the file being mapped

mode

The file is opened in this mode. Must be one of 'r', 'r+' and 'c'.

In mode 'r', the default, the file is opened read-only and the numpy.darray is immutable.

In mode 'r+', the file is opened read-write and changes to the numpy.darray are automatically applied to the file.

In mode 'c', “copy-on-write”, the file is opened read-only, but the numpy.darray is not immutable: changes to the array are instead stored in memory.

order

Samples usually get laid out in into a numpy.darray with`` shape=(N, C)`` where N is the number of audio frames, and C is the number of channels.

This is called column major order, but this can be toggled by setting the order parameter to F for Fortan or row-major row.

See https://stackoverflow.com/questions/27266338/

always_2d

If False, the default, mono WAVE files with only one channel get special treatment and are mapped to a one-dimensional vector with size=(N,).

If True, mono WAVE files are treated the same as any other file and are mapped to a two-dimensional matrix with size=(N, 1).

warn

Programmers are sloppy so quite a lot of real-world WAVE files have recoverable errors in their format. warn is the function used to report those recoverable errors. By default, it’s set to print to sys.stderr but setting it to None disables errors entirely, or you can pass your own callback in

Class wavemap.WriteMap

(wavemap/write.py, 12-115)

“Memory-map a new wave file into a new numpy vector or matrix

wavemap.WriteMap.__new__()

wavemap.WriteMap.__new__(
     cls: Type,
     filename: str,
     dtype: numpy.dtype,
     shape: Union[NoneType, int, tuple],
     sample_rate: int,
     roffset: int=0,
     warn: Union[Callable, NoneType]='<function warn:  print to stderr>',
)

(wavemap/write.py, 15-85)

Open a memory-mapped WAVE file in write mode and overwrite any existing file.

ARGUMENTS
cls

Think of this as self. (This is because you need to implement __new__ and not __init__ when deriving from np.darray.)

filename

The name of the file being mapped

dtype

The numpy datatype of the samples in the file.

shape

The shape of the resulting numpy.darray. Can be a tuple, or a positive integer, or None.

sample_rate

The sample rate in Hz (cycles per second)

roffset

How many bytes in the file after the WAV data

warn

Programmers are sloppy so quite a lot of real-world WAVE files have recoverable errors in their format. warn is the function used to report those recoverable errors. By default, it’s set to print to sys.stderr but setting it to None disables errors entirely, or you can pass your own callback in

wavemap.convert()

wavemap.convert(
     arr: numpy.ndarray,
     dtype: Union[numpy.dtype, NoneType],
     must_copy: bool=False,
)

(wavemap/convert.py, 6-77)

Returns a copy of a numpy array or matrix that represents audio data in another type, scaling and shifting as necessary.

ARGUMENTS
arr

A numpy darray representing an audio signal

dtype

The numpy dtype to convert to - none means “no conversion”

must_copy

If true, arr is copied even if it is already the requested type

(automatically generated by doks on 2021-02-23T14:37:02.652534)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

wavemap-2.1.0.tar.gz (13.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

wavemap-2.1.0-py3-none-any.whl (18.3 kB view details)

Uploaded Python 3

File details

Details for the file wavemap-2.1.0.tar.gz.

File metadata

  • Download URL: wavemap-2.1.0.tar.gz
  • Upload date:
  • Size: 13.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.28 {"installer":{"name":"uv","version":"0.9.28","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for wavemap-2.1.0.tar.gz
Algorithm Hash digest
SHA256 29bd323784948db1071736ce199123c7adf6f6e7a9ef15cbd33a4485458809db
MD5 4ae7b6e0b3fac4ceaa830751d4599a1d
BLAKE2b-256 c76598e270a192125a60ce44240133209bf0f4144f809936758166cb26eeeba8

See more details on using hashes here.

File details

Details for the file wavemap-2.1.0-py3-none-any.whl.

File metadata

  • Download URL: wavemap-2.1.0-py3-none-any.whl
  • Upload date:
  • Size: 18.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.28 {"installer":{"name":"uv","version":"0.9.28","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for wavemap-2.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 052ed4561d5fa98fbf4b320ec61ab44d2b556a2f61afde26c13a54be45404230
MD5 c229b4d37048c7c6e1c2ab41afaacd8c
BLAKE2b-256 6d2599eb18f34a386bce4c1027337a82ccd8471f9a54c157964f5482b464fb87

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page