Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recognition.
Project description
Wenet STT Python
Beta Software
Simple Python library, distributed via binary wheels with few direct dependencies, for easily using WeNet models for speech recognition.
Requirements:
- Python 3.7+
- Platform: Linux x64 (Windows is a work in progress; MacOS may work; PRs welcome)
- Python package requirements:
cffi
,numpy
- Wenet Model (must be "runtime" format)
- Several are available ready-to-go on this project's releases page and below.
Models:
Model | Download Size |
---|---|
gigaspeech_20210728_u2pp_conformer | 549 MB |
gigaspeech_20210811_conformer_bidecoder | 540 MB |
Usage
from wenet_stt import WenetSTT
decoder = WenetSTT(WenetSTT.build_config('model_dir'))
import wave
with wave.open('tests/test.wav', 'rb') as wav_file:
wav_samples = wav_file.readframes(wav_file.getnframes())
assert decoder.decode(wav_samples).strip().lower() == 'it depends on the context'
Also contains a simple CLI interface for recognizing wav
files:
$ python -m wenet_stt decode model test.wav
IT DEPENDS ON THE CONTEXT
$ python -m wenet_stt decode model test.wav test.wav
IT DEPENDS ON THE CONTEXT
IT DEPENDS ON THE CONTEXT
$ python -m wenet_stt -h
usage: python -m wenet_stt [-h] {decode} ...
positional arguments:
{decode} sub-command
decode decode one or more WAV files
optional arguments:
-h, --help show this help message and exit
Installation/Building
Recommended installation via wheel from pip (requires a recent version of pip):
python -m pip install wenet_stt
To build package for use locally:
python setup.py bdist_wheel
To build package for publishing:
building/dockcross-manylinux2014-x64 bash building/build-wheel-dockcross.sh manylinux2014_x86_64
Author
- David Zurow (@daanzu)
License
This project is licensed under the GNU Affero General Public License v3 (AGPL-3.0-or-later). See the LICENSE file for details. If this license is problematic for you, please contact me.
Acknowledgments
- Contains and uses code from WeNet, licensed under the Apache-2.0 License, and other transitive dependencies (see source).
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distributions
Hashes for wenet_stt-0.2.0-py2.py3-none-win_amd64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 14d466b1efcf4366978e148edf0a7f2b9cdd2ed9d0f88dcb6a82ee9ad8b6c671 |
|
MD5 | e61345730b5930ba80e26f8f1515c658 |
|
BLAKE2b-256 | 3c0233da2c9c4b6f19eea1db24122a1b353c44771b815dc624dc42ad89631a4d |
Hashes for wenet_stt-0.2.0-py2.py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | c2130149d767243929f0c54162df70458898c787d0d66b1b6c9d1191935f95be |
|
MD5 | f54679bbad1e1641b013ada84b0e18d6 |
|
BLAKE2b-256 | 2b453b026c292755365374c707e69b7f12ab136f8902428ce5b3e7b487257c51 |