Skip to main content

A library for running inference on a DeepSpeech model

Project description

Documentation Task Status

DeepSpeech is an open source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu’s Deep Speech research paper. Project DeepSpeech uses Google’s TensorFlow to make the implementation easier.

To install and use deepspeech all you have to do is:

# Create and activate a virtualenv
virtualenv -p python3 $HOME/tmp/deepspeech-venv/
source $HOME/tmp/deepspeech-venv/bin/activate

# Install DeepSpeech
pip3 install deepspeech

# Download pre-trained English model and extract
curl -LO https://github.com/mozilla/DeepSpeech/releases/download/v0.5.1/deepspeech-0.5.1-models.tar.gz
tar xvf deepspeech-0.5.1-models.tar.gz

# Download example audio files
curl -LO https://github.com/mozilla/DeepSpeech/releases/download/v0.5.1/audio-0.5.1.tar.gz
tar xvf audio-0.5.1.tar.gz

# Transcribe an audio file
deepspeech --model deepspeech-0.5.1-models/output_graph.pbmm --lm deepspeech-0.5.1-models/lm.binary --trie deepspeech-0.5.1-models/trie --audio audio/2830-3980-0043.wav

A pre-trained English model is available for use and can be downloaded using the instructions below. Currently, only 16-bit, 16 kHz, mono-channel WAVE audio files are supported in the Python client. A package with some example audio files is available for download in our release notes.

Quicker inference can be performed using a supported NVIDIA GPU on Linux. See the release notes to find which GPUs are supported. To run deepspeech on a GPU, install the GPU specific package:

# Create and activate a virtualenv
virtualenv -p python3 $HOME/tmp/deepspeech-gpu-venv/
source $HOME/tmp/deepspeech-gpu-venv/bin/activate

# Install DeepSpeech CUDA enabled package
pip3 install deepspeech-gpu

# Transcribe an audio file.
deepspeech --model deepspeech-0.5.1-models/output_graph.pbmm --lm deepspeech-0.5.1-models/lm.binary --trie deepspeech-0.5.1-models/trie --audio audio/2830-3980-0043.wav

Please ensure you have the required CUDA dependencies.

See the output of deepspeech -h for more information on the use of deepspeech. (If you experience problems running deepspeech, please check required runtime dependencies).


Table of Contents

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

deepspeech_gpu-0.6.0a14-cp38-cp38-win_amd64.whl (17.3 MB view details)

Uploaded CPython 3.8Windows x86-64

deepspeech_gpu-0.6.0a14-cp38-cp38-manylinux1_x86_64.whl (18.7 MB view details)

Uploaded CPython 3.8

deepspeech_gpu-0.6.0a14-cp37-cp37m-win_amd64.whl (17.3 MB view details)

Uploaded CPython 3.7mWindows x86-64

deepspeech_gpu-0.6.0a14-cp37-cp37m-manylinux1_x86_64.whl (18.7 MB view details)

Uploaded CPython 3.7m

deepspeech_gpu-0.6.0a14-cp36-cp36m-win_amd64.whl (17.3 MB view details)

Uploaded CPython 3.6mWindows x86-64

deepspeech_gpu-0.6.0a14-cp36-cp36m-manylinux1_x86_64.whl (18.7 MB view details)

Uploaded CPython 3.6m

deepspeech_gpu-0.6.0a14-cp35-cp35m-win_amd64.whl (17.3 MB view details)

Uploaded CPython 3.5mWindows x86-64

deepspeech_gpu-0.6.0a14-cp35-cp35m-manylinux1_x86_64.whl (18.7 MB view details)

Uploaded CPython 3.5m

deepspeech_gpu-0.6.0a14-cp34-cp34m-manylinux1_x86_64.whl (18.7 MB view details)

Uploaded CPython 3.4m

deepspeech_gpu-0.6.0a14-cp27-cp27mu-manylinux1_x86_64.whl (18.7 MB view details)

Uploaded CPython 2.7mu

deepspeech_gpu-0.6.0a14-cp27-cp27m-manylinux1_x86_64.whl (18.7 MB view details)

Uploaded CPython 2.7m

File details

Details for the file deepspeech_gpu-0.6.0a14-cp38-cp38-win_amd64.whl.

File metadata

  • Download URL: deepspeech_gpu-0.6.0a14-cp38-cp38-win_amd64.whl
  • Upload date:
  • Size: 17.3 MB
  • Tags: CPython 3.8, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.19.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.2

File hashes

Hashes for deepspeech_gpu-0.6.0a14-cp38-cp38-win_amd64.whl
Algorithm Hash digest
SHA256 24418e4ecc2f7422fd44ea5ef8020172d4936ba6c0289f652cab9a6a2687c62f
MD5 e73a1f8f4e3aaf9b3020d3c9f2bd13a4
BLAKE2b-256 09cc21ec099df359dc1b0601df632b596ac3139a7bcc1970dde9dc8b36dbe948

See more details on using hashes here.

File details

Details for the file deepspeech_gpu-0.6.0a14-cp38-cp38-manylinux1_x86_64.whl.

File metadata

  • Download URL: deepspeech_gpu-0.6.0a14-cp38-cp38-manylinux1_x86_64.whl
  • Upload date:
  • Size: 18.7 MB
  • Tags: CPython 3.8
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.19.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.2

File hashes

Hashes for deepspeech_gpu-0.6.0a14-cp38-cp38-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 a32ac3db97724e2be23342e41b2564ac26314fd2d3a0cbabd68fd7cbaf4336e9
MD5 10700ef187aa15c6018276c9d062c8c5
BLAKE2b-256 f4582907177ea72effb315fa53c983bcd87d74fc09d647ac171c3f4d75f4c99a

See more details on using hashes here.

File details

Details for the file deepspeech_gpu-0.6.0a14-cp37-cp37m-win_amd64.whl.

File metadata

  • Download URL: deepspeech_gpu-0.6.0a14-cp37-cp37m-win_amd64.whl
  • Upload date:
  • Size: 17.3 MB
  • Tags: CPython 3.7m, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.19.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.2

File hashes

Hashes for deepspeech_gpu-0.6.0a14-cp37-cp37m-win_amd64.whl
Algorithm Hash digest
SHA256 329efb17df6fbcc088399e15aab554bc218128d55c18f1c4fdba31d5fbce8009
MD5 a8a7ccebc70e4c662034cd80b903c860
BLAKE2b-256 cf3e89865535b219c7528d0ec770a31fea90bddc0d01c4200822a6188e86281f

See more details on using hashes here.

File details

Details for the file deepspeech_gpu-0.6.0a14-cp37-cp37m-manylinux1_x86_64.whl.

File metadata

  • Download URL: deepspeech_gpu-0.6.0a14-cp37-cp37m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 18.7 MB
  • Tags: CPython 3.7m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.19.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.2

File hashes

Hashes for deepspeech_gpu-0.6.0a14-cp37-cp37m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 1f9ab00c7dceb1da3d6a908c147790456eaffb069700aaf82e4e1676b0a6abb2
MD5 8fc839c1255909ee16e67b27d6257278
BLAKE2b-256 c21e90a914af48792e5f0492ede2cf45bf7e05be370696e183e2849056a563d1

See more details on using hashes here.

File details

Details for the file deepspeech_gpu-0.6.0a14-cp36-cp36m-win_amd64.whl.

File metadata

  • Download URL: deepspeech_gpu-0.6.0a14-cp36-cp36m-win_amd64.whl
  • Upload date:
  • Size: 17.3 MB
  • Tags: CPython 3.6m, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.19.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.2

File hashes

Hashes for deepspeech_gpu-0.6.0a14-cp36-cp36m-win_amd64.whl
Algorithm Hash digest
SHA256 7e5e4f4e162a82d1e67f3ebe4fb3ee8ae7348cc1c44d0fc71ff84ef42fd9b110
MD5 605abb909c85964121d147fa9a59d48f
BLAKE2b-256 808131d51dd755836f3d0bcd1d5673b0028c815d3769ef884b30bed56a2036bf

See more details on using hashes here.

File details

Details for the file deepspeech_gpu-0.6.0a14-cp36-cp36m-manylinux1_x86_64.whl.

File metadata

  • Download URL: deepspeech_gpu-0.6.0a14-cp36-cp36m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 18.7 MB
  • Tags: CPython 3.6m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.19.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.2

File hashes

Hashes for deepspeech_gpu-0.6.0a14-cp36-cp36m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 c3fa4e1651f57e43262e743e55ba2073b15369245df58d087ffc8ac494fc046d
MD5 3ffd7fdd73a906b09e023a432e635399
BLAKE2b-256 2c5b86e68f68de506d5e67c91dd9862eabb5708e355bc5835177812b290f65a6

See more details on using hashes here.

File details

Details for the file deepspeech_gpu-0.6.0a14-cp35-cp35m-win_amd64.whl.

File metadata

  • Download URL: deepspeech_gpu-0.6.0a14-cp35-cp35m-win_amd64.whl
  • Upload date:
  • Size: 17.3 MB
  • Tags: CPython 3.5m, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.19.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.2

File hashes

Hashes for deepspeech_gpu-0.6.0a14-cp35-cp35m-win_amd64.whl
Algorithm Hash digest
SHA256 14f196c4e0ca19f86577639ccba424e26d5e615c404c307165670762876d8dff
MD5 ed2409a538cba036e108021d7a99a56d
BLAKE2b-256 eb055f55b06b0b8ad1c76d75ea5f7e67930aca72a3ba1550fdfdb47b7ca23e82

See more details on using hashes here.

File details

Details for the file deepspeech_gpu-0.6.0a14-cp35-cp35m-manylinux1_x86_64.whl.

File metadata

  • Download URL: deepspeech_gpu-0.6.0a14-cp35-cp35m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 18.7 MB
  • Tags: CPython 3.5m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.19.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.2

File hashes

Hashes for deepspeech_gpu-0.6.0a14-cp35-cp35m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 d9ca6de5d0c457b8e9fb27ace77a31c17886d1f624cdec1afc6fa3a66fc668f1
MD5 fc49fb8621d42a89ca1ad6b760f5e760
BLAKE2b-256 f62784790745ca8c25c3dab9262c78ba0f5472c31b3ceede360095e7d77ce0e4

See more details on using hashes here.

File details

Details for the file deepspeech_gpu-0.6.0a14-cp34-cp34m-manylinux1_x86_64.whl.

File metadata

  • Download URL: deepspeech_gpu-0.6.0a14-cp34-cp34m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 18.7 MB
  • Tags: CPython 3.4m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.19.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.2

File hashes

Hashes for deepspeech_gpu-0.6.0a14-cp34-cp34m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 7c80f4ed38b736e37fc21b802802cd8aac53c8e618c7993cb7ae4bda022eaa85
MD5 e0bf15b46dd9b02d3a28df0b58c1f30b
BLAKE2b-256 3767e61385c9b1e6efedd6d134eca1f217998f03bedffa2b8da717c3d400ea7e

See more details on using hashes here.

File details

Details for the file deepspeech_gpu-0.6.0a14-cp27-cp27mu-manylinux1_x86_64.whl.

File metadata

  • Download URL: deepspeech_gpu-0.6.0a14-cp27-cp27mu-manylinux1_x86_64.whl
  • Upload date:
  • Size: 18.7 MB
  • Tags: CPython 2.7mu
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.19.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.2

File hashes

Hashes for deepspeech_gpu-0.6.0a14-cp27-cp27mu-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 1c443793e14c557acbca00b5fd7cc4e34991d25d75a63de777221e4e383eccdc
MD5 92057fd547b66f5056eba62b753e6a19
BLAKE2b-256 acaff357330abf27463ee19ee6dc8ea7a887f918ddef81b77116c8737f2ebb6f

See more details on using hashes here.

File details

Details for the file deepspeech_gpu-0.6.0a14-cp27-cp27m-manylinux1_x86_64.whl.

File metadata

  • Download URL: deepspeech_gpu-0.6.0a14-cp27-cp27m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 18.7 MB
  • Tags: CPython 2.7m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.19.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.2

File hashes

Hashes for deepspeech_gpu-0.6.0a14-cp27-cp27m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 abb939e23df4abcc307ecf26ad3ca34684bd564b4ffb06200e70436ff75bedda
MD5 8524b5c01c10d0b2681f78996efc18a1
BLAKE2b-256 e86f21dc24e17709b97a92ca98e2056bf8a5c384036674b8266c1572943c7c55

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page