Skip to main content

A nendo plugin for speech transcription, based on Whisper by OpenAI.

Project description

Nendo Plugin Transcribe Whisper


nendo core


Website Twitter Discord


A nendo plugin for speech transcription, based on Whisper by OpenAI.

Features

  • Fast speech transcription with optional word-level timestamps.

Requirements

Since we depend on transformers, please make sure that you fulfill their requirements. You also need Pytorch installed on your system, please refer to the pytorch installation instructions.

Installation

  1. Install Nendo
  2. pip install nendo-plugin-transcribe-whisper

If you have a cuda GPU on your machine you can also install flash-attn to get an additional speedup:

pip install flash-attn --no-build-isolation

Then set ATTN_IMPLEMENTATION=flash_attention_2 in your environment variables.

Usage

>>> from nendo import Nendo
>>> nd = Nendo(plugins=["nendo_plugin_transcribe_whisper"])
>>> track = nd.library.add_track(file_path="path/to/file.mp3")

>>> nd.plugins.transcribe_whisper(track=track)
>>> track.get_plugin_value("transcription")

Contributing

Visit our docs to learn all about how to contribute to Nendo: Contributing

License

Nendo: MIT License

Pretrained models: The weights are released under the Apache 2.0 license.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nendo_plugin_transcribe_whisper-0.1.0.tar.gz (5.2 kB view hashes)

Uploaded Source

Built Distribution

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page