ValidDataSet - TTS LJ Speech Dataset Validator

ValidDataSet

About

ValidDataSet (VDS) validates datasets modeled on the LJ Speech Dataset, such as those prepared for Tacotron, Flowtron, WaveGlow, or RadTTS.

VDS is plugin-based (in the future, users will be able to add their own plugins dynamically).

Descriptions of current plugins can be found in the Plugins section.
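
For context, the original LJ Speech release ships a "wavs" folder next to a pipe-delimited metadata.csv whose first field is the clip ID. The sketch below is not VDS code; it only illustrates, under that layout assumption, how WAV files and transcription entries can be cross-checked. The function name and the default file name are hypothetical.

```python
from pathlib import Path


def compare_wavs_and_transcription(dataset_dir, transcription_name="metadata.csv"):
    """Return (missing_files, untranscribed) for an LJ Speech-style dataset.

    Assumes <dataset_dir>/wavs/*.wav and a pipe-delimited transcription file
    whose first field names the clip (an ID or a path to the WAV file).
    """
    dataset_dir = Path(dataset_dir)
    wav_stems = {p.stem for p in (dataset_dir / "wavs").glob("*.wav")}

    listed_stems = set()
    with open(dataset_dir / transcription_name, encoding="utf-8") as handle:
        for line in handle:
            if line.strip():
                # "LJ001-0001|text" and "wavs/LJ001-0001.wav|text" both map
                # to the stem "LJ001-0001".
                listed_stems.add(Path(line.split("|")[0].strip()).stem)

    missing_files = sorted(listed_stems - wav_stems)  # listed, but no WAV on disk
    untranscribed = sorted(wav_stems - listed_stems)  # WAV on disk, but not listed
    return missing_files, untranscribed
```

Both returned lists are empty when every WAV file is listed and every listed clip exists on disk.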

Plugins

Below is a list of currently used plugins (new ones will be added over time).

ID    Name                                 Version  Description
F001  WavsTranscriptionChecker             23.2.26  Check if all files have been added to the transcription files
T001  DatasetStructureChecker              23.2.26  Check if the "wavs" folder and transcription files exist in the dataset
T002  EmptyLineChecker                     23.2.26  Check if there are empty lines in the transcriptions
T003  FilesInTranscriptionChecker          23.2.26  Check if all files added to transcription exist
T004  ExistingWavFileTranscriptionChecker  23.2.26  Check if all files added to transcription have a transcription
T005  PunctuationMarksChecker              23.2.26  Check if all transcriptions end with punctuation marks: ".", "?" or "!"
T006  PunctuationMarksChecker              23.2.26  Check if all lines have the same number of PIPE characters
T007  DuplicatedTranscriptionChecker       23.2.26  Check if there are any duplicate paths to WAV files in the transcriptions
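
To make the line-level checks more concrete, here is a minimal sketch of checks in the spirit of T002, T005, T006 and T007. It is not the VDS implementation. It assumes that the first pipe-separated field identifies the WAV file, that the last field holds the transcription text, and that the most common pipe count is the expected one. The function name is hypothetical.

```python
from collections import Counter

END_MARKS = (".", "?", "!")


def check_transcription_lines(lines):
    """Return a sorted list of (line_number, message) problems."""
    rows = [line.rstrip("\n") for line in lines]
    problems = []

    # T002-style check: report empty lines.
    for number, row in enumerate(rows, start=1):
        if not row.strip():
            problems.append((number, "empty line"))

    non_empty = [(n, r) for n, r in enumerate(rows, start=1) if r.strip()]

    # T006-style check: every line should have the same number of pipes.
    # Assumption: the most common count is treated as the expected one.
    pipe_counts = Counter(r.count("|") for _, r in non_empty)
    if pipe_counts:
        expected = pipe_counts.most_common(1)[0][0]
        for n, r in non_empty:
            found = r.count("|")
            if found != expected:
                problems.append((n, f"expected {expected} pipe(s), found {found}"))

    # T005-style check: the transcription (last field) ends with ".", "?" or "!".
    for n, r in non_empty:
        text = r.split("|")[-1].strip()
        if not text.endswith(END_MARKS):
            problems.append((n, "transcription does not end with '.', '?' or '!'"))

    # T007-style check: the same WAV path (first field) must not appear twice.
    first_seen = {}
    for n, r in non_empty:
        path = r.split("|")[0].strip()
        if path in first_seen:
            problems.append((n, f"duplicate entry for {path!r}, first seen on line {first_seen[path]}"))
        else:
            first_seen[path] = n

    return sorted(problems)
```

The function takes any iterable of lines, so an open transcription file can be passed directly; an empty result means none of these four checks found a problem.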

Installation

To install ValidDataSet, use the following command:

pip install vds

Usage

List of parameters supported by VDS:

 -p, --path            Path to dataset
 -d, --disable         Disable plugins
 -f, --files           Set transcription file names
     --list-plugins    List plugins
 -v, --verbose         Print additional information

Sample commands and their descriptions:

List all plugins:

vds --list-plugins

Run VDS with all plugins without additional information:

vds --path /media/username/Disk/Dataset_name/

Run VDS with all plugins with additional information:

vds --path /media/username/Disk/Dataset_name/ -v

Run VDS without plugins F001,T002,T006 with additional information:

vds --path /media/username/Disk/Dataset_name/ --disable F001,T002,T006 -v

Run VDS without plugins F001,T002,T006, with custom transcription file names, and with additional information:

vds --path /media/username/Disk/Dataset_name/ --disable F001,T002,T006 --files train.txt,val.txt -v
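
When VDS is called from a script, for example before a training job, the same commands can be assembled programmatically. The helper below is a hypothetical sketch that uses only the options listed above. It is not part of VDS, and since VDS's exit codes are not documented here, the example does not rely on them.

```python
import shlex
import shutil
import subprocess


def build_vds_command(dataset_path, disable=(), files=(), verbose=False):
    """Assemble a `vds` command line from the documented options."""
    command = ["vds", "--path", str(dataset_path)]
    if disable:
        # Plugin IDs are passed as one comma-separated value, e.g. F001,T002.
        command += ["--disable", ",".join(disable)]
    if files:
        # Transcription file names are comma-separated as well.
        command += ["--files", ",".join(files)]
    if verbose:
        command.append("-v")
    return command


if __name__ == "__main__":
    command = build_vds_command(
        "/media/username/Disk/Dataset_name/",
        disable=["F001", "T002", "T006"],
        files=["train.txt", "val.txt"],
        verbose=True,
    )
    print(shlex.join(command))
    # Run only if the `vds` executable is installed (pip install vds).
    if shutil.which("vds"):
        subprocess.run(command, check=False)
```

Passing the arguments as a list avoids shell quoting problems when the dataset path contains spaces.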
