An AI speech processing toolkit

State of the art speech recognition, text-to-speech, spoken language understanding, and more.

Auth.js logo

Robust pre-trained models

Integrate Whisper, wav2vec 2.0, XLS-R, WavLM, HuBERT and more with S3PRL.

1 2 3 4 git clone https://github.com/espnet/espnet cd espnet/tools . ./setup_anaconda.sh anaconda espnet 3.8 make

Installation

All-inclusive or Python modules only.

0

WandB Integration

Seamlessly track and compare experiments with Weights and Biases.

1 2 cd espnet/egs2/librispeech . ./run.sh

Recipes for 100+ Datasets

Data downloading and processing pre-handled. Spend more time developing instead of cleaning.

Built for any speech processing task

And supports NLP tasks, including Machine Translation and Language Modelling

Powered by Predecent