Clone a voice in 5 seconds to generate arbitrary speech in real-time
πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
π A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS, FastSpeech2/FastPitch, SpeedySpeech, WaveFlow and Parallel WaveGAN)
singing voice change based on whisper, and lora for singing voice clone
A webui for different audio related Neural Networks
Voice Conversion by CycleGAN (θ―ι³ε ι/θ―ι³θ½¬ζ’): CycleGAN-VC2
This repository has implementation for "Neural Voice Cloning With Few Samples"
The code for the bark-voicecloning model. Training and inference.
Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.
Get updates on the fastest growing repos and cool stats about GitHub right in your inbox
Once per month. No spam.