How to clone my own voice locally #185

gitihobo · 2023-08-14T08:20:20Z

I want to create my own dataset, how do I do this? I cant seem to make sense of the dataset section

bitplane · 2023-09-15T11:09:28Z

Dunno if it'll work but I did this:

Used Mozilla Common Voice to help build a dataset for the whole world
Downloaded my data containing a couple of thousand recordings through the Common Voice export page in my profile
Converted the mp3 files in the zip to wav files and put them in a folder called wavs
Then took the text from the export and put the name of the wav followed by the text, pipe separated: wav_file.wav|Text spoken, and put it in metadata.csv
replaced all the fancy quotes for " and the fancy apostrophes with '
zipped this up
Ran docker build . -t voice-cloning-app in the project dir
Then docker run --gpus all -p 5000:5000 -v$(pwd)/data:/app/data voice-cloning-app
Then went to http://localhost:5000/ and uploaded the zip file as an import
It then said it'd take a year to run, so I exported it and used the colab notebook. Had to make a couple of changes to get it working:

You'll need this file in your Drive dir too, call it pretrained.pt:

Provide feedback