RVC Tutorial - Speak in any voice! - Retrieval-based Voice Conversion - Easy AI Voice Tutorial

#aivoices #aivoice #ai #aitutorial #rvc #rvcproject #rvcgui, RVC WebUI, RVC AI Tutorial, RVC GUI Tutorial, RVC Project Tutorial, AI Voice Tutorial, RVC V2, rvc voice changer

In this video you’ll learn how to speak in any voice using nothing but your PC and a microphone. Everything runs locally on your machine. First, we’ll prepare an audio file that serves as the input for training an AI model. We then train the model with RVC-Project (Retrieval-based Voice Conversion) and use it in a different, much simpler user interface (RVC-GUI). Once you have everything set up, you’ll be able to convert a recording of your voice to the AI voice within seconds.

Notes:
- Even in early 2024, this is still the best method and tool to clone any voice with your own voice locally on your PC
- This works with any language; you just need to train the model in the same language that you want to clone (if you train in a different language than the one you clone, the result will have an accent that sounds close to a real one)

My other videos about AI voice-cloning:
- Real-time method for Discord (or Zoom, Skype, etc) to make your own voice sound like any voice:
- Use Text-To-Speech with any voice:

If you run into memory issues, try the following:
- Lower the batch size to “1”.
- Cut the audio into clips shorter than 10 seconds.
- Reduce the size of the dataset.

For any other issues, make sure the folder path of your input voice and of RVC Beta does not contain any spaces or special characters!

/UPDATE 1/ Download Pretrained Voices
/UPDATE 2/ Text To Speech Tutorial (The .pth models are not compatible with the app in this tutorial):
/UPDATE 3/ I have now trained the voice with a dataset of 30 minutes and used 600 epochs. The resulting voice sounds better but is still not perfect. Maybe I should go even higher on the epochs.
/UPDATE 4/ When you create the zip file with the .pth model, also include the .index file that starts with “added...”, which you can find in the /logs/lecturer/ folder
/UPDATE 5/ I have now trained a dataset of 40 minutes with 300 epochs and this seems to give me the best overall results so far
/UPDATE 6/ I have now trained my original 10 minute sample with the RMVPE model (instead of Harvest) and this seems to have reduced some of the robotic noises I was getting. RMVPE is available in this version of RVC:
Using “Harvest” in RVC-GUI works great with an RMVPE-trained model.

What you’ll need
- NVIDIA GPU with CUDA support (at least 8GB VRAM needed for the training)
- About 30GB of free disk space
- About 10 minutes of the voice you want to train the AI model on
- A recording of your own voice that will then be converted into AI voice

Download Links
1. Prepare input voice with Audacity
2. Train Model with RVC-Project (RVC-Beta)
Note: ““ always includes the latest version of the tool. If you want to use the exact same version as in the video, download this one:
3. Use Model with RVC-GUI

Optional
If you want to dive deeper into RVC-Project, check out the documentation on GitHub:

Thank you to everyone who has contributed to RVC-Project and RVC-GUI!

If you appreciate my videos, you can buy me a coffee:

My PC Components:
(Disclosure: As an Amazon Associate, I earn from qualifying purchases. Clicking on and purchasing products through these links won’t cost you any extra. They help support this channel and allow me to continue providing valuable content)
My GPU: ZOTAC Gaming GeForce RTX 4090 AMP Extreme (Affiliate Link) (Alternative: Zotac NVIDIA GeForce RTX 4090 Trinity (Affiliate Link))
My CPU: INTEL CORE I9-13900KF (Affiliate Link)
My SSD: WD_BLACK SN850X NVMe SSD 2TB (Affiliate Link)
My RAM: 64GB 2x32GB DDR5 6400MHz (Affiliate Link)
My Microphone: Razer Seiren V2 X USB Microphone (Affiliate Link)

Chapters:
00:00 Introduction
01:44 Step 1 - Prepare Input Voice
03:11 Step 2 - Train Voice Model in RVC V2
07:55 Step 3 - Use Voice Model in RVC GUI
11:00 Final Result
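
Helper script (optional):
The troubleshooting tips and Update 4 above describe a few file chores: keeping folder paths free of spaces and special characters, cutting the audio into clips shorter than 10 seconds, and zipping the .pth model together with the “added...” .index file. Below is a minimal Python sketch of how these could be scripted with the standard library only. It is not part of RVC-Project or RVC-GUI. The function names, the set of characters treated as "safe", and the 9-second default are assumptions made for illustration, and the splitter only handles uncompressed WAV files.

```python
import re
import wave
import zipfile
from pathlib import Path

# Assumption: "no spaces or special characters" is interpreted as ASCII letters,
# digits, '_', '-', '.', path separators and the drive colon only.
_SAFE_PATH = re.compile(r"[A-Za-z0-9_\-./\\:]+")


def path_is_safe(path):
    """Return True if the path contains no spaces or special characters."""
    return _SAFE_PATH.fullmatch(str(path)) is not None


def split_wav(src, out_dir, max_seconds=9.0):
    """Cut a WAV file into consecutive clips of at most max_seconds each."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    clips = []
    with wave.open(str(src), "rb") as reader:
        params = reader.getparams()
        frames_per_clip = max(1, int(reader.getframerate() * max_seconds))
        index = 0
        while True:
            frames = reader.readframes(frames_per_clip)
            if not frames:
                break  # end of the input file
            clip_path = out_dir / f"clip_{index:04d}.wav"
            with wave.open(str(clip_path), "wb") as writer:
                # Same channels, sample width and rate as the source;
                # the frame count in the header is corrected on close.
                writer.setparams(params)
                writer.writeframes(frames)
            clips.append(clip_path)
            index += 1
    return clips


def package_model(pth_file, log_dir, zip_path):
    """Zip a .pth model together with the 'added...' .index file from log_dir."""
    index_files = sorted(Path(log_dir).glob("added*.index"))
    if not index_files:
        raise FileNotFoundError(f"no 'added*.index' file found in {log_dir}")
    with zipfile.ZipFile(zip_path, "w", zipfile.ZIP_DEFLATED) as archive:
        archive.write(pth_file, arcname=Path(pth_file).name)
        archive.write(index_files[0], arcname=index_files[0].name)
    return Path(zip_path)
```

For example, `split_wav("voice.wav", "clips")` would write clip_0000.wav, clip_0001.wav and so on into a clips folder, and `package_model("lecturer.pth", "logs/lecturer", "lecturer.zip")` would bundle the model with its index file (these file names are placeholders). The splitter cuts at fixed intervals, so a cut can land in the middle of a word; cutting at pauses by hand in Audacity may be preferable.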