Deep Learning-Based Analysis of a Real-Time Voice Cloning System


Authors : Ndjoli Isaaka Luc Emmanuel; Kusha K. R

Volume/Issue : Volume 8 - 2023, Issue 7 - July

Google Scholar : https://bit.ly/3TmGbDi

Scribd : https://tinyurl.com/5n8vy42t

DOI : https://doi.org/10.5281/zenodo.8224163

Abstract : Rapid advances in computing have made voice technology a hotspot for deep learning research. A goal of human-computer interaction is to give computers the ability to feel, see, hear, and speak. Voice is regarded as the most promising channel for future human-computer interaction because of the advantages it offers over other methods. Voice cloning is one such technology: it imitates the voice of a particular person. Earlier voice cloning approaches required a large number of samples and a long time to produce a cloned voice; to address this, real-time voice cloning from only a few samples is proposed. This strategy deviates from the conventional model. Different databases and models are used for independent training, but only three models are used for joint modeling. The vocoder uses a novel variant of LPCNet that works well on certain samples and on low-performance devices.

Keywords : Real-Time, Samples, Voice Cloning.

