Speech Synthesis - Creating Custom Voices [closed]

Tags:

speech-synthesis

Is it possible, programatically, to take someone's voice sample and produce a unique tone/property that could be used to create a synthesised speech?

For example, person A records himself. A unique tone is produced from this voice sample, and is being turned into synthesis speech. This allows people to use this synthetic voice in Text-to-Speech software, writing any text that they want that would be read in person A's voice.

Is it possible in today's terms? I know that there are companies that do this professionally, but generally, is it possible for a piece of software to do this?

557

asked Apr 08 '14 17:04

Travier

1 Answers

Using speaker adaptation methods you can achieve some results with comparably few training samples but still you should have some hundred sentences of the person - preferably with a phonetic transcription.

We once had this as a small lab exercise for students to record their own voices and train a voice model using HTS (http://hts.sp.nitech.ac.jp/). The "most simple" approach using HTS is to download the "Speaker dependent training demo" from this page and replace the training speech samples with your own recordings (of the same sentences!). We did this for another language with our own package though.

I think MaryTTS (http://mary.dfki.de/) has some more convenient tools to assist with this process but I've never worked with that.

But still - for high quality voices, you should have thousands of recorded sentences.

answered Sep 22 '22 16:09

Markus Toman

Related questions
                            
                                Launch app on voice command (android)
                            
                                What software or service can I use to programatically make phone calls with? [closed]
                            
                                Launch Google Now or phone default voice search?
                            
                                Is there a way to have change female to male voice during the conversation in DialogFlow (Api.ai)
                            
                                play raw audio file in python in realtime
                            
                                Does anyone know any service similar to Tropo?
                            
                                How can I control my application with built-in voice control?
                            
                                Android TTS Male Female Voice Change
                            
                                Android compare two sounds for phonetic matching
                            
                                Availability of installed voices for use by AVSpeechSynthesis in iOS
                            
                                IVR development [closed]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With