Open Speech and Language Resources



Contact
dpovey@gmail.com
Phone: 425 247 4129
(Daniel Povey)

Thorsten Müller (TTS)

Identifier: SLR95

Summary: Free single german speaker dataset (> 23 hours) by Thorsten Müller (voice) and Dominik Kreutz (audio optimization) for tts training

Category: Speech

License: Creative Commons (CC0) Licence

Download: thorsten-de_v02.tgz [3.0G]   ( German speech and transcripts )   Mirrors: [US]  

About this resource:

I contribute my personal voice as a person believing in a world where all people are equal. No matter of gender, sexual orientation, religion, skin color and geocoordinates of birth location. A global world where everybody is warmly welcome on any place on this planet and open and free knowledge and education is available to everyone. (Thorsten Müller)

Dataset details:
  • recordings from just one male native german speaker (Thorsten Müller)
  • audio optimized (Dominik Kreutz)
  • ljspeech-1.1 structure
  • 22.668 recorded phrases (wav files)
  • more than 23 hours of pure audio
  • samplerate 22.050Hz
  • mono
  • normalized to -24dB
  • phrase length (min/avg/max): 2 / 52 / 180 chars
  • no silence at beginning/ending
  • avg spoken chars per second: 14
  • sentences with question mark: 2.780
  • sentences with exclamation mark: 1.840
Based on this dataset there're several pretrained tts (text-to-speech) models freely available.

Additional links tl;dr Please don't use for evil!

External URL: https://github.com/thorstenMueller/deep-learning-german-tts/   Free german "thorsten" dataset