openslr.org

Open Speech and Language Resources

High quality TTS data for Bengali languages

Identifier: SLR37

Summary: Multi-speaker TTS data for Bangladesh Bengali (bn-BD) and Indian Bengali (bn-IN).

Category: Speech

License: License: Attribution-ShareAlike 4.0 (CC BY-SA 4.0)

Downloads (use a mirror closer to you):
bn_bd.zip [586M] (Bangladesh Bengali data ) Mirrors: [US] [EU] [CN]
bn_in.zip [416M] (Indian Bengali data ) Mirrors: [US] [EU] [CN]
README.txt [503 bytes] (Information about the data ) Mirrors: [US] [EU] [CN]
LICENSE.txt [20K] (License information ) Mirrors: [US] [EU] [CN]

About this resource:

This data is transcribed high-quality speech data for Bengali.

The data collection was perfomed by Google.

If you use this data in publications, please cite it as follows:

  @inproceedings{kjartansson-etal-tts-sltu2018,
    title = {{A Step-by-Step Process for Building TTS Voices Using Open Source Data and Framework for Bangla, Javanese, Khmer, Nepali, Sinhala, and Sundanese}},
    author = {Keshan Sodimana and Knot Pipatsrisawat and Linne Ha and Martin Jansche and Oddur Kjartansson and Pasindu De Silva and Supheakmungkol Sarin},
    booktitle = {Proc. The 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages (SLTU)},
    year  = {2018},
    address = {Gurugram, India},
    month = aug,
    pages = {66--70},
    URL   = {http://dx.doi.org/10.21437/SLTU.2018-14}
  }