Open Speech and Language Resources

Phone: 425 247 4129
(Daniel Povey)

High quality TTS data for Bengali languages

Identifier: SLR37

Summary: Multi-speaker TTS data for Bangladesh Bengali (bn-BD) and Indian Bengali (bn-IN).

Category: Speech

License: License: Attribution-ShareAlike 4.0 (CC BY-SA 4.0)

Downloads (use a mirror closer to you): [586M]   (Bangladesh Bengali data )   Mirrors: [China] [416M]   (Indian Bengali data )   Mirrors: [China]  
README.txt [503 bytes]   (Information about the data )   Mirrors: [China]  
LICENSE.txt [20K]   (License information )   Mirrors: [China]  

About this resource:

This data is transcribed high-quality speech data for Bengali.

The data collection was perfomed by Google.

If you use this data in publications, please cite it as follows:

    title = {{A Step-by-Step Process for Building TTS Voices Using Open Source Data and Framework for Bangla, Javanese, Khmer, Nepali, Sinhala, and Sundanese}},
    author = {Keshan Sodimana and Knot Pipatsrisawat and Linne Ha and Martin Jansche and Oddur Kjartansson and Pasindu De Silva and Supheakmungkol Sarin},
    booktitle = {Proc. The 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages (SLTU)},
    year  = {2018},
    address = {Gurugram, India},
    month = aug,
    pages = {66--70},
    URL   = {}