Crowdsourced high-quality Tamil multi-speaker speech data set.
Identifier: SLR65
Summary: Data set which contains recordings of native speakers of Tamil.
Category: Speech
License: Attribution-ShareAlike 4.0 International
Downloads (use a mirror closer to you):
about.html [1.4K] (Information about the data set
) Mirrors:
[US]
[EU]
[CN]
LICENSE [20K] (License information for the data set
) Mirrors:
[US]
[EU]
[CN]
line_index_female.tsv [447K] (Lines recorded by the female speakers
) Mirrors:
[US]
[EU]
[CN]
line_index_male.tsv [380K] (Lines recorded by the male speakers
) Mirrors:
[US]
[EU]
[CN]
ta_in_female.zip [769M] (Archive containing recordings from female speakers
) Mirrors:
[US]
[EU]
[CN]
ta_in_male.zip [603M] (Archive file recordings from male speakers
) Mirrors:
[US]
[EU]
[CN]
About this resource:
The data set has been manually quality checked, but there might still be errors.
Please report any issues in the following issue tracker on GitHub. https://github.com/googlei18n/language-resources/issues
See LICENSE file for license information.
Copyright 2018, 2019 Google, Inc.
If you use this data in publications, please cite it as follows:
@inproceedings{he-etal-2020-open, title = {{Open-source Multi-speaker Speech Corpora for Building Gujarati, Kannada, Malayalam, Marathi, Tamil and Telugu Speech Synthesis Systems}}, author = {He, Fei and Chu, Shan-Hui Cathy and Kjartansson, Oddur and Rivera, Clara and Katanova, Anna and Gutkin, Alexander and Demirsahin, Isin and Johny, Cibu and Jansche, Martin and Sarin, Supheakmungkol and Pipatsrisawat, Knot}, booktitle = {Proceedings of The 12th Language Resources and Evaluation Conference (LREC)}, month = may, year = {2020}, address = {Marseille, France}, publisher = {European Language Resources Association (ELRA)}, pages = {6494--6503}, url = {https://www.aclweb.org/anthology/2020.lrec-1.800}, ISBN = "{979-10-95546-34-4}, }