High quality TTS data for four South African languages (af, st, tn, xh)
Identifier: SLR32
Summary: Multi-speaker TTS data for four South African languages: Afrikaans, Sesotho, Setswana and isiXhosa.
Category: Speech
License: Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Downloads (use a mirror closer to you):
af_za.tar.gz [950M] (Audio files and transcriptions for Afrikaans) Mirrors: [US] [EU] [CN]
st_za.tar.gz [724M] (Audio files and transcriptions for Sesotho) Mirrors: [US] [EU] [CN]
tn_za.tar.gz [729M] (Audio files and transcriptions for Setswana) Mirrors: [US] [EU] [CN]
xh_za.tar.gz [907M] (Audio files and transcriptions for isiXhosa) Mirrors: [US] [EU] [CN]
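Once an archive has been downloaded, it can be unpacked with any standard tar tool. The following is a minimal Python sketch, assuming the archive is already on local disk. The function name extract_archive and the destination directory are hypothetical, and the sketch makes no assumption about how the files inside the archive are laid out; it only returns the member names it finds.

```python
import tarfile
from pathlib import Path


def extract_archive(archive_path, dest_dir):
    """Unpack a .tar.gz archive into dest_dir and return its member names.

    Hypothetical helper for illustration only; it is not part of the data set.
    """
    dest = Path(dest_dir).resolve()
    dest.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive_path, "r:gz") as tar:
        members = tar.getmembers()
        # Refuse members that would be written outside the destination.
        for member in members:
            target = (dest / member.name).resolve()
            if target != dest and dest not in target.parents:
                raise ValueError(f"unsafe path in archive: {member.name}")
        tar.extractall(dest)
    return [member.name for member in members]


# Example call (assumes af_za.tar.gz sits in the current directory):
# names = extract_archive("af_za.tar.gz", "data/af_za")
# print(len(names), "entries extracted")
```

Listing the returned names is a quick way to see how the audio files and transcriptions are organised before writing any loading code.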
About this resource:
The data set has had some quality checks, but there might still be errors.
This data set was collected as a collaboration between North West University and Google.
See LICENSE.txt file for license information.
Copyright 2017 Google, Inc.
If you use this data in publications, please cite it as follows:
@inproceedings{van-niekerk-etal-2017,
  title = {{Rapid development of TTS corpora for four South African languages}},
  author = {Daniel van Niekerk and Charl van Heerden and Marelie Davel and Neil Kleynhans and Oddur Kjartansson and Martin Jansche and Linne Ha},
  booktitle = {Proc. Interspeech 2017},
  pages = {2178--2182},
  address = {Stockholm, Sweden},
  month = aug,
  year = {2017},
  URL = {http://dx.doi.org/10.21437/Interspeech.2017-1139}
}