Open Speech and Language Resources



Contact
dpovey@gmail.com
Phone: 425 247 4129
(Daniel Povey)

High-quality TTS data for Sundanese.

Identifier: SLR44

Summary: Multi-speaker TTS data for Sundanese (su-ID)

Category: Speech

License: Attribution-ShareAlike 4.0 (CC BY-SA 4.0)

Downloads (use a mirror closer to you):
su_id_female.zip [861M]   (Sundanese data from female speakers)   Mirrors: [China]
su_id_male.zip [610M]   (Sundanese data from male speakers)   Mirrors: [China]
LICENSE [20K]   (License information)   Mirrors: [China]

About this resource:

This dataset contains high-quality transcribed audio data for Sundanese. It consists of WAV files and a TSV file. Each line of the file line_index.tsv contains a filename and the transcription of the audio in that file. Each filename is prefixed with a speaker identification number.
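
As a rough illustration of how the index might be consumed, the Python sketch below reads a line_index.tsv-style stream and groups utterances by speaker. It rests on assumptions not confirmed by this page: that each line holds exactly two tab-separated fields (filename, then transcription), and that the speaker identifier is everything before the final underscore in the filename. The helper names and the sample rows are made up for the example; check a few lines of the real file and adjust before relying on it.

```python
import csv
import io
from collections import defaultdict


def read_line_index(handle):
    """Yield (filename, transcription) pairs from a line_index.tsv-style stream.

    ASSUMPTION: two tab-separated columns per line, filename first.
    """
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) < 2:
            continue  # skip blank or malformed lines
        yield row[0].strip(), row[1].strip()


def speaker_of(filename):
    """Return the speaker prefix of a filename.

    ASSUMPTION: the speaker ID is everything before the last underscore.
    The real naming scheme may differ; inspect the data first.
    """
    return filename.rsplit("_", 1)[0]


def group_by_speaker(pairs):
    """Map each speaker prefix to its list of (filename, transcription)."""
    groups = defaultdict(list)
    for filename, text in pairs:
        groups[speaker_of(filename)].append((filename, text))
    return dict(groups)


# Made-up sample rows, NOT taken from the real data set.
SAMPLE = (
    "spk01_utt0001\tfirst example sentence\n"
    "spk01_utt0002\tsecond example sentence\n"
    "spk02_utt0001\tthird example sentence\n"
)

if __name__ == "__main__":
    pairs = list(read_line_index(io.StringIO(SAMPLE)))
    for speaker, items in group_by_speaker(pairs).items():
        print(speaker, len(items))
```

To run this against the real index, replace the in-memory sample with `open("line_index.tsv", encoding="utf-8", newline="")`, assuming the file is UTF-8 encoded.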

The dataset has been manually quality-checked, but it may still contain errors.

This dataset was collected by Google in collaboration with Universitas Pendidikan Indonesia.

See LICENSE file for license information.

Copyright 2016, 2017, 2018 Google LLC