The JSUT single speaker corpus is part of the JSUT collection for Japanese speech corpora connecting speech, song, and audio events. The transcription metadata is modeled after LJSpeech, making it compatible with most speech synthesis projects. Tiny: Total clips: 285 Min duration: 3.019 secs Max duration: 9.462 secs Mean duration: 4.871 secs Total duration: 00:23:08 Small: Total clips: 8812 Min duration: 3.007 secs Max duration: 14.431 secs Mean duration: 4.951 secs Total duration: 12:07:12 Large: Total clips: 22910 Min duration: 3.007 secs Max duration: 14.988 secs Mean duration: 4.984 secs Total duration: 31:42:54 X Large: Total clips: 43253 Min duration: 3.007 secs Max duration: 14.988 secs Mean duration: 4.993 secs Total duration: 59:59:40 The Kokoro Speech Datasets contains about 43,253 audio recordings for 14 novel books taken from Aozora Bunko. CSS10 Japanese is a subset of CSS10 and contains about 14 hours of audio files for 明暗 (Meian). All audio recordings is based on the audio books from LibriVox. Each audio clip is about 2 to 18 seconds in length. Jejueo Single Speaker Speech Datasets is part of the initiative by the Center for Jeju Studies. Jejueo, also known as Jeju language, is a Korean language used on the Jeju Island. There are about 12,853 audio clips, resulting in more than 12 hours of audio data. The audio recordings are recorded by a professional female voice actress reading from selected books. KSS is one of the first publicly available datasets for Korean. Hence, a researcher took the initiative to split the audio recordings and re-align the transcriptions. In the original version, each audio file is simply too long as input data for text-to-speech training. The World English Bible is a revised version of audio bibles recording provided by AudioTreasure. Each clips consist of 1 to 10 seconds, resulting in more than 23 hours of audio data. There are about 13,100 audio clips based on 7 non-fiction books. LJSpeech is one of the most commonly used datasets for text-to-speech.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |