Personal tools

Corpora

From SynSIG

By default, each corpus is text/audio. Some corpora contain more information. In this case, they are classified under a dedicated section (i.e. Audiovisual for audiovisual speech)

Contents

Multilingual

Idlak/Living-Audio-Dataset

https://github.com/Idlak/Living-Audio-Dataset

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

https://github.com/Kyubyong/CSS10

The Simple4All Tundra Corpus

http://tundra.simple4all.org


Language specific

English

Alba speech corpus

https://doi.org/10.7488/ds/2506

Parallel Audiobook Corpus

https://doi.org/10.7488/ds/2468

VCTK

https://doi.org/10.7488/ds/2645

Hurricane natural speech corpus - higher quality version

https://doi.org/10.7488/ds/2482

Repeated Harvard Sentence Prompts corpus version 0.5

https://doi.org/10.7488/ds/39

The Voice Conversion Challenge 2018 database

https://doi.org/10.7488/ds/2337

The Voice Conversion Challenge 2016 database

https://doi.org/10.7488/ds/1575

LibriTTS corpus

http://www.openslr.org/60/

CSTR NAM TIMIT Plus corpus

http://homepages.inf.ed.ac.uk/jyamagis/page3/page57/page57.html

The LJ Speech Dataset

https://keithito.com/LJ-Speech-Dataset/

DAPS (Device and Produced Speech) Dataset - A dataset of professional production quality speech and corresponding aligned speech recorded on common consumer devices

https://archive.org/details/daps_dataset


French

The SIWIS French Speech Synthesis Database

https://doi.org/10.7488/ds/1705


Japanese

JVS corpus

https://sites.google.com/site/shinnosuketakamichi/research-topics/jvs_corpus

JSUT corpus

https://sites.google.com/site/shinnosuketakamichi/publication/jsut


Norwegian

NB Tale - a basic acoustic phonetic speech database for Norwegian

https://www.nb.no/sprakbanken/show?serial=oai:nb.no:sbr-31&lang=en

Tuva Speech Database

https://www.nb.no/sprakbanken/show?serial=oai:nb.no:sbr-44&lang=en

Romanian

Romanian Speech Synthesis (RSS) Database

http://romaniantts.com/rssdb/

The SWARA Corpus

https://speech.utcluj.ro/swarasc/


Articulatory data

mngu0

http://www.mngu0.org

Audiovisual

TCD-TIMIT (audio visual multi-speaker database)

https://sigmedia.tcd.ie/TCDTIMIT/node/1

The GRID audiovisual sentence corpus

http://spandh.dcs.shef.ac.uk/gridcorpus/

Non classified

PTDB-TUG: Pitch Tracking Database from Graz University of Technology

https://www.spsc.tugraz.at/databases-and-tools/ptdb-tug-pitch-tracking-database-from-graz-university-of-technology.html


SynSIG is a Special Interest Group of ISCA, the International Speech Communication Association.

SynSIG 1998-2020