TIMIT Acoustic-Phonetic Continuous Speech Corpus

630 American English speakers across 8 dialect regions, each reading 10 phonetically rich sentences, with time-aligned phonetic and word transcriptions.

Task: asr
Languages: en
Hours: 5
Domain: phonetic read
License: see source terms
Homepage: https://catalog.ldc.upenn.edu/LDC93S1

Recommendation

Canonical benchmark for phonetic recognition, acoustic-phonetic studies, and dialect/phoneme analysis. Pick it for small-scale academic work where comparability with previously published results matters. Avoid it for modern large-scale ASR training: the corpus is only about 5 hours of read speech, dates from 1993, and requires a paid LDC license.

Getting the data

Obtain the corpus from the dataset homepage (https://catalog.ldc.upenn.edu/LDC93S1).

It is a paid LDC distribution, catalog number LDC93S1, DOI 10.35111/17gk-bn40.

Suggested processing

A recommended VoxKitchen pipeline ships in the repository at voxkitchen/templates/pipelines/asr-training-data.yaml. Run it with vkit docker run.
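
For work that uses the time-aligned phonetic and word transcriptions directly, the following is a minimal Python sketch of reading one alignment file. It is not part of the VoxKitchen pipeline. It assumes the commonly described TIMIT layout, in which each line of a .PHN or .WRD file holds a start sample, an end sample, and a label, and the audio is sampled at 16 kHz; verify both against the corpus documentation. The names Segment and parse_alignment are hypothetical, and the sample text is illustrative only.

```python
from typing import List, NamedTuple

# Assumption: TIMIT audio is sampled at 16 kHz; check the corpus docs.
SAMPLE_RATE_HZ = 16_000


class Segment(NamedTuple):
    start_s: float  # segment start, in seconds
    end_s: float    # segment end, in seconds
    label: str      # phone or word label


def parse_alignment(text: str, sample_rate: int = SAMPLE_RATE_HZ) -> List[Segment]:
    """Parse lines of the assumed form '<start_sample> <end_sample> <label>'."""
    segments = []
    for line in text.splitlines():
        parts = line.split(maxsplit=2)
        if len(parts) != 3:
            continue  # skip blank or malformed lines
        start, end, label = parts
        segments.append(
            Segment(int(start) / sample_rate, int(end) / sample_rate, label.strip())
        )
    return segments


# Illustrative alignment text, not copied from the corpus.
example = "0 16000 h#\n16000 24000 sh\n24000 28000 ix\n"

for seg in parse_alignment(example):
    print(f"{seg.label}: {seg.start_s:.3f}s to {seg.end_s:.3f}s")
```

To read a real file, pass its contents to the same function, for example parse_alignment(open(path).read()), where path points to an alignment file in your local copy of the corpus.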