AMI Meeting Corpus

~100 h of recorded English meetings with synchronized audio, video, and rich annotations including transcripts and speaker labels.

Task: asr, speaker
Languages: en
Hours: 100
Domain: meetings
License: CC BY 4.0
Homepage: https://groups.inf.ed.ac.uk/ami/corpus/
Paper: https://dl.acm.org/doi/10.1007/11677482_3

Recommendation

A standard benchmark for meeting-domain ASR, speaker diarization, and overlapping speech, recorded with both headset and far-field microphones. Choose it when you need multi-party conversational audio. At only ~100 h it is small for training from scratch, and a large share of the speech is non-native English.

Getting the data

Download the corpus from the dataset homepage listed above.

AMI is also available through OpenSLR (openslr.org/16) and the Hugging Face mirror edinburghcstr/ami.
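
As a minimal sketch of the Hugging Face route, the snippet below streams one split from the mirror with the datasets library rather than downloading the whole corpus first. The configuration names "ihm" (individual headset microphones) and "sdm" (single distant microphone), the split names, and the helper stream_ami are assumptions made for illustration; check the mirror's dataset card before relying on them.

```python
# Sketch only: requires `pip install datasets` and network access when called.
MIRROR_ID = "edinburghcstr/ami"

# Assumed configuration names on the mirror (not confirmed by this card):
# "ihm" = individual headset mics, "sdm" = single distant mic.
ASSUMED_CONFIGS = ("ihm", "sdm")


def stream_ami(config: str = "ihm", split: str = "train"):
    """Return a streaming dataset for one AMI split from the HF mirror.

    `stream_ami` is a hypothetical helper, not part of any library.
    """
    if config not in ASSUMED_CONFIGS:
        raise ValueError(
            f"unknown config {config!r}; expected one of {ASSUMED_CONFIGS}"
        )
    # Imported lazily so the module loads even without `datasets` installed.
    from datasets import load_dataset

    # streaming=True yields examples on the fly instead of caching all audio.
    return load_dataset(MIRROR_ID, config, split=split, streaming=True)
```

Calling next(iter(stream_ami("sdm", "validation"))) would then fetch a single far-field example, which is a cheap way to inspect the available fields before committing to a full download.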

Suggested processing

A recommended VoxKitchen pipeline ships in the repository at voxkitchen/templates/pipelines/speaker-analysis.yaml. Run it with vkit docker run.