Skip to content

AMI Meeting Corpus

~100 h of recorded English meetings with synchronized audio, video, and rich annotations including transcripts and speaker labels.

Recommendation

A standard benchmark for meeting-domain ASR, speaker diarization, and overlapping speech, with headset and far-field mic conditions. Choose it when you need multi-party conversational audio. Only ~100 h and largely non-native English, so it is small for training from scratch.

Getting the data

Obtain from the dataset homepage.

Also accessible via OpenSLR (openslr.org/16) and the HF mirror edinburghcstr/ami.

Suggested processing

A recommended VoxKitchen pipeline ships in the repository at voxkitchen/templates/pipelines/speaker-analysis.yaml — run it with vkit docker run.