DiPCo (Dinner Party Corpus)

English far-field conversational corpus of 10 dinner-party sessions (4 participants each, 15-45 minutes per session), recorded with one close-talk microphone per participant plus five far-field devices, each carrying a 7-microphone array. It is designed for noise-robust distant ASR and diarization.

Recommendation

Pick this corpus for benchmarking far-field multi-microphone ASR, speaker diarization, and source separation (the "cocktail-party" problem) in informal conversational English. The corpus is small (~5h total), so use it as an evaluation set rather than as primary training data.

Getting the data

Obtain the corpus from the dataset homepage.

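A Zenodo mirror is available at https://zenodo.org/records/8122551. Before plugging the data into a pipeline, verify that the array geometry (number of devices, microphones per device, channel layout) matches what the pipeline expects.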
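One inexpensive part of that check is reading the WAV headers before any heavy processing starts. The sketch below is not part of DiPCo or VoxKitchen. It assumes the audio is stored as PCM WAV files somewhere under a local directory, and the function names summarize_wavs and check_expectations are hypothetical. The expected channel count and sample rate are parameters you supply from the corpus documentation; the code does not know the real values.

```python
import wave
from pathlib import Path


def summarize_wavs(root):
    """Map each WAV file under root to (channels, sample_rate_hz, duration_s).

    Assumption: files are uncompressed PCM WAV, which is all the
    standard-library wave module can read.
    """
    summary = {}
    for path in sorted(Path(root).rglob("*.wav")):
        with wave.open(str(path), "rb") as wav:
            channels = wav.getnchannels()
            rate = wav.getframerate()
            duration = wav.getnframes() / rate
        # Key by path relative to root so reports stay short and portable.
        summary[str(path.relative_to(root))] = (channels, rate, duration)
    return summary


def check_expectations(summary, expected_channels, expected_rate):
    """Return the files whose header disagrees with the pipeline's expectations."""
    return [
        name
        for name, (channels, rate, _duration) in summary.items()
        if channels != expected_channels or rate != expected_rate
    ]
```

A typical use is to call summarize_wavs on the extracted corpus directory, pass the result to check_expectations with the values your pipeline assumes, and treat a non-empty result as a reason to stop and inspect the layout. Header checks only cover channel count and sample rate; microphone positions still have to be confirmed against the corpus documentation.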

Suggested processing

A recommended VoxKitchen pipeline ships in the repository at voxkitchen/templates/pipelines/speaker-analysis.yaml; run it with vkit docker run.