RAVDESS

Acted emotional speech and song from 24 professional actors across 8 emotions at two intensity levels (1,440 speech audio files).

Task: emotion, speaker
Languages: en
Domain: acted emotional
License: CC BY-NC-SA 4.0
Homepage: https://zenodo.org/records/1188976
Paper: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0196391

Recommendation

A clean, balanced, widely-used benchmark for speech emotion recognition — good for reproducible, controlled experiments and quick baselines. Emotions are acted (not naturalistic), the corpus is small, and the non-commercial license prohibits commercial use.

Getting the data

Obtain from the dataset homepage.

Open access on Zenodo; non-commercial license.

Suggested processing

A recommended VoxKitchen pipeline ships in the repository at examples/pipelines/emotion-recognize.yaml — run it with vkit docker run.