CREMA-D

7,442 acted audio-visual emotional clips from 91 demographically diverse actors speaking 12 sentences in 6 emotions at 4 intensity levels.

Task: emotion, speaker
Languages: en
Domain: acted emotional
License: ODbL 1.0 (database) + DbCL 1.0 (contents)
Homepage: https://github.com/CheyneyComputerScience/CREMA-D
Paper: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4313618/

Recommendation

Good for demographically diverse acted emotion recognition and audio-visual affect work, with crowd-sourced perceptual ratings included. Choose it when speaker/ethnic diversity matters; emotion is acted and the corpus is modest in size.

Getting the data

Obtain from the dataset homepage.

Openly available on GitHub under Open Data Commons licenses.

Suggested processing

A recommended VoxKitchen pipeline ships in the repository at examples/pipelines/emotion-recognize.yaml — run it with vkit docker run.