Skip to content

Earnings-21

39 hours of 44 English-language earnings calls from 2020 across nine financial sectors, professionally transcribed by Rev.com for benchmarking ASR on named-entity-dense speech.

Recommendation

Use as an entity-dense ASR evaluation benchmark for long-form financial/business audio, especially when testing proper-noun and ticker handling. Small at 39h — best as an eval/probe set, not for large-scale training.

Getting the data

Obtain from the dataset homepage.

Suggested processing

A recommended VoxKitchen pipeline ships in the repository at voxkitchen/templates/pipelines/asr-training-data.yaml — run it with vkit docker run.