Data Collections

Appen provides customised end-to-end data collection services for:

  • Speech
  • Text
  • Handwriting
  • Multi-modal data
  • Other speech and language data

Appen's data collections have spanned the globe and covered a wide range of collection modes.

Data Collection Types and Locations

Appen has performed speech and language data collections in more than 80 languages across more than 40 countries in North and South-East Asia, North Africa, the Middle East, Europe, Scandinavia, and North and South America.

Data Collection Hardware and Software

Appen will recommend, source and manage the IT and telecommunications solution for a data collection. Appen has a range of recording equipment, including microphones, recording software, recording platforms, audio interface and visual display equipment, and telecommunications infrastructure.

Language and Linguistic Services

Appen will analyse and make recommendations on the language of collection. This includes analysis of the dialects of a language and recommendations on the proportions of speakers that should be collected from various dialectal groups and regions. Appen can also make recommendations on project details such as the number of speakers and demographic diversity. Appen currently has licensable language analysis documentation available for a number of languages.