AI4D - African Language Dataset Challenge

Kathleen Siminyu, Sackey Freshia


Abstract
As language and speech technologies become more advanced, the lack of fundamental digital resources for African languages, such as data, spell checkers and PoS taggers, means that the digital divide between these languages and others keeps growing. This work details the organisation of the AI4D - African Language Dataset Challenge, an effort to incentivize the creation, curation and uncovering to African language datasets through a competitive challenge, particularly datasets that are annotated or prepared for use in a downstream NLP task.
Anthology ID:
2020.winlp-1.18
Volume:
Proceedings of the The Fourth Widening Natural Language Processing Workshop
Month:
July
Year:
2020
Address:
Seattle, USA
Venues:
ACL | WS | WiNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
68–77
Language:
URL:
https://www.aclweb.org/anthology/2020.winlp-1.18
DOI:
10.18653/v1/2020.winlp-1.18
Bib Export formats:
BibTeX MODS XML EndNote
Video:
 http://slideslive.com/38929555