2 datasets found

Licenses: Creative Commons Attribution Tags: gated

Filter Results
  • Kiswahili Monolingual Corpus

    This dataset contains 100,000 Kiswahili sentences. We want to thank the team at the Makerere AI and Marconi Labs at Makerere University, TAVODET Youth Development (TYD)...
  • Acoli Monolingual Corpus

    Acoli is a very low-resourced language spoken in parts of Northern Uganda. This dataset contains 40,037 Acoli sentences. The sentences were collected and evaluated by Acoli...
You can also access this registry using the API (see API Docs).