4 datasets found

Formats: CSV

  • The Makerere Gendered Corpus: A Gendered English to Luganda Parallel Corpus

    This English-Luganda parallel sentence corpus consists of gendered examples created by a team of researchers from Makerere AI Lab at Makerere University with a team of Luganda...
  • Lumasaba Monolingual Corpus

    Lumasaba, sometimes known as Lugisu, is a Bantu language spoken in the eastern part of Uganda. This dataset contains a total of 39,999 sentences. The sentences are split into two...
  • Luganda Monolingual Corpus

    This dataset contains 100,000 Luganda sentences. Luganda is a Bantu language and is one of the major languages spoken in Uganda. This dataset was compiled by researchers at the...
  • Acoli Monolingual Corpus

    Acoli is a very low-resourced language spoken in parts of Northern Uganda. This dataset contains 40,037 Acoli sentences. The sentences were collected and evaluated by Acoli...