Dataset - CKAN

A dataset of necrotized cassava root cross-section images

This dataset contains images of cassava root cross-sections captured by the Makerere University Artificial Intelligence Lab in conjunction with the National Crop Resources...
- ZIP
Dataset of Crops part one

This dataset consists of five classes of data in the training set: cassava, sugarcane, maize, cashew, and coffee images. This is the first part, which consists of five out of...
Data of Crops Part Two

This is the second part of the data "https://doi.org/10.7910/DVN/J0OS9R". It consists of two classes that remained from the training set's data: weeds and unknown. Plus, the...
The Makerere Gendered Corpus: A Gendered English to Luganda Parallel Corpus

This English-Luganda parallel sentence corpus consists of gendered examples created by a team of researchers from Makerere AI Lab at Makerere University with a team of Luganda...
- CSV
Kiswahili Monolingual Corpus

This dataset contains 100,000 Kiswahili sentences. We want to thank the team at the Makerere AI and Marconi Labs at Makerere University, TAVODET Youth Development (TYD)...
- TXT
Lumasaba Monolingual Corpus

Lumasaba sometimes known as Lugisu is a Bantu language spoken in the Eastern part of Uganda. This dataset contains a total of 39,999 sentences. The sentences are split into two...
- CSV
Luganda Monolingual Corpus

This dataset contains 100,000 Luganda sentences. Luganda is a Bantu language and is one of the major languages spoken in Uganda. This dataset was compiled by researchers at the...
- CSV
Acoli Monolingual Corpus

Acoli is a very low-resourced language spoken in parts of Northern Uganda. This dataset contains 40,037 Acoli sentences. The sentences were collected and evaluated by Acoli...
- CSV

You can also access this registry using the API (see API Docs).

8 datasets found