Recent Updates
-
[July-Aug 2024:] I was invited to be a speaker at the ecology workshop of Deep Learning IndabaX Uganda 2024 in Kampala, Uganda. I talked about how we can leverage the use of AI in conservation preservation.
-
[July 2024:] I attended the inaugural African Computer Vision Summer School in Nairobi, Kenya. The summer school entailed an intense 10 days where we had lectures and practicals on dataset construction, advanced architectures for vision, visual representation learning, generative modelling, video understanding, shift, domain adaptation and ethics-ecology. There was also a hackathon where we got to apply lessons we were learning from the summer school and my team won the Research Impact track!
-
[June 2024:] I helped organize, prepare material and host Data Science Africa 2024 in Nyeri, Kenya. It was a pleasure to work with Dr. Adnrew Katumba on the Computer Vision Session where we worked on the use of Roboflow for data annotation and the use of YOLO v8 to classify, detect and segment images from the conservation and health domains. It was also a pleasure to assist Prof. Justin Dauwels with the Generative AI session.
|
|
Analysis of women representation in STEM in Africa
Gabriel Kiarie,
Lorna Mugambi,
Jason Kabi,
Ciira wa Maina
7th DeKUT International Conference on Science, Technology, Innovation and Entrepreneurship, November, 2023.
Paper
Girls and women have consistently been underrepresented in most Science, Technology, Engineering, and Mathematics (STEM) professions, necessitating research [1]. There is a need to define and execute measures and policies to help reduce this gap [2]. The Centre for Data Science and Artificial Intelligence (DSAIL), in collaboration with Gender Justice in STEM Research in Africa (GeJuSTA), is conducting studies to analyse a the representation of women in STEM in Africa. The study will be used to guide the development of policies and curricula aimed at bridging the gap of women representation in STEM. The methods used in this study are analysing the genders of members of staffs in STEM faculties from African universities; analysing the genders of STEM-papers’ authors from African universities and; conducting literature review to evaluate existing measures that have been put in place to encourage and enable women to join STEM professions. Preliminary results show that women are underrepresented in STEM fields in Africa.
|
|
The use of Open-Source Boards for Data Collection and Machine Learning in Remote Deployments
Gabriel Kiarie,
Jason Kabi,
Lorna Mugambi,
Ciira wa Maina
2023 IEEE AFRICON, September.
Paper
Machine learning is being adopted in many walks of life to solve various problems. This is being driven by development of robust machine learning algorithms, availability of large datasets and low-cost computation resources. Some machine learning applications require deployment of devices off-the-grid for data collection and processing. Such applications require development of systems that can operate autonomously during their deployment. This paper presents how some open-source boards have been leveraged for off-grid data collection and machine learning. Advancement in technology has seen development of low-cost and low-power open-source boards that can be interfaced with a wide array of sensors for data collection and can perform computation processes. The boards are finding wide applications in data collection and machine learning initiatives. A wide array of open source boards exists in the market. The boards can generally be divided into micro controllers, single board computers and field programmable gate arrays. These boards have different properties in terms of processing capabilities, power consumption, and communication interfaces and features. For off-grid data collection and machine learning tasks, resources such as power and network for communication are limited in most cases. These factors should be considered when choosing boards for off-grid deployment tasks. The boards chosen should optimise the use of these resources while meeting the processing capabilities required for the tasks at hand.
|
|
Efficient Camera Trap Image Annotation Using YOLOv5
Yuri Njathi,
Lians Wanjiku,
Lorna Mugambi,
Jason Kabi,
Gabriel Kiarie,
Ciira wa Maina
2023 IEEE AFRICON, September.
Paper
Using camera traps to acquire wildlife images is becoming more common within conservancies. The information provided by these camera traps enhances understanding of wildlife behaviour and population patterns. The detection and counting of animals present in each of the captured images is valuable information as it can be used to guide conservation efforts. Manual annotation of these wildlife images is a tedious painful process. It is becoming more common to use tools that either use AI to annotate camera trap datasets or use AI to aid in annotation. These AI tools are usually trained on species endemic to a particular region. The ability to fine-tune such models to species endemic to one's particular region is important to save much of the time conservationists manually look through the misclassified images. In this paper, we present a case study where we used a YOLOv5 object detection model trained to detect the presence and count the number of impala and other animals from a dataset collected by researchers at the Dedan Kimathi University of Technology Conservancy. We analyze the results of the AI's performance with respect to a manually annotated dataset. The model was able to annotate 72% of the dataset at a human level of accuracy. The work here shows promise with regard to time spent labelling camera trap images by leveraging the presence of particular species to auto-annotate a majority of the dataset.
|
|
Unsupervised Discovery of Echocardiographic Views for Rheumatic Heart Disease Diagnosis
Yuri Njathi,
Lorna Mugambi,
Ciira wa Maina,
Liesl Zühlke
2023 IST-Africa Conference (IST-Africa), May.
Paper
RHD is a cardiovascular disease that causes damage to the heart valves. If the damage is severe it is rectified using expensive valve replacement surgery. Early diagnosis of the disease allows for cost-friendly preventive measures. Specific views of the heart are required for proper assessment by heart specialists. Since routine screening is recommended for the rapid early identification of RHD, a large amount of patient data is generated. To handle this influx of data, trained AI is being used to automate view classification, unfortunately, the high cost of obtaining expert-labelled data in terms of time and money is prohibitive. Thus, we explore how the use of unsupervised AI methods can aid experts in the faster labelling of the data and what patterns PCA and agglomerative clustering identify in echo videos. We found that after appropriate preprocessing, these unsupervised methods can group videos with similar echocardiographic views. We also found that these methods were sensitive to the specific machines used to acquire the data and therefore care should be taken when applying them to data collected using different machines.
|
|
DSAIL-Porini: Annotated camera trap image data of wildlife species from a conservancy in Kenya
Lorna Mugambi,
Jason Kabi,
Gabriel Kiarie,
Ciira wa Maina
Data in Brief, February.
Paper
For years, zoologists, ecologists, and researchers at large have been using instruments such as camera traps in acquiring images of wild animals non-intrusively for ecological research. The main reason behind ecological research is to increase the understanding of various interactions in ecosystems while providing supporting data and information. Due to climate change and the destruction of animal habitats in recent years, researchers have been conducting studies on diminishing populations of various species of interest and the effectiveness of habitat restoration practices. By collecting and examining wild animal image data, inferences such as the health, breeding rate, and population of a particular species can be made. This paper presents an annotated camera trap dataset, DSAIL-Porini1, consisting of images of wildlife species captured in a conservancy in Nyeri, Kenya. 6 wildlife species are captured in this dataset: impala, bushbuck, Sykes’ monkey, defassa waterbuck, common warthog, and Burchell's zebra. This dataset was collected using camera traps based on the Raspberry Pi 2, Raspberry Pi Zero, and OpenMV Cam H7. It provides an example of images collected using relatively low-cost hardware platforms. The image dataset can be used in training and testing object detection and classification machine learning models.
|
|
Towards AI Based Diagnosis of Rheumatic Heart Disease: Data Annotation and View Classification
Lorna Mugambi,
Ciira wa Maina,
Liesl Zühlke
2022 IST-Africa Conference (IST-Africa), May.
Paper
Rheumatic Heart Disease is a cardiovascular disease highly prevalent in developing countries partially because of inadequate healthcare infrastructure to treat Group A streptococcus pharyngitis and thereafter diagnose and document every case of Acute Rheumatic Fever, the immune-mediated antecedent of rheumatic heart disease. Secondary antibiotic treatment with penicillin injections after a diagnosis of Acute Rheumatic Fever and Rheumatic Heart Disease is used to prevent further attacks of Strep A, preferably prior to any heart valve damage. Echocardiographic screening for early detection of Rheumatic Heart Disease has been proposed as a method to improve outcomes but it is time-consuming, costly and few people are skilled enough to reach a correct diagnosis. Machine Learning is an emerging tool in analysing medical images; our aim is to automate the screening process of diagnosing rheumatic heart disease. In this paper, we present a web application to be used to label echocardiography data. These labelled data can then be used to develop machine learning models that can classify echocardiographic views of the heart and damaged valves from the echocardiograms.
|
|
DSAIL-Porini: Annotated camera trap images of wildlife species from a conservancy in Kenya
Lorna Mugambi,
Gabriel Kiarie,
Jason Kabi,
Ciira wa Maina
Mendeley Data, March 2022.
Dataset
This dataset has camera trap images of wildlife species from a conservancy in Kenya and their annotation. They are based on the Raspberry Pi 2, Raspberry Pi Zero and the OpenMV Cam H7 devices. The camera traps were deployed in the conservancy from June 2021 to December 2021. We have 6 categories of grazing mammals in this dataset; Burchell's zebra, Defassa waterbuck, bushbuck, Common warthog, impala and the Syke's monkey.
|
|