Data for Good: AI for Wildlife Image Classification to Analyze Camera Trap Datasets

Human activity has put enormous pressure on natural habitats across the world. Many species are finding it difficult to survive in the era of climate change and environmental degradation. Animals forced into smaller areas succumb to new diseases, which can spread to humans and cause pandemics. The question we ask ourselves is how can AI and ML be implemented to stop this. Is there potential here for wildlife image classification? Biodiversity conservation practitioners and researchers are working to alleviate and reverse these processes through monitoring and protecting wild species. Increasingly they utilize cutting-edge technologies to support and streamline their work. Camera traps have become a ubiquitous tool for these purposes, and now machine learning offers further advancements by exponentially increasing the speed of wildlife image classification. As part of our Data for Good initiative, we were given the opportunity to contribute to the work of biodiversity conservationists at the National Parks Agency in Gabon in collaboration with experts from the University of Stirling. We are proud to announce that we will use our computer vision skills in a project which recently received support from the Google for Education fund. Read on to learn more about AI-assisted biodiversity conservation and our project to protect wildlife in Gabon.
Camera traps – opportunities and challenges
Camera trap technology (automated photography of animals) is becoming more affordable and more accessible to biodiversity conservationists and is becoming the gold standard for monitoring many elusive or rare terrestrial mammals. Machine learning methods are revolutionizing the analysis of large camera trap datasets (which often contain millions of images) by automating the identification of animals in photographs.

Existing solutions and their limitations
The challenge of quickly classifying large image datasets has been described and addressed by academics and skilled practitioners alike. For instance, Norouzzadeh et al. (2018) discovered that deep learning techniques could automate animal identification for over 99% of images of wildlife in a dataset from the Serengeti Ecosystem in northern Tanzania. They showed that “automatically, accurately, and inexpensively collecting such data could help catalyze the transformation of many fields of ecology, wildlife biology, zoology, conservation biology, and animal behavior into ‘big data’ sciences.” Tabak et al. (2018) provide further support for the utilization of machine learning for wildlife image classification in a resource-constrained context. First, they replicate the results from Norouzzadeh et al. (2018) showing 98% accuracy of an image classification model developed using a dataset from the United States. Second, the authors show that this same model could be successfully reused for out-of-sample datasets from Canada and Tanzania. This lends credibility to the idea of developing a universal model, which could be used globally. Third, they successfully run the model offline on a laptop computer. This is a fast-developing area of research and there are other solutions that aim to tackle the issue of analyzing the vast amounts of data generated from camera traps. Whilst novel, they often do not take into consideration the limited resources available to conservationist practitioners, not only in the African context but also in other regions where biodiversity conservation is underfunded. For instance, Wildlife Insights is a new cloud-based solution. It utilizes deep learning techniques for automatic wildlife image classification. However, it necessitates access to high-speed internet, proprietary data management systems, and other tools which are both costly and require specialized IT knowledge. Solutions provided by scholars such as Tabak et al. (2018) can run offline on basic machines, however, they require significant expertise in the use of data science tools such as R and computer programming and they are not standalone applications. This precludes their use by experts from other domains. Overall, existing tools typically require users to have some knowledge of computer programming and/or access to a fast internet connection (e.g. to use cloud-based platforms such as Zamba, Wildlife Insights, or the Marco Willi classifier). This poses a significant problem for many conservationists, who are only rarely experienced in computer programming and who often work in remote regions with little or no internet access. Furthermore, the current focus of machine-learning tools in Africa has been for savannah species, and these can be unsuitable for forest species. To allow conservationists to fully exploit the huge potential of camera traps for species monitoring, protection, and discovery, there is a need for simple software tools that can implement cutting-edge machine learning algorithms on basic hardware without internet access.Appsilon’s approach to solving the problem
In building the Data for Good initiative at Appsilon we advocate an exploratory approach. My experiences working in international development led me to believe that many Western-developed, techno-centric solutions do not bring the anticipated results to end-users outside of this demographic, because they fail to appreciate the more complex reality of the context in which they are applied. Designers tend to look for a problem to solve with an exciting technological development rather than the other way around. To prevent repeating this mistake we engage with specialists in the respective fields, where machine learning and other cutting-edge technological solutions can provide assistance. To this end, we entered a collaboration with Dr. Robin Whytock and Dr. Katharine Abernethy, who are world-class experts on biodiversity conservation, and Robin is also an experienced data science practitioner. Together we set out to integrate both new and existing machine learning algorithms into a simple-to-install and easy-to-use, an open-source software tool that can automatically analyze camera trap data without internet access. New algorithms will be created using a recently collated, unique database of 1.6 million labeled images taken across Central Africa during the last decade. The project will initially focus on Central African forest species in Gabon, but will also expand to encompass all of the Central African forest zones.
