University of Tasmania
Kuhn_etal_2018_The_Utility_of_Machine_Learning_in_Identification_of_Key_geophysical_and_geochemical_datasets.pdf (466.18 kB)

The utility of machine learning in identification of key geophysical and geochemical datasets: a case study in lithological mapping in the Central African copper belt

Download (466.18 kB)
conference contribution
posted on 2023-05-23, 13:29 authored by Kuhn, SD, Matthew CracknellMatthew Cracknell, Anya ReadingAnya Reading

Random Forests, a supervised machine learning algorithm, provides a robust, data driven means of predicting lithology from geophysical, geochemical and remote sensing data. As an essential part of input selection, datasets are ranked in order of importance to the classification outcome. Those ranked most important provide, on average, the most decisive split between lithological classes. These rankings provide explorers with an additional line of reasoning to complement conventional, geophysical and geochemical interpretation workflows. The approach shows potential to aid in identifying important criteria for distinguishing geological map units during early stage exploration. This can assist in directing subsequent expenditure towards the acquisition and further development of datasets which will be the most productive for mapping.

In this case study, we use Random Forests to classify the lithology of a project in the Central African Copper-Belt, Zambia. The project area boasts extensive magnetic, radiometric, electromagnetic and multi-element geochemical coverage but only sparse geological observations. Under various training data paradigms, Random Forests produced a series of varying but closely related lithological maps. In this study, training data were restricted to outcrop, simulating the data available at the early stages of the project. Variable ranking highlighted those datasets which were of greatest importance to the result. Both geophysical and geochemical datasets were well represented in the highest ranking variables, reinforcing the importance of access to both data types. Further analysis showed that in many cases, the importance of high ranking datasets had a plausible geological explanation, often consistent with conventional interpretation. In other cases the method provides new insights, identifying datasets which may not have been considered from the outset of a new project.


Publication title

Proceedings of the Australasian Exploration Geoscience Conference 2018




School of Natural Sciences


CSIRO publishing

Place of publication


Event title

Australasian Exploration Geoscience Conference 2018

Event Venue

Sydney, Australia

Date of Event (Start Date)


Date of Event (End Date)


Rights statement

Copyright 2018 the Authors

Repository Status

  • Open

Socio-economic Objectives

Copper ore exploration

Usage metrics

    University Of Tasmania


    Ref. manager