University of Tasmania
Li et al (2023) integrating maching learning under climate change FINAL 21 July 2023.pdf (5.58 MB)

Integrating machine learning and environmental variables to constrain uncertainty in crop yield change projections under climate change

Download (5.58 MB)
journal contribution
posted on 2023-08-22, 05:46 authored by Matthew Harrison, Linchao Li, Yan Zhang, Bin Wang, Puyu Feng, Qinse He, Yu Shi, Ke LiuKe Liu, Matthew HarrisonMatthew Harrison, De Li Liu, Ning Yao, Jianqing He, Hao Feng, Kadambot Siddique, Qiang Yu

Robust crop yield projections under future climates are fundamental prerequisites for reliable policy formation. Both process-based crop models and statistical models are commonly used for this purpose. Process-based models tend to simplify processes, minimize effects of extreme events, and ignore biotic pressures, while statistic al models cannot deterministically capture intricate biological and physiological processes underpinning crop growth. We attempted to integrate and overcome shortcomings in both modelling frameworks by integrating dynamic linear model (DLM) and random forest machine learning model (RF) with nine global gridded crop models (GGCM), respectively, in order to improve projections and reduce uncertainties of maize (Zea mays L.) and soybean (Glycine max [L.] Merrill) yield projections. Our results demonstrated substantial improvements in model performance accuracy by using RF in concert with GGCM across China’s maize and soybean belt. This improvement surpasses that achieved using DLM. For maize, the GGCM+RF models increased the r values from 0.15–0.61 to 0.64–0.77 and decreased nRMSE from approximately 0.20–0.50 to 0.13–0.17 compared with using GGCM alone. For soybean, the models increased r from 0.37–0.70 to 0.54–0.70 and decreased nRMSE from 0.17–0.35 to 0.17–0.20 compared with using GGCM alone. The main factors influencing maize yield changes included chilling days (CD), crop pests and diseases (CPDs), and drought, while for soybean the primary influencing factors included CPD, tropical days (based on exceeding a maximum temperature), and drought. Our approach decreased uncertainties by 33–78% for maize and by 56–68% for soybean . The main source of uncertainty for GGCM was the crop model. For GGCM+RF, the main source of uncertainty for the 2040–2069 period was the global climate model, while the main source of uncertainty for the 2070–2099 period was the climate scenario. Our results provide a novel, robust, and pragmatic framework to constrain uncertainties in order to accurately assess the impact of future climate change on crop yields. These results could be used to interpret future ensemble studies by accounting for uncertainty in crop and climate models, as well as to assess future emissions scenarios.


Advanced machine cognition to segregate effects of climate from management at the landscape scale : Department of Industry, Science, Energy and Resources



  • Article

Publication title

European Journal of Agronomy








TIA - Research Institute



Publication status

  • Published

Rights statement

Copyright 2023 Elsevier B.V. All rights reserved

Socio-economic Objectives

190101 Climate change adaptation measures (excl. ecosystem), 190301 Climate change mitigation strategies, 190502 Climate variability (excl. social impacts), 190199 Adaptation to climate change not elsewhere classified, 190310 Management of greenhouse gas emissions from plant production, 260306 Maize, 260308 Rice, 260311 Soybeans, 260312 Wheat

UN Sustainable Development Goals

13 Climate Action, 1 No Poverty, 12 Responsible Consumption and Production, 13 Climate Action, 15 Life on Land, 2 Zero Hunger, 9 Industry, Innovation and Infrastructure