Data Mining and Machine Learning Algorithms for Optimizing Maize Yield Forecasting in Central Europe
dc.contributor.author | Harsányi, Endre | |
dc.contributor.author | Bashir, Bashar | |
dc.contributor.author | Arshad, Sana | |
dc.contributor.author | Ocwa, Akasairi | |
dc.contributor.author | Vad, Attila | |
dc.contributor.author | Alsalman, Abdullah | |
dc.contributor.author | Mohammed, Safwan | |
dc.contributor.author | Széles, Adrienn | |
dc.contributor.author | Hijazi, Omar | |
dc.contributor.author | Rátonyi, Tamás | |
dc.contributor.author | Bácskai, István | |
dc.date.accessioned | 2024-06-10T09:23:59Z | |
dc.date.available | 2024-06-10T09:23:59Z | |
dc.date.issued | 2023-05 | |
dc.description | P.(1-22) ; | en_US |
dc.description.abstract | Artificial intelligence, specifically machine learning (ML), serves as a valuable tool for decision support in crop management under ongoing climate change. However, ML implementation to predict maize yield is still limited in Central Europe, especially in Hungary. In this context, we assessed the performance of four ML algorithms (Bagging (BG), Decision Table (DT), Random Forest (RF) and Artificial Neural Network-Multi Layer Perceptron (ANN-MLP)) in predicting maize yield based on four different input scenarios. The collected data included both agricultural data (production (PROD) (ton) and maize cropped area (AREA) (ha)) and climate data (annual mean temperature C (Tmean), precipitation (PRCP) (mm), rainy days (RD), frosty days (FD) and hot days (HD)). This research adopted four scenarios, as follows: SC1: AREA+ PROD+ Tmean+ PRCP+ RD+ FD+ HD; SC2: AREA+ PROD; SC3: Tmean+ PRCP+ RD+ FD+ HD; and SC4: AREA+ PROD+ Tmean+ PRCP. In the training stage, ANN-MLP-SC1 and ANN-MLP-SC4 outperformed other ML algorithms; the correlation coefficient (r) was 0.99 for both, while the root mean squared errors (RMSEs) were 107.9 (ANN-MLP-SC1) and 110.7 (ANN-MLP-SC4). In the testing phase, the ANN-MLP-SC4 had the highest r value (0.96), followed by ANN-MLP-SC1 (0.94) and RF-SC2 (0.94). The 10-fold cross validation also revealed that the ANN-MLP-SC4 and ANN-MLP-SC1 have the highest performance. We further evaluated the performance of the ANN-MLP-SC4 in predicting maize yield on a regional scale (Budapest). The ANN-MLP-SC4 succeeded in reaching a high-performance standard (r = 0.98, relative absolute error = 21.87%, root relative squared error = 20.4399% and RMSE = 423.23). This research promotes the use ofANNas an efficient tool for predicting maize yield, which could be highly beneficial for planners and decision makers in developing sustainable plans for crop management. Keywords: maize yield; climate; multilayer perceptron; random forest; optimum model | en_US |
dc.identifier.citation | Harsányi, E., Bashir, B., Arshad, S., Ocwa, A., Vad, A., Alsalman, A., ... & Mohammed, S. (2023). Data mining and machine learning algorithms for optimizing maize yield forecasting in central Europe. Agronomy, 13(5), 1297 | en_US |
dc.identifier.uri | https://hdl.handle.net/20.500.12504/1787 | |
dc.language.iso | en | en_US |
dc.publisher | MDPI | en_US |
dc.subject | Maize yield | en_US |
dc.subject | Climate | en_US |
dc.subject | Multilayer perceptron | en_US |
dc.subject | Random forest | en_US |
dc.subject | Optimum model | en_US |
dc.title | Data Mining and Machine Learning Algorithms for Optimizing Maize Yield Forecasting in Central Europe | en_US |
dc.type | Article | en_US |