Data Mining and Machine Learning Algorithms for Optimizing Maize Yield Forecasting in Central Europe

dc.contributor.authorHarsányi, Endre
dc.contributor.authorBashir, Bashar
dc.contributor.authorArshad, Sana
dc.contributor.authorOcwa, Akasairi
dc.contributor.authorVad, Attila
dc.contributor.authorAlsalman, Abdullah
dc.contributor.authorMohammed, Safwan
dc.contributor.authorSzéles, Adrienn
dc.contributor.authorHijazi, Omar
dc.contributor.authorRátonyi, Tamás
dc.contributor.authorBácskai, István
dc.date.accessioned2024-06-10T09:23:59Z
dc.date.available2024-06-10T09:23:59Z
dc.date.issued2023-05
dc.descriptionP.(1-22) ;en_US
dc.description.abstractArtificial intelligence, specifically machine learning (ML), serves as a valuable tool for decision support in crop management under ongoing climate change. However, ML implementation to predict maize yield is still limited in Central Europe, especially in Hungary. In this context, we assessed the performance of four ML algorithms (Bagging (BG), Decision Table (DT), Random Forest (RF) and Artificial Neural Network-Multi Layer Perceptron (ANN-MLP)) in predicting maize yield based on four different input scenarios. The collected data included both agricultural data (production (PROD) (ton) and maize cropped area (AREA) (ha)) and climate data (annual mean temperature C (Tmean), precipitation (PRCP) (mm), rainy days (RD), frosty days (FD) and hot days (HD)). This research adopted four scenarios, as follows: SC1: AREA+ PROD+ Tmean+ PRCP+ RD+ FD+ HD; SC2: AREA+ PROD; SC3: Tmean+ PRCP+ RD+ FD+ HD; and SC4: AREA+ PROD+ Tmean+ PRCP. In the training stage, ANN-MLP-SC1 and ANN-MLP-SC4 outperformed other ML algorithms; the correlation coefficient (r) was 0.99 for both, while the root mean squared errors (RMSEs) were 107.9 (ANN-MLP-SC1) and 110.7 (ANN-MLP-SC4). In the testing phase, the ANN-MLP-SC4 had the highest r value (0.96), followed by ANN-MLP-SC1 (0.94) and RF-SC2 (0.94). The 10-fold cross validation also revealed that the ANN-MLP-SC4 and ANN-MLP-SC1 have the highest performance. We further evaluated the performance of the ANN-MLP-SC4 in predicting maize yield on a regional scale (Budapest). The ANN-MLP-SC4 succeeded in reaching a high-performance standard (r = 0.98, relative absolute error = 21.87%, root relative squared error = 20.4399% and RMSE = 423.23). This research promotes the use ofANNas an efficient tool for predicting maize yield, which could be highly beneficial for planners and decision makers in developing sustainable plans for crop management. Keywords: maize yield; climate; multilayer perceptron; random forest; optimum modelen_US
dc.identifier.citationHarsányi, E., Bashir, B., Arshad, S., Ocwa, A., Vad, A., Alsalman, A., ... & Mohammed, S. (2023). Data mining and machine learning algorithms for optimizing maize yield forecasting in central Europe. Agronomy, 13(5), 1297en_US
dc.identifier.urihttps://hdl.handle.net/20.500.12504/1787
dc.language.isoenen_US
dc.publisherMDPIen_US
dc.subjectMaize yielden_US
dc.subjectClimateen_US
dc.subjectMultilayer perceptronen_US
dc.subjectRandom foresten_US
dc.subjectOptimum modelen_US
dc.titleData Mining and Machine Learning Algorithms for Optimizing Maize Yield Forecasting in Central Europeen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
AGRONOMY-13-01297-v2.pdf
Size:
4.28 MB
Format:
Adobe Portable Document Format
Description:
PDF

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections