Ensemble Machine Learning Methods to Estimate the Sugarcane Yield Based on Remote Sensing Information
Author(s) -
Sandeep Kumar Singla,
Rahul Garg,
Om Prakash Dubey
Publication year - 2020
Publication title -
revue d intelligence artificielle
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.146
H-Index - 14
eISSN - 1958-5748
pISSN - 0992-499X
DOI - 10.18280/ria.340607
Subject(s) - yield (engineering) , ensemble learning , machine learning , computer science , artificial intelligence , remote sensing , geography , materials science , metallurgy
Received: 8 October 2020 Accepted: 12 December 2020 The purpose of this study is to investigate the computing capabilities of machine learning algorithms and remotely sensed signals to extract the agricultural information. Many techniques and models have been developed to extract information from the remotely sensed observations, but it remains an exigent problem due to the accuracy, reliability and timeliness parameters. Sugarcane yield estimation based on the temporal profile of multispectral Landsat-8 data has been explored in the proposed work. An initial attempt has been made in this study to select important parameters to be used as input to the machine learning method. Mean Decrease Accuracy and Mean Decrease Gini measures of random forest algorithm have been used to select the important parameters for predictive modelling. The results of the study revealed that Green Normalized Vegetation Index, Normalized Difference Vegetation Index and Land Surface Water Index performed best among other indices. Bands B2, B3, B6 and B7 of Landsat-8 recorded as top scorers. The proposed work focused on ensemble machine learning methods to optimize the correlation of historical crop yield values with spectral information. The Random Forest method exhibits a significant performance (RMSE= 1.51 t/ha and R = 0.94) as compared with other methods such as Classification and Regression Tree, Support Vector Regression and K-Nearest Neighbor. The proposed model based on random forest algorithm is best among all the scenarios and growth stages, whereas model based on classification and regression tree performs worst in all the cases. The proposed study indicates that the numerical value of a single spectral parameter and single-date data is not sufficient for the reliable yield estimation because it is difficult to discriminate some of the crops due to similar phenology in a particular growth period.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom