scispace - formally typeset
Search or ask a question
Author

Haidi Rao

Bio: Haidi Rao is an academic researcher from Anhui Agricultural University. The author has contributed to research in topics: Curse of dimensionality & Feature selection. The author has an hindex of 1, co-authored 1 publications receiving 196 citations.

Papers
More filters
Journal ArticleDOI
TL;DR: Experimental results demonstrate that the proposed feature selection method effectively reduces the dimensions of the dataset and achieves superior classification accuracy using the selected features.

353 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: It is shown that the XGBoost feature selection approach effectively addresses the issue of high landscape heterogeneity and spectral complexities in the image data, successfully augmenting the RF model performance (providing a mean user’s accuracy > 0.82).
Abstract: Although a detailed analysis of land use and land cover (LULC) change is essential in providing a greater understanding of increased human-environment interactions across the coastal region of Bangladesh, substantial challenges still exist for accurately classifying coastal LULC. This is due to the existence of high-level landscape heterogeneity and unavailability of good quality remotely sensed data. This study, the first of a kind, implemented a unique methodological approach to this challenge. Using freely available Landsat imagery, eXtreme Gradient Boosting (XGBoost)-based informative feature selection and Random Forest classification is used to elucidate spatio-temporal patterns of LULC across coastal areas over a 28-year period (1990–2017). We show that the XGBoost feature selection approach effectively addresses the issue of high landscape heterogeneity and spectral complexities in the image data, successfully augmenting the RF model performance (providing a mean user’s accuracy > 0.82). Multi-temporal LULC maps reveal that Bangladesh’s coastal areas experienced a net increase in agricultural land (5.44%), built-up (4.91%) and river (4.52%) areas over the past 28 years. While vegetation cover experienced a net decrease (8.26%), an increasing vegetation trend was observed in the years since 2000, primarily due to the Bangladesh government’s afforestation initiatives across the southern coastal belts. These findings provide a comprehensive picture of coastal LULC patterns, which will be useful for policy makers and resource managers to incorporate into coastal land use and environmental management practices. This work also provides useful methodological insights for future research to effectively address the spatial and spectral complexities of remotely sensed data used in classifying the LULC of a heterogeneous landscape.

173 citations

Journal ArticleDOI
11 May 2020
TL;DR: The proposed methodology can provide a reliable reference for pillar design and stability risk management and achieve a better comprehensive performance than other ML algorithms.
Abstract: Predicting pillar stability is a vital task in hard rock mines as pillar instability can cause large-scale collapse hazards. However, it is challenging because the pillar stability is affected by many factors. With the accumulation of pillar stability cases, machine learning (ML) has shown great potential to predict pillar stability. This study aims to predict hard rock pillar stability using gradient boosting decision tree (GBDT), extreme gradient boosting (XGBoost), and light gradient boosting machine (LightGBM) algorithms. First, 236 cases with five indicators were collected from seven hard rock mines. Afterwards, the hyperparameters of each model were tuned using a five-fold cross validation (CV) approach. Based on the optimal hyperparameters configuration, prediction models were constructed using training set (70% of the data). Finally, the test set (30% of the data) was adopted to evaluate the performance of each model. The precision, recall, and F1 indexes were utilized to analyze prediction results of each level, and the accuracy and their macro average values were used to assess the overall prediction performance. Based on the sensitivity analysis of indicators, the relative importance of each indicator was obtained. In addition, the safety factor approach and other ML algorithms were adopted as comparisons. The results showed that GBDT, XGBoost, and LightGBM algorithms achieved a better comprehensive performance, and their prediction accuracies were 0.8310, 0.8310, and 0.8169, respectively. The average pillar stress and ratio of pillar width to pillar height had the most important influences on prediction results. The proposed methodology can provide a reliable reference for pillar design and stability risk management.

145 citations

Journal ArticleDOI
TL;DR: Experimental results show that FWDT the authors' proposed method performs better for the measures of accuracy, recall and F1-score and it can reduce the required time in constructing the decision tree.
Abstract: In order to improve the classification accuracy, a preprocessing step is used to pre-filter some redundant or irrelevant features before decision tree construction. And a new feature selection algorithm FWDT is proposed based on this. Experimental results show that FWDT our proposed method performs better for the measures of accuracy, recall and F1-score. Furthermore, it can reduce the required time in constructing the decision tree.

92 citations

Journal ArticleDOI
TL;DR: This study strengthens the extant literature by empirically analyzing restaurant reviews posted during the COVID-19 pandemic and discovering suitable deep learning algorithms for different text mining tasks.

90 citations

Journal ArticleDOI
TL;DR: The results demonstrated that the growth rate of the number and depth of the water accumulation points increased significantly after the rainfall return period of 'once in every two years' in Zhengzhou City, and the flooded areas mainly occurred in the old urban areas and parts of southern Zhengzhou.

86 citations