z-logo
open-access-imgOpen Access
Application of locally linear embedding algorithm on hotel data text classification
Author(s) -
Huang Jin-ming
Publication year - 2020
Publication title -
journal of physics. conference series
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.21
H-Index - 85
eISSN - 1742-6596
pISSN - 1742-6588
DOI - 10.1088/1742-6596/1634/1/012014
Subject(s) - nonlinear dimensionality reduction , embedding , dimensionality reduction , manifold (fluid mechanics) , boosting (machine learning) , dimension (graph theory) , artificial intelligence , computer science , logistic regression , data set , statistical classification , pattern recognition (psychology) , mathematics , machine learning , combinatorics , mechanical engineering , engineering
As a non-linear dimension reduction method, manifold learning algorithm projects high-dimensional input to a low-dimensional space by maintaining the local structure of the data, and discovers the inherent geometric structure hidden in the data. In this paper, we attempt to apply the manifold learning algorithm to the field of Chinese text classification, and use the locally linear embedding algorithm to reduce the dimension of the ctrip hotel review data set. Then, we utilize extreme gradient boosting (XGBoost) and logistic regression to classify the text. Experimental results show that it is effective and feasible to use manifold learning algorithm for text classification. Moreover, the classification effect of logistic regression is better than XGBoost in the text classification of hotel reviews.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here