z-logo
open-access-imgOpen Access
Predicting sales by online searching data keywords based on text mining: Evidence from the Chinese automobile market
Author(s) -
Yi Li,
Liangru Yu,
Ruizhi Wen
Publication year - 2019
Publication title -
journal of physics. conference series
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.21
H-Index - 85
eISSN - 1742-6596
pISSN - 1742-6588
DOI - 10.1088/1742-6596/1325/1/012071
Subject(s) - word2vec , computer science , automotive industry , similarity (geometry) , word (group theory) , autoregressive model , vector space model , variance (accounting) , support vector machine , space (punctuation) , data mining , artificial intelligence , information retrieval , econometrics , business , mathematics , engineering , geometry , accounting , image (mathematics) , aerospace engineering , operating system
Online searching data reflects consumers’ real footprints in information collection and purchase decision-making processes, which is greatly valued in understanding their needs. This paper which is at the background of China’s automobile market, studies the relationship between online searching data and automobile sales through approaches that differ from existing research to extract keywords. First the online searching data keywords are determined, primarily by using text-mining technology to extract them, and specifically: i) Jieba was used to tokenize crawled automotive forum posts’ text into segmented words; ii) All word-segmented Chinese corpus were segmented into word vector space by Word2vec model; and iii) Similar keywords were discovered by calculating the word vector’s similarity indexes. A fixed effect model was then built based on 108 months of long panel data. Finally, combing with panel vector autoregressive model (PVAR), we used rolling window to predict Chinese automobile sales from January to December 2015.The empirical results demonstrate that: a long equilibrium exists between online searching data and automobile sales; our regression model can explain 76% of the variance. The holdout analysis suggests that online searching data can be of substantial use in forecasting Chinese automobile sales.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here