Feature Extraction of Museum Big Data Text Information Based on the Similarity Mapping Algorithm
Author(s) -
Zhe Yang,
Huiqin Wang,
Qixuan Tang,
Ting Wang,
Shaowen Wang,
Yulei Kong
Publication year - 2022
Publication title -
mobile information systems
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.346
H-Index - 34
eISSN - 1875-905X
pISSN - 1574-017X
DOI - 10.1155/2022/9611559
Subject(s) - computer science , big data , information retrieval , web crawler , set (abstract data type) , similarity (geometry) , word (group theory) , feature (linguistics) , search engine indexing , feature extraction , construct (python library) , column (typography) , web page , data mining , artificial intelligence , frame (networking) , world wide web , programming language , linguistics , image (mathematics) , philosophy , telecommunications
Under big data, a large number of features, as well as their complex data types, make traditional feature extraction and knowledge reasoning unable to adapt to new conditions. To solve these problems, this study proposes a museum big data feature extraction method based on a similarity mapping algorithm. Under the museum big data analysis, the museum big data text information is collected through web crawler technology. The web crawler is used to index the content of websites all across the Internet so that the museum websites can appear in search engine results and the collected text information is denoised and smoothed by a Gaussian filter to construct the processed text information set mapping matrix. The semantic similarity is computed according to the text word concept. Based on the calculation results, through word frequency and document probability inverse document frequency weight, the museum big data text information features are extracted. Simulation results show that the proposed method has high accuracy and short extraction time. Through the comparative analysis, it can be realized that this method not only solves the problems existing in traditional methods but also lays a foundation for the analysis of museum massive data.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom