Applications of Deep Learning in News Text Classification | Zendy

Menghan Zhang | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Applications of Deep Learning in News Text Classification

Author(s) -

Menghan Zhang

Publication year - 2021

Publication title -

scientific programming

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.269

H-Index - 36

eISSN - 1875-919X

pISSN - 1058-9244

DOI - 10.1155/2021/6095354

Subject(s) - computer science , artificial intelligence , pace , word (group theory) , feature (linguistics) , feature vector , natural language processing , feature engineering , deep learning , vector space model , information retrieval , mathematics , linguistics , geography , philosophy , geometry , geodesy

The advancement in technology is taking place with an accelerating pace across the globe. With the increasing expansion and technological advancement, a vast volume of text data are generated everyday, in the form of social media platform, websites, company data, healthcare data, and news. Indeed, it is a difficult task to extract intriguing patterns from the text data, such as opinions, summaries, and facts, having varying length. Because of the problems of the length of text data and the difficulty of feature value extraction in news, this paper proposes a news text classification method based on the combination of deep learning (DL) algorithms. In order to classify the text data, the earlier approaches use a single word vector to express text information and only the information of the relationship between words were considered, but the relationship between words and categories was ignored which indeed is an important factor for the classification of news text. This paper follows the idea of a customized algorithm which is the combination of DL algorithms such as CNN, LSTM, and MLP and proposes a customized DCLSTM-MLP model for the classification of news text data. The proposed model is expressed in parallel with word vector and word dispersion. The relationship among words is represented by the word vector as an input of the CNN module, and the relationship between words and categories is represented by a discrete vector as an input of the MLP module in order to realize comprehensive learning of spatial feature information, time-series feature information, and relationship between words and categories of news text. To check the stability and performance of the proposed method, multiple experiments were performed. The experimental results showed that the proposed method solves the problems of text length, difficulty of feature extraction in the news text, and classification of news text in an effective way and attained better accuracy, recall rate, and comprehensive value as compared to the other models.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research