z-logo
open-access-imgOpen Access
Issues and Empirical Results for Improving Text Classification
Author(s) -
Youngjoong Ko,
Jungyun Seo
Publication year - 2011
Publication title -
journal of computing science and engineering
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.172
H-Index - 16
eISSN - 2093-8020
pISSN - 1976-4677
DOI - 10.5626/jcse.2011.5.2.150
Subject(s) - computer science , automatic summarization , weighting , search engine indexing , classifier (uml) , artificial intelligence , term (time) , field (mathematics) , data mining , machine learning , information retrieval , medicine , physics , mathematics , quantum mechanics , pure mathematics , radiology
Automatic text classification has a long history and many studies have been conducted in this field. In particular, many machine learning algorithms and information retrieval techniques have been applied to text classification tasks. Even though much technical progress has been made in text classification, there is still room for improvement in text classification. In this paper, we will discuss remaining issues in improving text classification. In this paper, three improvement issues are presented including automatic training data generation, noisy data treatment and term weighting and indexing, and four actual studies and their empirical results for those issues are introduced. First, the semi-supervised learning technique is applied to text classification to efficiently create training data. For effective noisy data treatment, a noisy data reduction method and a robust text classifier from noisy data are developed as a solution. Finally, the term weighting and indexing technique is revised by reflecting the importance of sentences into term weight calculation using summarization techniques.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom