z-logo
Premium
Applying Machine Learning for High‐Performance Named‐Entity Extraction
Author(s) -
Baluja Shumeet,
Mittal Vibhu O.,
Sukthankar Rahul
Publication year - 2000
Publication title -
computational intelligence
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.353
H-Index - 52
eISSN - 1467-8640
pISSN - 0824-7935
DOI - 10.1111/0824-7935.00129
Subject(s) - computer science , natural language processing , punctuation , artificial intelligence , task (project management) , speech recognition , management , economics
This paper describes a machine learning approach to building an efficient and accurate name spotting system. Finding names in free text is an important task in many text‐based applications. Most previous approaches were based on hand‐crafted modules encoding language and genre‐specific knowledge. These approaches had at least two shortcomings: They required large amounts of time and expertise to develop and were not easily portable to new languages and genres. This paper describes an extensible system that automatically combines weak evidence from different, easily available sources: parts‐of‐speech tags, dictionaries, and surface‐level syntactic information such as capitalization and punctuation. Individually, each piece of evidence is insufficient for robust name detection. However, the combination of evidence, through standard machine learning techniques, yields a system that achieves performance equivalent to the best existing hand‐crafted approaches.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here