z-logo
open-access-imgOpen Access
User Name Alias Extraction in Emails
Author(s) -
Meijuan Yin,
Junyong Luo,
Ding Cao,
Xiaonan Liu,
Yongxing Tan
Publication year - 2011
Publication title -
international journal of image graphics and signal processing
Language(s) - English
Resource type - Journals
eISSN - 2074-9082
pISSN - 2074-9074
DOI - 10.5815/ijigsp.2011.03.01
Subject(s) - alias , computer science , world wide web , information retrieval , database
Finding out user identity information from emails is one of the important research topics in email mining. Most approaches extract an email user's name only from the header of an email, but there are often many name information appearing in the body of emails, and those names are usually more suitable for representing the sender's or recipient's identity. This paper focuses on the problem of extracting email users' name aliases in the body of plain-text emails. After locating and extracting salutation and signature blocks from email bodies, we can identify the potential aliases in the salutation and signature lines, which can be directly associated with the corresponding email address in email headers, by using named entity recognition(NER) tools. However the identified aliases may be half-baked or there are still some potential aliases that can't be correctly identified. So we propose a novel approach to efficiently and accurately extract aliases in the salutation and signature lines based on name boundary word template built on the characteristics of alias neighboring words. Results on the public subset of the Enron corpus indicate that the approaches presented in this paper can efficiently extract user's aliases from email bodies.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom