TeamBeam - Meta-Data Extraction from Scientific Literature
Author(s) - Roman Kern, Kris Jack, Maya Hristakeva
Publication year - 2012
Publication title - D-Lib Magazine
Language(s) - English
Resource type - Journals
ISSN - 1082-9873
DOI - 10.1045/july2012-kern
Subject(s) - extraction (chemistry) , data extraction , computer science , information retrieval , data science , medline , chromatography , political science , chemistry , law
An important aspect of the work of researchers as well as librarians is to manage collections of scientific literature. Social research networks, such as Mendeley and CiteULike, provide services that support this task. Meta-data plays an important role in providing services to retrieve and organise the articles. In such settings, meta-data is rarely explicitly provided, leading to the need for automatically extracting this valuable information. The TeamBeam algorithm analyses a scientific article and extracts structured meta-data, such as the title, journal name and abstract, as well as information about the article's authors (e.g. names, e-mail addresses, affiliations). The input of the algorithm is a set of blocks generated from the article text. A classification algorithm, which takes the sequence of the input into account, is then applied in two consecutive phases. In the evaluation of the algorithm, its performance is compared against two heuristics and three existing meta-data extraction systems. Three different data sets with varying characteristics are used to assess the quality of the extraction results. TeamBeam performs well under testing and compares favourably with existing approaches.
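The two-phase flow described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not specify the classifier or its features, so hand-written rules stand in for a trained model here, and the `Block` type, the `font_size` feature, the label names and the example e-mail address are all assumptions made for illustration. The only ideas taken from the abstract are that the input is a sequence of text blocks, that the label of the preceding block informs the next decision, and that a second pass extracts author details from the blocks found in the first pass.

```python
from dataclasses import dataclass


@dataclass
class Block:
    text: str
    font_size: float  # assumed layout feature, not taken from the source


def label_blocks(blocks):
    """Phase 1 (toy): label each block, using the previous label as context."""
    labels = []
    prev = "start"
    max_font = max((b.font_size for b in blocks), default=0.0)
    for b in blocks:
        words = b.text.split()
        if prev == "start" and b.font_size == max_font:
            label = "title"
        elif b.text.lower().startswith("abstract"):
            label = "abstract"
        elif prev in ("title", "author") and len(words) <= 8:
            # Short blocks directly after the title are assumed to be authors.
            label = "author"
        else:
            label = "other"
        labels.append(label)
        prev = label
    return labels


def label_author_tokens(text):
    """Phase 2 (toy): label tokens inside an author block as name or email."""
    return [(tok, "email" if "@" in tok else "name") for tok in text.split()]


def extract_metadata(blocks):
    """Run both phases and collect the results into a dictionary."""
    meta = {"title": None, "abstract": None, "authors": []}
    for block, label in zip(blocks, label_blocks(blocks)):
        if label == "title":
            meta["title"] = block.text
        elif label == "abstract":
            meta["abstract"] = block.text
        elif label == "author":
            tokens = label_author_tokens(block.text)
            name = " ".join(t for t, kind in tokens if kind == "name")
            emails = [t for t, kind in tokens if kind == "email"]
            meta["authors"].append(
                {"name": name, "email": emails[0] if emails else None}
            )
    return meta


if __name__ == "__main__":
    # Made-up example input; the e-mail address is a placeholder.
    example = [
        Block("TeamBeam - Meta-Data Extraction from Scientific Literature", 18.0),
        Block("Roman Kern author@example.org", 11.0),
        Block("Abstract An important aspect of the work of researchers ...", 10.0),
    ]
    print(extract_metadata(example))
```

In a real system the rules in `label_blocks` and `label_author_tokens` would be replaced by a trained sequence classifier; the sketch only shows how the output of the first phase selects the blocks that the second phase examines.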