TeamBeam - Meta-Data Extraction from Scientific Literature | Zendy

Roman Kern | Zendy; Kris Jack | Zendy; Maya Hristakeva | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

TeamBeam - Meta-Data Extraction from Scientific Literature

Author(s) -

Roman Kern,

Kris Jack,

Maya Hristakeva

Publication year - 2012

Publication title -

d-lib magazine

Language(s) - English

Resource type - Journals

ISSN - 1082-9873

DOI - 10.1045/july2012-kern

Subject(s) - extraction (chemistry) , data extraction , computer science , information retrieval , data science , medline , chromatography , political science , chemistry , law

An important aspect of the work of researchers as well as librarians is to manage collections of scientific literature. Social research networks, such as Mendeley and CiteULike, provide services that support this task. Meta-data plays an important role in providing services to retrieve and organise the articles. In such settings, meta-data is rarely explicitly provided, leading to the need for automatically extracting this valuable information. The TeamBeam algorithm analyses a scientific article and extracts structured meta-data, such as the title, journal name and abstract, as well as information about the article's authors (e.g. names, e-mail addresses, affiliations). The input of the algorithm is a set of blocks generated from the article text. A classification algorithm, which takes the sequence of the input into account, is then applied in two consecutive phases. In the evaluation of the algorithm, its performance is compared against two heuristics and three existing meta-data extraction systems. Three different data sets with varying characteristics are used to assess the quality of the extraction results. TeamBeam performs well under testing and compares favourably with existing approaches.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research