Semantic Web Approach to the GitHub Database Processing
Author(s) -
Piotr Czekalski,
Paweł Micek
Publication year - 2014
Publication title -
lecture notes on software engineering
Language(s) - English
Resource type - Journals
ISSN - 2301-3559
DOI - 10.7763/lnse.2015.v3.201
Subject(s) - computer science , database , world wide web , information retrieval , semantic web
Despitethe vast choice of RDFS (resource description framework schema) vocabularies being published in projects like LOV (Linked Open Vocabularies), there are still areas for which proper ontology has to be developed. GitHub, being both a reliable source of open Git repositories and a social platform, introduces new termsfor which no proper description by means of the Semantic Web has been made yet. This paper regards every aspect of introducing a new set of semantic data: extracting the data, creating or refining the vocabulary, and possible ways to operate on properly integrated data set using SPARQL (SPARQL protocol and RDF query language). Authors of the paper would like to present a methodology of developing a unique RDFS vocabulary. Having this vocabulary, data extracted from GitHub can be described bythe use of new terms as well as those from well-established ontologies, such as SIOC (semantically-interlinked online communities), DOAP (description of a project) or DC (dublin core).
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom