Login

Login
Welcome:
Guest

Search for:


Browse:

Bannner: Aslib individual membership.
 
Journal search
Journal cover: Program: electronic library and information systems

Program: electronic library and information systems

ISSN: 0033-0337

Online from: 1966

Subject Area: Library and Information Studies

Content: Latest Issue | icon: RSS Latest Issue RSS | Previous Issues

Options: To add Favourites and Table of Contents Alerts please take a Emerald profile

Icon: .Table of Contents.Icon: .

DBpedia and the live extraction of structured data from Wikipedia


Document Information:
Title:DBpedia and the live extraction of structured data from Wikipedia
Author(s):Mohamed Morsey, (Department of Computer Science, University of Leipzig, Leipzig, Germany), Jens Lehmann, (Department of Computer Science, University of Leipzig, Leipzig, Germany), Sören Auer, (Department of Computer Science, University of Leipzig, Leipzig, Germany), Claus Stadler, (Department of Computer Science, University of Leipzig, Leipzig, Germany), Sebastian Hellmann, (Department of Computer Science, University of Leipzig, Leipzig, Germany)
Citation:Mohamed Morsey, Jens Lehmann, Sören Auer, Claus Stadler, Sebastian Hellmann, (2012) "DBpedia and the live extraction of structured data from Wikipedia", Program: electronic library and information systems, Vol. 46 Iss: 2, pp.157 - 181
Keywords:Data management, Databases, Knowledge extraction, Knowledge management, RDF, Triplestore, Websites, Wikipedia
Article type:Research paper
DOI:10.1108/00330331211221828 (Permanent URL)
Publisher:Emerald Group Publishing Limited
Acknowledgements:The authors thank the people at Openlink software for their valuable support during the development of DBpedia-Live. This work was supported by a grant from the European Union's 7th Framework Programme provided for the projects LOD2 (GA no. 257943) and LATC (GA no. 256975).
Abstract:

Purpose – DBpedia extracts structured information from Wikipedia, interlinks it with other knowledge bases and freely publishes the results on the web using Linked Data and SPARQL. However, the DBpedia release process is heavyweight and releases are sometimes based on several months old data. DBpedia-Live solves this problem by providing a live synchronization method based on the update stream of Wikipedia. This paper seeks to address these issues.

Design/methodology/approach – Wikipedia provides DBpedia with a continuous stream of updates, i.e. a stream of articles, which were recently updated. DBpedia-Live processes that stream on the fly to obtain RDF data and stores the extracted data back to DBpedia. DBpedia-Live publishes the newly added/deleted triples in files, in order to enable synchronization between the DBpedia endpoint and other DBpedia mirrors.

Findings – During the realization of DBpedia-Live the authors learned that it is crucial to process Wikipedia updates in a priority queue. Recently-updated Wikipedia articles should have the highest priority, over mapping-changes and unmodified pages. An overall finding is that there are plenty of opportunities arising from the emerging Web of Data for librarians.

Practical implications – DBpedia had and has a great effect on the Web of Data and became a crystallization point for it. Many companies and researchers use DBpedia and its public services to improve their applications and research approaches. The DBpedia-Live framework improves DBpedia further by timely synchronizing it with Wikipedia, which is relevant for many use cases requiring up-to-date information.

Originality/value – The new DBpedia-Live framework adds new features to the old DBpedia-Live framework, e.g. abstract extraction, ontology changes, and changesets publication.



Fulltext Options:

Login

Login

Existing customers: login
to access this document

Login


- Forgot password?

- Athens/Institutional login

Purchase

Purchase

Downloadable; Printable; Owned
HTML, PDF (415kb)Purchase

To purchase this item please login or register.

Login


- Forgot password?

Recommend to your librarian

Complete and print this form to request this document from your librarian


Marked list

Bookmark & share

Reprints & permissions

© Emerald Group Publishing Limited  |  Copyright information  |  Site policies  |  Cookie information
.