Showing papers by "Christian Bizer published in 2010"

PDF

Open Access

Book Chapter•DOI•

[...]

Rasmus Hahn, Christian Bizer¹, Christopher Sahnwaldt, Christian Herta, Scott Robinson, Michaela Bürgle, Holger Düwiger, Ulrich Scheel - Show less +4 more•Institutions (1)

Free University of Berlin¹

03 May 2010

TL;DR: Feted Wikipedia Search is presented, an alternative search interface for Wikipedia, which facilitates infobox data in order to enable users to ask complex questions against Wikipedia knowledge and helps them to truly exploit Wikipedia’s collective intelligence.

...read moreread less

Abstract: Wikipedia articles contain, besides free text, various types of structured information in the form of wiki markup. The type of wiki content that is most valuable for search are Wikipedia infoboxes, which display an article’s most relevant facts as a table of attribute-value pairs on the top right-hand side of the Wikipedia page. Infobox data is not used by Wikipedia’s own search engine. Standard Web search engines like Google or Yahoo also do not take advantage of the data. In this paper, we present Faceted Wikipedia Search, an alternative search interface for Wikipedia, which facilitates infobox data in order to enable users to ask complex questions against Wikipedia knowledge. By allowing users to query Wikipedia like a structured database, Faceted Wikipedia Search helps them to truly exploit Wikipedia’s collective intelligence.

...read moreread less

106 citations

LDspider: an open-source crawling framework for the web of linked data

[...]

Robert Isele¹, Jürgen Umbrich², Christian Bizer¹, Andreas Harth³•Institutions (3)

Free University of Berlin¹, National University of Ireland, Galway², Karlsruhe Institute of Technology³

09 Nov 2010

TL;DR: LDSpider traverses the Web of Linked Data by following RDF links between data items, it supports different crawling strategies and allows crawled data to be stored either in files or in an RDF store.

...read moreread less

Abstract: The Web of Linked Data is growing and currently consists of several hundred interconnected data sources altogether serving over 25 billion RDF triples to the Web. What has hampered the exploitation of this global dataspace up till now is the lack of an open-source Linked Data crawler which can be employed by Linked Data applications to localize (parts of) the dataspace for further processing. With LDSpider, we are closing this gap in the landscape of publicly available Linked Data tools. LDSpider traverses the Web of Linked Data by following RDF links between data items, it supports different crawling strategies and allows crawled data to be stored either in files or in an RDF store.

...read moreread less

104 citations

Silk server - adding missing links while consuming linked data

[...]

Robert Isele¹, Anja Jentzsch¹, Christian Bizer¹•Institutions (1)

Free University of Berlin¹

08 Nov 2010

TL;DR: Silk Server as mentioned in this paper is an identity resolution component, which can be used within Linked Data application architectures to augment Web data with additional RDF links, and is designed to be used with an incoming stream of RDF instances, produced for example by a linked data crawler.

...read moreread less

Abstract: The Web of Linked Data is built upon the idea that data items on the Web are connected by RDF links. Sadly, the reality on the Web shows that Linked Data sources set some RDF links pointing at data items in related data sources, but they clearly do not set RDF links to all data sources that provide related data. In this paper, we present Silk Server, an identity resolution component, which can be used within Linked Data application architectures to augment Web data with additional RDF links. Silk Server is designed to be used with an incoming stream of RDF instances, produced for example by a Linked Data crawler. Silk Server matches the RDF descriptions of incoming instances against a local set of known instances and discovers missing links between them. Based on this assessment, an application can store data about newly discovered instances in its repository or fuse data that is already known about an entity with additional data about the entity from the Web. Afterwards, we report on the results of an experiment in which Silk Server was used to generate RDF links between authors and publications from the Semantic Web Dog Food Corpus and a stream of FOAF profiles that were crawled from the Web.

...read moreread less

86 citations

The R2R framework: publishing and discovering mappings on the web

[...]

Christian Bizer¹, Andreas Schultz¹•Institutions (1)

Free University of Berlin¹

08 Nov 2010

TL;DR: A language for publishing expressive, named mappings on the Web and a composition method for chaining partial mappings from different sources based on a mapping quality assessment heuristic are proposed.

...read moreread less

Abstract: The promise of the Web of Linked Data is to enable client applications to discover new data sources by following RDF links at run-time and to smoothly integrate data from these sources. Linked Data sources use different vocabularies to describe the same type of objects. It is also common practice to mix terms from different widely used vocabularies with proprietary terms. Thus Linked Data applications need to apply mappings to translate Web data to their local schema before doing any sophisticated data processing. Maintaining a local or central set of mappings that cover all Linked Data sources is likely to be impossible due to the size and dynamics of the Web of Linked Data. Thus this paper propagates a distributed, pay-as-you-go integration approach where data publishers, vocabulary maintainers and third parties may publish expressive mappings on the Web. A client application which discovers data that is represented using terms that are unknown to the application may search the Web for mappings and apply the discovered mappings to translate data to its local schema. We propose a language for publishing expressive, named mappings on the Web and a composition method for chaining partial mappings from different sources based on a mapping quality assessment heuristic. The composition method is implemented within the R2R Mapping Engine which can be used by Linked Data applications to translate Web data to their local schema.

...read moreread less

81 citations

Silk - generating RDF links while publishing or consuming linked data

[...]

Anja Jentzsch¹, Robert Isele¹, Christian Bizer¹•Institutions (1)

Free University of Berlin¹

09 Nov 2010

TL;DR: This poster presents the Silk Link Discovery Framework and reports on two usage examples in which it was employed to generate links between two data sets about movies as well as to find duplicate persons in a stream of data items that is crawled from the Web.

...read moreread less

Abstract: The central idea of the Web of Data is to interlink data items using RDF links. However, in practice most data sources are not sufficiently interlinked with related data sources. The Silk Link Discovery Framework addresses this problem by providing tools to generate links between data items based on user-provided link specifications. It can be used by data publishers to generate links between data sets as well as by Linked Data consumers to augment Web data with additional RDF links. In this poster we present the Silk Link Discovery Framework and report on two usage examples in which we employed Silk to generate links between two data sets about movies as well as to find duplicate persons in a stream of data items that is crawled from the Web.

...read moreread less

64 citations

Extending SMW+ with a linked data integration framework

[...]

Christian Becker, Christian Bizer¹, Michael Erdmann², Mark Greaves³•Institutions (3)

Free University of Berlin¹, Ontoprise GmbH², Vulcan Inc.³

09 Nov 2010

TL;DR: A large collection of neurogenomics-relevant data from the Web can be flexibly transformed into a unified ontology, allowing unified querying, navigation, and visualization; as well as support for wiki-style collaboration, crowdsourcing, and commentary on chosen data sets.

...read moreread less

Abstract: In this paper, we present a project which extends a SMW+ semantic wiki with a Linked Data Integration Framework that performs Web data access, vocabulary mapping, identity resolution, and quality evaluation of Linked Data. As a result, a large collection of neurogenomics-relevant data from the Web can be flexibly transformed into a unified ontology, allowing unified querying, navigation, and visualization; as well as support for wiki-style collaboration, crowdsourcing, and commentary on chosen data sets.

...read moreread less

12 citations

Journal Article•DOI•

Editorial: The Semantic Web Challenge, 2009

[...]

Christian Bizer¹, Peter Mika²•Institutions (2)

Free University of Berlin¹, Yahoo!²

01 Nov 2010-Journal of Web Semantics

TL;DR: The Semantic Web Challenge 2009 took place at the 8th InternationalSemantic Web Conference (ISWC 2009) near Washington, DC and provided the participants of the Billion Triples Track with a RDF data set consisting of 1.1 billion triples.

...read moreread less

9 citations

Proceedings Article•

Web of Linked Data - A global public data space on the Web.

[...]

Christian Bizer¹•Institutions (1)

Free University of Berlin¹

01 Jan 2010

6 citations

Proceedings Article•DOI•

The web of linked data: a global public dataspace on the web: WebDB 2010 keynote

[...]

Christian Bizer¹•Institutions (1)

Free University of Berlin¹

06 Jun 2010

TL;DR: Christian Bizer will introduce the principle ideas behind Linked Data and relate them to the dataspace architecture and give an overview of the identity resolution and pay-as-you-go data integration techniques that are currently used in the context of the Web of Linked data.

...read moreread less

Abstract: In 2005, Michael Franklin, Alon Halevy, and David Maier coined the term dataspaces as a new abstraction and target architecture for data management. In 2006, Tim Berners-Lee introduced the Linked Data principles a set of best practices for publishing and interlinking structured data on the Web in accordance with the general architecture of the Web. These two lines of thinking have come together and are realized in the Web of Linked Data - a global public dataspace. The Web of Linked Data was kick started by the W3C Linking Open Data community effort in 2007. Today, the Web of Linked Data consists of hundreds of datasets published by universities, companies, government and public sector bodies, as well as by individual Web enthusiasts. The content of the Web of Linked Data is diverse in nature, comprising data about people, organizations, products, geographic locations, books, scientific publications, films, music, television and radio programs, genes, proteins, drugs and clinical trials, online communities, statistical data, census results, and reviews. In addition to publishing and interlinking datasets, there is intensive work on Linked Data browsers, Web of Linked Data search engines and other applications that consume Linked Data from the Web. In his talk, Christian Bizer will introduce the principle ideas behind Linked Data and relate them to the dataspace architecture. Afterwards, he will give an overview of the identity resolution and pay-as-you-go data integration techniques that are currently used in the context of the Web of Linked Data and will explain the state-of-the-art in applications that consume Linked Data from the Web.

...read moreread less

5 citations

Journal Article•DOI•

WITHDRAWN: The Semantic Web challenge, 2009

[...]

Christian Bizer¹, Peter Mika²•Institutions (2)

Free University of Berlin¹, Yahoo!²

26 Aug 2010-Journal of Web Semantics

4 citations

Journal Article•DOI•

Semantische Mashups auf Basis des Linked Data Web

[...]

Christian Bizer¹, Christian Becker¹•Institutions (1)

Free University of Berlin¹

01 Feb 2010-Praxis Der Wirtschaftsinformatik

TL;DR: Der Artikel erläutert die technologischen Grundlagen von Linked Data and gibt anhand von Beispielen einen überblick üby den derzeitigen Entwicklungsstand semantischer Mashups.

...read moreread less

Abstract: Das World Wide Web wandelt sich von einem Medium zur Veroffentlichung von Texten zu einem Medium zur Veroffentlichung von strukturierten Daten. Neben Web-2.0-APIs spielen bei dieser Entwicklung zunehmend Linked-Data-Technologien eine zentrale Rolle. Linked-Data-Technologien ermoglichen die Vernetzung von Datenbanken mittels Datenlinks auf Basis der Webstandards HTTP-URIs und RDF (Resource Description Framework). Das Linked Data Web deckt ein breites Themenspektrum ab, unter anderem beinhaltet es Informationen zu Orten, Personen, Ereignissen, Publikationen, Musik, Filmen sowie biowissenschaftliche Daten. Semantische Mashups sind Anwendungen, die diesen Datenraum nutzen. Der Artikel erlautert die technologischen Grundlagen von Linked Data und gibt anhand von Beispielen einen uberblick uber den derzeitigen Entwicklungsstand semantischer Mashups.

...read moreread less