Author

Georgi Kobilarov

Bio: Georgi Kobilarov is an academic researcher from Free University of Berlin. The author has contributed to research in topics: Linked Data & Semantic Web. The author has an h-index of 8 and has co-authored 8 publications receiving 7,289 citations.

Papers
Book Chapter
11 Nov 2007
TL;DR: The extraction of the DBpedia datasets is described, along with how the resulting information is published on the Web for human and machine consumption and how DBpedia could serve as a nucleus for an emerging Web of open data.
Abstract: DBpedia is a community effort to extract structured information from Wikipedia and to make this information available on the Web. DBpedia allows you to ask sophisticated queries against datasets derived from Wikipedia and to link other datasets on the Web to Wikipedia data. We describe the extraction of the DBpedia datasets, and how the resulting information is published on the Web for human and machine consumption. We describe some emerging applications from the DBpedia community and show how website authors can facilitate DBpedia content within their sites. Finally, we present the current status of interlinking DBpedia with other open datasets on the Web and outline how DBpedia could serve as a nucleus for an emerging Web of open data.

4,828 citations
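To make "sophisticated queries against datasets derived from Wikipedia" concrete, here is a minimal sketch in Python against DBpedia's public SPARQL endpoint. The endpoint URL is the current public one, not one named in the 2007 paper, and the example query (large German cities) is purely illustrative.

```python
# Minimal sketch: ask DBpedia a structured question over Wikipedia-derived
# data. Assumes the public endpoint at https://dbpedia.org/sparql.
import requests

QUERY = """
PREFIX dbo: <http://dbpedia.org/ontology/>
PREFIX dbr: <http://dbpedia.org/resource/>
SELECT ?city ?population WHERE {
  ?city a dbo:City ;
        dbo:country dbr:Germany ;
        dbo:populationTotal ?population .
  FILTER (?population > 500000)
}
ORDER BY DESC(?population)
"""

resp = requests.get(
    "https://dbpedia.org/sparql",
    params={"query": QUERY, "format": "application/sparql-results+json"},
    timeout=30,
)
resp.raise_for_status()
for row in resp.json()["results"]["bindings"]:
    print(row["city"]["value"], row["population"]["value"])
```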

Journal Article
TL;DR: The extraction of the DBpedia knowledge base is described, the current status of interlinking DBpedia with other data sources on the Web is discussed, and an overview of applications that facilitate the Web of Data around DBpedia is given.

2,224 citations

01 Apr 2009
TL;DR: The Silk - Link Discovery Framework is presented, a tool for finding relationships between entities within different data sources and features a declarative language for specifying which types of RDF links should be discovered between data sources as well as which conditions entities must fulfill in order to be interlinked.
Abstract: The Web of Data is built upon two simple ideas: Employ the RDF data model to publish structured data on the Web and to set explicit RDF links between entities within different data sources. This paper presents the Silk - Link Discovery Framework, a tool for finding relationships between entities within different data sources. Data publishers can use Silk to set RDF links from their data sources to other data sources on the Web. Silk features a declarative language for specifying which types of RDF links should be discovered between data sources as well as which conditions entities must fulfill in order to be interlinked. Link conditions may be based on various similarity metrics and can take the graph around entities into account, which is addressed using a path-based selector language. Silk accesses data sources over the SPARQL protocol and can thus be used without having to replicate datasets locally.

464 citations
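Silk's link conditions are written in its own declarative XML language, which the abstract does not reproduce; the Python sketch below only mimics the underlying idea: an entity pair is interlinked when a similarity metric over selected properties clears a threshold. All entities, URIs, and the threshold value here are invented.

```python
# Illustrative mimic of a Silk link condition: emit an owl:sameAs link
# when label similarity exceeds a threshold. Not Silk's actual syntax.
from difflib import SequenceMatcher

def label_similarity(a: str, b: str) -> float:
    """Normalized string similarity in [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()

# Hypothetical entities, as if already fetched from two SPARQL sources.
source_a = [{"uri": "http://a.example/Berlin", "label": "Berlin"}]
source_b = [{"uri": "http://b.example/city/berlin", "label": "Berlin, Germany"}]

THRESHOLD = 0.5  # invented; real specifications tune this per metric
for ea in source_a:
    for eb in source_b:
        if label_similarity(ea["label"], eb["label"]) >= THRESHOLD:
            # Emit the discovered RDF link as one N-Triples line.
            print(f"<{ea['uri']}> "
                  f"<http://www.w3.org/2002/07/owl#sameAs> "
                  f"<{eb['uri']}> .")
```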

Book Chapter
06 Nov 2009
TL;DR: The Silk - Linking Framework is presented, a toolkit for discovering and maintaining data links between Web data sources. It allows data sources to exchange both linksets and detailed change information, and enables continuous link recomputation.
Abstract: The Web of Data is built upon two simple ideas: Employ the RDF data model to publish structured data on the Web and to create explicit data links between entities within different data sources. This paper presents the Silk - Linking Framework, a toolkit for discovering and maintaining data links between Web data sources. Silk consists of three components: 1. A link discovery engine, which computes links between data sources based on a declarative specification of the conditions that entities must fulfill in order to be interlinked; 2. A tool for evaluating the generated data links in order to fine-tune the linking specification; 3. A protocol for maintaining data links between continuously changing data sources. The protocol allows data sources to exchange both linksets and detailed change information, and enables continuous link recomputation. The interplay of all the components is demonstrated within a life science use case.

353 citations
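The maintenance protocol itself is not detailed in the abstract; the sketch below only illustrates the general idea of continuous link recomputation on the consuming side, assuming a remote source reports which of its resources changed. All names and data structures are hypothetical.

```python
# Hypothetical sketch of change-driven link maintenance: when a source
# reports changed resources, only links touching them are recomputed,
# instead of rerunning discovery over the full datasets.

def recompute_links(current_links, changed_uris, rediscover):
    """current_links: set of (source_uri, target_uri) pairs.
    changed_uris: URIs the remote source reports as added/updated.
    rediscover: callable running link discovery for one changed URI."""
    # Drop links whose target may have changed...
    kept = {(s, t) for (s, t) in current_links if t not in changed_uris}
    # ...and rediscover links only for the changed resources.
    for uri in changed_uris:
        kept |= set(rediscover(uri))
    return kept

# Toy usage with a stub discovery function.
links = {("http://a.example/1", "http://b.example/x")}
changed = {"http://b.example/x"}
updated = recompute_links(
    links, changed, rediscover=lambda uri: [("http://a.example/1", uri)]
)
print(updated)
```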

Book Chapter
31 May 2009
TL;DR: Current projects, ongoing development, and further research are described in a joint collaboration between the BBC, Freie Universität Berlin and Rattle Research in order to use DBpedia as the controlled vocabulary and semantic backbone for the whole BBC.
Abstract: In this paper, we describe how the BBC is working to integrate data and link documents across BBC domains by using Semantic Web technology, in particular Linked Data, MusicBrainz and DBpedia. We cover the work of BBC Programmes and BBC Music building Linked Data sites for all music- and programme-related brands, and we describe existing projects, ongoing development, and further research we are doing in a joint collaboration between the BBC, Freie Universität Berlin and Rattle Research in order to use DBpedia as the controlled vocabulary and semantic backbone for the whole BBC.

297 citations
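Using DBpedia as a controlled vocabulary boils down to resolving editorial terms to stable DBpedia URIs. The sketch below does this with a naive exact-label lookup against the public SPARQL endpoint; the BBC's actual tagging pipeline is more sophisticated and is not described at this level of detail in the paper.

```python
# Naive term-to-URI resolution against DBpedia, illustrating the
# "controlled vocabulary" step. Exact-label matching only; no escaping
# of the term is done, which is acceptable for a sketch.
import requests

def dbpedia_uri_for(term: str, lang: str = "en"):
    query = f"""
    SELECT ?s WHERE {{
      ?s rdfs:label "{term}"@{lang} .
      FILTER (STRSTARTS(STR(?s), "http://dbpedia.org/resource/"))
    }} LIMIT 1
    """
    resp = requests.get(
        "https://dbpedia.org/sparql",
        params={"query": query, "format": "application/sparql-results+json"},
        timeout=30,
    )
    resp.raise_for_status()
    rows = resp.json()["results"]["bindings"]
    return rows[0]["s"]["value"] if rows else None

print(dbpedia_uri_for("The Beatles"))  # e.g. http://dbpedia.org/resource/The_Beatles
```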


Cited by
Journal Article
TL;DR: The authors describe progress to date in publishing Linked Data on the Web, review applications that have been developed to exploit the Web of Data, and map out a research agenda for the Linked Data community as it moves forward.
Abstract: The term "Linked Data" refers to a set of best practices for publishing and connecting structured data on the Web. These best practices have been adopted by an increasing number of data providers over the last three years, leading to the creation of a global data space containing billions of assertions: the Web of Data. In this article, the authors present the concept and technical principles of Linked Data, and situate these within the broader context of related technological developments. They describe progress to date in publishing Linked Data on the Web, review applications that have been developed to exploit the Web of Data, and map out a research agenda for the Linked Data community as it moves forward.

5,113 citations
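The best practices the article describes rest on dereferenceable URIs: requesting an entity URI with an RDF media type in the Accept header returns structured data about that entity. A minimal sketch of this "follow your nose" pattern, using DBpedia as a concrete (illustrative) endpoint:

```python
# Dereference a Linked Data URI via HTTP content negotiation and
# print the start of the returned RDF (Turtle) description.
import requests

uri = "http://dbpedia.org/resource/Tim_Berners-Lee"
resp = requests.get(
    uri,
    headers={"Accept": "text/turtle"},
    timeout=30,
    allow_redirects=True,  # Linked Data servers often 303-redirect to a data document
)
resp.raise_for_status()
print(resp.headers.get("Content-Type"))
print(resp.text[:500])  # first few Turtle triples describing the entity
```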

Journal Article
TL;DR: An overview of the DBpedia community project is given, covering its architecture, technical implementation, maintenance, internationalisation, usage statistics and applications, as well as its role as one of the central interlinking hubs in the Linked Open Data (LOD) cloud.
Abstract: The DBpedia community project extracts structured, multilingual knowledge from Wikipedia and makes it freely available on the Web using Semantic Web and Linked Data technologies. The project extracts knowledge from 111 different language editions of Wikipedia. The largest DBpedia knowledge base which is extracted from the English edition of Wikipedia consists of over 400 million facts that describe 3.7 million things. The DBpedia knowledge bases that are extracted from the other 110 Wikipedia editions together consist of 1.46 billion facts and describe 10 million additional things. The DBpedia project maps Wikipedia infoboxes from 27 different language editions to a single shared ontology consisting of 320 classes and 1,650 properties. The mappings are created via a world-wide crowd-sourcing effort and enable knowledge from the different Wikipedia editions to be combined. The project publishes releases of all DBpedia knowledge bases for download and provides SPARQL query access to 14 out of the 111 language editions via a global network of local DBpedia chapters. In addition to the regular releases, the project maintains a live knowledge base which is updated whenever a page in Wikipedia changes. DBpedia sets 27 million RDF links pointing into over 30 external data sources and thus enables data from these sources to be used together with DBpedia data. Several hundred data sets on the Web publish RDF links pointing to DBpedia themselves and make DBpedia one of the central interlinking hubs in the Linked Open Data (LOD) cloud. In this system report, we give an overview of the DBpedia community project, including its architecture, technical implementation, maintenance, internationalisation, usage statistics and applications.

2,856 citations
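The interlinking role the report describes can be inspected directly: the sketch below samples a few outgoing owl:sameAs links for a single resource from the public SPARQL endpoint. The resource chosen is arbitrary.

```python
# Sample outgoing owl:sameAs links from one DBpedia resource to see
# the cross-dataset interlinking described in the report.
import requests

QUERY = """
PREFIX owl: <http://www.w3.org/2002/07/owl#>
SELECT ?same WHERE {
  <http://dbpedia.org/resource/Berlin> owl:sameAs ?same .
} LIMIT 10
"""
resp = requests.get(
    "https://dbpedia.org/sparql",
    params={"query": QUERY, "format": "application/sparql-results+json"},
    timeout=30,
)
resp.raise_for_status()
for row in resp.json()["results"]["bindings"]:
    print(row["same"]["value"])
```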

Journal Article
TL;DR: This collaboratively edited knowledgebase provides a common source of data for Wikipedia, and everyone else, to help improve the quality of the encyclopedia.
Abstract: This collaboratively edited knowledgebase provides a common source of data for Wikipedia, and everyone else.

2,809 citations
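One way "everyone else" consumes this common data source is the MediaWiki action API. A minimal sketch fetching one entity's English label; Q42 (Douglas Adams) is the stock example entity in Wikidata's own documentation.

```python
# Fetch a single Wikidata entity and print its English label.
import requests

resp = requests.get(
    "https://www.wikidata.org/w/api.php",
    params={
        "action": "wbgetentities",
        "ids": "Q42",
        "props": "labels",
        "languages": "en",
        "format": "json",
    },
    timeout=30,
)
resp.raise_for_status()
entity = resp.json()["entities"]["Q42"]
print(entity["labels"]["en"]["value"])  # -> "Douglas Adams"
```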

Book
05 Jun 2007
TL;DR: The second edition of Ontology Matching has been thoroughly revised and updated to reflect the most recent advances in this quickly developing area, which resulted in more than 150 pages of new content.
Abstract: Ontologies tend to be found everywhere. They are viewed as the silver bullet for many applications, such as database integration, peer-to-peer systems, e-commerce, semantic web services, or social networks. However, in open or evolving systems, such as the semantic web, different parties would, in general, adopt different ontologies. Thus, merely using ontologies, like using XML, does not reduce heterogeneity: it just raises heterogeneity problems to a higher level. Euzenat and Shvaiko's book is devoted to ontology matching as a solution to the semantic heterogeneity problem faced by computer systems. Ontology matching aims at finding correspondences between semantically related entities of different ontologies. These correspondences may stand for equivalence as well as other relations, such as consequence, subsumption, or disjointness, between ontology entities. Many different matching solutions have been proposed so far from various viewpoints, e.g., databases, information systems, and artificial intelligence. The second edition of Ontology Matching has been thoroughly revised and updated to reflect the most recent advances in this quickly developing area, which resulted in more than 150 pages of new content. In particular, the book includes a new chapter dedicated to the methodology for performing ontology matching. It also covers emerging topics, such as data interlinking, ontology partitioning and pruning, context-based matching, matcher tuning, alignment debugging, and user involvement in matching, to mention a few. More than 100 state-of-the-art matching systems and frameworks were reviewed. With Ontology Matching, researchers and practitioners will find a reference book that presents currently available work in a uniform framework. In particular, the work and the techniques presented in this book can be equally applied to database schema matching, catalog integration, XML schema matching and other related problems. The objectives of the book include presenting (i) the state of the art and (ii) the latest research results in ontology matching by providing a systematic and detailed account of matching techniques and matching systems from theoretical, practical and application perspectives.

2,579 citations
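As a toy illustration of the element-level techniques the book surveys, the sketch below proposes equivalence correspondences between the classes of two invented mini-ontologies from name similarity alone. Real matchers combine many such basic matchers with structural and semantic evidence.

```python
# Toy element-level ontology matcher: propose class equivalences by
# name similarity above a threshold. Both ontologies are invented.
from difflib import SequenceMatcher

ontology_a = ["Person", "Organisation", "Publication"]
ontology_b = ["Human", "Organization", "Paper"]

def name_sim(a: str, b: str) -> float:
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()

for ca in ontology_a:
    best = max(ontology_b, key=lambda cb: name_sim(ca, cb))
    score = name_sim(ca, best)
    if score >= 0.8:  # threshold is invented; filters weak matches
        print(f"{ca} = {best}  (confidence {score:.2f})")
# Prints only "Organisation = Organization": pure name similarity
# misses Person/Human, which is exactly why richer techniques exist.
```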

Journal Article
14 Jan 2011 - Science
TL;DR: This work surveys the vast terrain of 'culturomics,' focusing on linguistic and cultural phenomena that were reflected in the English language between 1800 and 2000, and shows how this approach can provide insights about fields as diverse as lexicography, the evolution of grammar, collective memory, the adoption of technology and the pursuit of fame.
Abstract: We constructed a corpus of digitized texts containing about 4% of all books ever printed. Analysis of this corpus enables us to investigate cultural trends quantitatively. We survey the vast terrain of 'culturomics,' focusing on linguistic and cultural phenomena that were reflected in the English language between 1800 and 2000. We show how this approach can provide insights about fields as diverse as lexicography, the evolution of grammar, collective memory, the adoption of technology, the pursuit of fame, censorship, and historical epidemiology. Culturomics extends the boundaries of rigorous quantitative inquiry to a wide array of new phenomena spanning the social sciences and the humanities.

2,257 citations
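The core quantity behind these trend analyses is an n-gram's relative frequency per year: its match count divided by the total number of words printed that year. The sketch below imitates the column layout of the public n-gram files (ngram, year, match_count, volume_count), but every number in it is invented.

```python
# Compute per-year relative frequency of an n-gram. All counts and
# yearly totals below are invented placeholders for illustration.
rows = [
    ("influenza", 1917, 12000, 800),
    ("influenza", 1918, 95000, 2100),
    ("influenza", 1919, 60000, 1700),
]
# Hypothetical totals of all 1-gram tokens counted in each year.
total_words_per_year = {1917: 5.0e8, 1918: 5.2e8, 1919: 5.5e8}

for ngram, year, matches, _volumes in rows:
    freq = matches / total_words_per_year[year]
    print(f"{ngram}\t{year}\t{freq:.2e}")
```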