Home
/
Authors
/
Herbert Van de Sompel

Author

Herbert Van de Sompel

Other affiliations: Royal Netherlands Academy of Arts and Sciences, Old Dominion University, Stanford University ...read more

Bio: Herbert Van de Sompel is an academic researcher from Los Alamos National Laboratory. The author has contributed to research in topics: Digital library & Open Archives Initiative. The author has an hindex of 41, co-authored 210 publications receiving 7392 citations. Previous affiliations of Herbert Van de Sompel include Royal Netherlands Academy of Arts and Sciences & Old Dominion University.

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1997

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Co-authorship networks in the digital library research community

[...]

Xiaoming Liu¹, Johan Bollen², Michael L. Nelson², Herbert Van de Sompel¹•Institutions (2)

Los Alamos National Laboratory¹, Old Dominion University²

01 Dec 2005-Information Processing and Management

TL;DR: In this paper, the authors examined the state of the DL domain after a decade of activity by applying social network analysis to the co-authorship network of the past ACM, IEEE, and joint ACM/IEEE digital library conferences.

...read moreread less

Abstract: The field of digital libraries (DLs) coalesced in 1994: the first digital library conferences were held that year, awareness of the World Wide Web was accelerating, and the National Science Foundation awarded $24 Million (US) for the Digital Library Initiative (DLI). In this paper we examine the state of the DL domain after a decade of activity by applying social network analysis to the co-authorship network of the past ACM, IEEE, and joint ACM/IEEE digital library conferences. We base our analysis on a common binary undirectional network model to represent the co-authorship network, and from it we extract several established network measures. We also introduce a weighted directional network model to represent the co-authorship network, for which we define AuthorRank as an indicator of the impact of an individual author in the network. The results are validated against conference program committee members in the same period. The results show clear advantages of PageRank and AuthorRank over degree, closeness and betweenness centrality metrics. We also investigate the amount and nature of international participation in Joint Conference on Digital Libraries (JCDL).

...read moreread less

828 citations

Journal Article•DOI•

A principal component analysis of 39 scientific impact measures.

[...]

Johan Bollen¹, Herbert Van de Sompel¹, Aric Hagberg¹, Ryan Chute¹•Institutions (1)

Los Alamos National Laboratory¹

29 Jun 2009-PLOS ONE

TL;DR: The results indicate that the notion of scientific impact is a multi-dimensional construct that can not be adequately measured by any single indicator, although some measures are more suitable than others.

...read moreread less

Abstract: Background: The impact of scientific publications has traditionally been expressed in terms of citation counts. However, scientific activity has moved online over the past decade. To better capture scientific impact in the digital era, a variety of new impact measures has been proposed on the basis of social network analysis and usage log data. Here we investigate how these new measures relate to each other, and how accurately and completely they express scientific impact. Methodology: We performed a principal component analysis of the rankings produced by 39 existing and proposed measures of scholarly impact that were calculated on the basis of both citation and usage log data. Conclusions: Our results indicate that the notion of scientific impact is a multi-dimensional construct that can not be adequately measured by any single indicator, although some measures are more suitable than others. The commonly used citation Impact Factor is not positioned at the core of this construct, but at its periphery, and should thus be used with caution.

...read moreread less

544 citations

Proceedings Article•DOI•

The open archives initiative: building a low-barrier interoperability framework

[...]

Carl Lagoze¹, Herbert Van de Sompel¹•Institutions (1)

Cornell University¹

01 Jan 2001

TL;DR: The recent history of the OAI is described - its origins in promoting E-Prints, the broadening of its focus, the details of its technical standard for metadata harvesting, the applications of this standard, and future plans.

...read moreread less

Abstract: The Open Archives Initiative (OAI) develops and promotes interoperabil ity solutions that aim to facilitate the efficient dissemination of content The roots of the OAI lie in the E-Print community Over the last year its focus has been extended to include all content providers This paper describes the recent history of the OAI - its origins in promoting E-Prints, the broadening of its focus, the details of its technical standard for metadata harvesting, the applications of this standard, and future plans

...read moreread less

415 citations

Journal Article•DOI•

Journal Status

[...]

Johan Bollen, Marko A. Rodriguez, Herbert Van de Sompel

09 Jan 2006

TL;DR: In this article, the authors compare the rankings of journals according to their ISI Impact Factor and their weighted PageRank, and find that the resulting journal rankings correspond well to a general understanding of journal status.

...read moreread less

Abstract: The status of an actor in a social context is commonly defined in terms of two factors: the total number of endorsements the actor receives from other actors and the prestige of the endorsing actors. These two factors indicate the distinction between popularity and expert appreciation of the actor, respectively. We refer to the former as popularity and to the latter as prestige. These notions of popularity and prestige also apply to the domain of scholarly assessment. The ISI Impact Factor (ISI IF) is defined as the mean number of citations a journal receives over a 2 year period. By merely counting the amount of citations and disregarding the prestige of the citing journals, the ISI IF is a metric of popularity, not of prestige. We demonstrate how a weighted version of the popular PageRank algorithm can be used to obtain a metric that reflects prestige. We contrast the rankings of journals according to their ISI IF and their weighted PageRank, and we provide an analysis that reveals both significant overlaps and differences. Furthermore, we introduce the Y-factor which is a simple combination of both the ISI IF and the weighted PageRank, and find that the resulting journal rankings correspond well to a general understanding of journal status.

...read moreread less

340 citations

Journal Article•DOI•

The Santa Fe Convention of the Open Archives Initiative

[...]

Herbert Van de Sompel, Carl Lagoze

15 Feb 2000-D-lib Magazine

TL;DR: The convention presents a simple technical and organizational framework to support basic interoperability among e-print archives and participants have expressed the intention of implementing this framework to allow for interoperability experiments in the course of the year 2000.

...read moreread less

Abstract: Welcome to the Santa Fe Convention. This convention is the result of a meeting of the Open Archives Initiative which was held in Santa Fe, New Mexico, on October 21-22 1999. This convention has been endorsed unanimously by all the participants at the meeting, who represented organizations maintaining or planning e-print archives intended for open access and organizations interested in providing services, such as search interfaces or citation-linking, based on the data in those archives. The convention presents a simple technical and organizational framework to support basic interoperability among e-print archives. Participants have expressed the intention of implementing this framework to allow for interoperability experiments in the course of the year 2000. Maintainers of existing or forthcoming e-print archives that were not represented at the meeting are strongly encouraged to join this effort by implementing the framework for their archives.

...read moreread less

249 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Machine learning

[...]

Thomas G. Dietterich¹•Institutions (1)

Oregon State University¹

01 Dec 1996-ACM Computing Surveys

TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.

...read moreread less

Abstract: Machine Learning is the study of methods for programming computers to learn. Computers are applied to a wide range of tasks, and for most of these it is relatively easy for programmers to design and implement the necessary software. However, there are many tasks for which this is difficult or impossible. These can be divided into four general categories. First, there are problems for which there exist no human experts. For example, in modern automated manufacturing facilities, there is a need to predict machine failures before they occur by analyzing sensor readings. Because the machines are new, there are no human experts who can be interviewed by a programmer to provide the knowledge necessary to build a computer system. A machine learning system can study recorded data and subsequent machine failures and learn prediction rules. Second, there are problems where human experts exist, but where they are unable to explain their expertise. This is the case in many perceptual tasks, such as speech recognition, hand-writing recognition, and natural language understanding. Virtually all humans exhibit expert-level abilities on these tasks, but none of them can describe the detailed steps that they follow as they perform them. Fortunately, humans can provide machines with examples of the inputs and correct outputs for these tasks, so machine learning algorithms can learn to map the inputs to the outputs. Third, there are problems where phenomena are changing rapidly. In finance, for example, people would like to predict the future behavior of the stock market, of consumer purchases, or of exchange rates. These behaviors change frequently, so that even if a programmer could construct a good predictive computer program, it would need to be rewritten frequently. A learning program can relieve the programmer of this burden by constantly modifying and tuning a set of learned prediction rules. Fourth, there are applications that need to be customized for each computer user separately. Consider, for example, a program to filter unwanted electronic mail messages. Different users will need different filters. It is unreasonable to expect each user to program his or her own rules, and it is infeasible to provide every user with a software engineer to keep the rules up-to-date. A machine learning system can learn which mail messages the user rejects and maintain the filtering rules automatically. Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis. Statistics focuses on understanding the phenomena that have generated the data, often with the goal of testing different hypotheses about those phenomena. Data mining seeks to find patterns in the data that are understandable by people. Psychological studies of human learning aspire to understand the mechanisms underlying the various learning behaviors exhibited by people (concept learning, skill acquisition, strategy change, etc.).

...read moreread less

13,246 citations

Journal Article•DOI•

Software survey: VOSviewer, a computer program for bibliometric mapping.

[...]

Nees Jan van Eck¹, Nees Jan van Eck², Ludo Waltman², Ludo Waltman¹•Institutions (2)

Leiden University¹, Erasmus University Rotterdam²

01 Jan 2010-Scientometrics

TL;DR: VOSviewer’s ability to handle large maps is demonstrated by using the program to construct and display a co-citation map of 5,000 major scientific journals.

...read moreread less

Abstract: We present VOSviewer, a freely available computer program that we have developed for constructing and viewing bibliometric maps. Unlike most computer programs that are used for bibliometric mapping, VOSviewer pays special attention to the graphical representation of bibliometric maps. The functionality of VOSviewer is especially useful for displaying large bibliometric maps in an easy-to-interpret way. The paper consists of three parts. In the first part, an overview of VOSviewer’s functionality for displaying bibliometric maps is provided. In the second part, the technical implementation of specific parts of the program is discussed. Finally, in the third part, VOSviewer’s ability to handle large maps is demonstrated by using the program to construct and display a co-citation map of 5,000 major scientific journals.

...read moreread less

7,719 citations

Journal Article•DOI•

Linked Data - the story so far

[...]

Christian Bizer¹, Tom Heath, Tim Berners-Lee²•Institutions (2)

Free University of Berlin¹, Massachusetts Institute of Technology²

01 Jul 2009-International Journal on Semantic Web and Information Systems

TL;DR: The authors describe progress to date in publishing Linked Data on the Web, review applications that have been developed to exploit the Web of Data, and map out a research agenda for the Linked data community as it moves forward.

...read moreread less

Abstract: The term “Linked Data” refers to a set of best practices for publishing and connecting structured data on the Web. These best practices have been adopted by an increasing number of data providers over the last three years, leading to the creation of a global data space containing billions of assertions— the Web of Data. In this article, the authors present the concept and technical principles of Linked Data, and situate these within the broader context of related technological developments. They describe progress to date in publishing Linked Data on the Web, review applications that have been developed to exploit the Web of Data, and map out a research agenda for the Linked Data community as it moves forward.

...read moreread less

5,113 citations

Social Network Analysis

[...]

Tom A. B. Snijders

01 Jan 2012

3,692 citations

Journal Article•DOI•

DBpedia - A Large-scale, Multilingual Knowledge Base Extracted from Wikipedia

[...]

Jens Lehmann¹, Robert Isele, Max Jakob, Anja Jentzsch², Dimitris Kontokostas¹, Pablo N. Mendes³, Sebastian Hellmann¹, Mohamed Morsey¹, Patrick van Kleef⁴, Sören Auer¹, Sören Auer⁵, Christian Bizer⁶ - Show less +8 more•Institutions (6)

Leipzig University¹, Hasso Plattner Institute², Wright State University³, OpenLink Software⁴, University of Bonn⁵, University of Mannheim⁶

01 Jan 2015-Social Work

TL;DR: An overview of the DBpedia community project is given, including its architecture, technical implementation, maintenance, internationalisation, usage statistics and applications, including DBpedia one of the central interlinking hubs in the Linked Open Data (LOD) cloud.

...read moreread less

Abstract: The DBpedia community project extracts structured, multilingual knowledge from Wikipedia and makes it freely available on the Web using Semantic Web and Linked Data technologies. The project extracts knowledge from 111 different language editions of Wikipedia. The largest DBpedia knowledge base which is extracted from the English edition of Wikipedia consists of over 400 million facts that describe 3.7 million things. The DBpedia knowledge bases that are extracted from the other 110 Wikipedia editions together consist of 1.46 billion facts and describe 10 million additional things. The DBpedia project maps Wikipedia infoboxes from 27 different language editions to a single shared ontology consisting of 320 classes and 1,650 properties. The mappings are created via a world-wide crowd-sourcing effort and enable knowledge from the different Wikipedia editions to be combined. The project publishes releases of all DBpedia knowledge bases for download and provides SPARQL query access to 14 out of the 111 language editions via a global network of local DBpedia chapters. In addition to the regular releases, the project maintains a live knowledge base which is updated whenever a page in Wikipedia changes. DBpedia sets 27 million RDF links pointing into over 30 external data sources and thus enables data from these sources to be used together with DBpedia data. Several hundred data sets on the Web publish RDF links pointing to DBpedia themselves and make DBpedia one of the central interlinking hubs in the Linked Open Data (LOD) cloud. In this system report, we give an overview of the DBpedia community project, including its architecture, technical implementation, maintenance, internationalisation, usage statistics and applications.

...read moreread less

2,856 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse