Home
/
Authors
/
Johann-Christoph Freytag

Author

Johann-Christoph Freytag

Other affiliations: Humboldt State University, IBM

Bio: Johann-Christoph Freytag is an academic researcher from Humboldt University of Berlin. The author has contributed to research in topics: Query optimization & Query language. The author has an hindex of 27, co-authored 99 publications receiving 3132 citations. Previous affiliations of Johann-Christoph Freytag include Humboldt State University & IBM.

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1993
1989
1988
1987
1986

Papers

PDF

Open Access

More filters

Journal Article•DOI•

The Stratosphere platform for big data analytics

[...]

Alexander Alexandrov¹, Rico Bergmann², Stephan Ewen¹, Johann-Christoph Freytag², Fabian Hueske¹, Arvid Heise³, Odej Kao¹, Marcus Leich¹, Ulf Leser², Volker Markl¹, Felix Naumann³, Mathias Peters², Astrid Rheinländer², Matthias J. Sax², Sebastian Schelter¹, Mareike Hoger¹, Kostas Tzoumas¹, Daniel Warneke⁴ - Show less +14 more•Institutions (4)

Technical University of Berlin¹, Humboldt University of Berlin², Hasso Plattner Institute³, International Computer Science Institute⁴

01 Dec 2014

TL;DR: The overall system architecture design decisions are presented, Stratosphere is introduced through example queries, and the internal workings of the system’s components that relate to extensibility, programming model, optimization, and query execution are dive into.

...read moreread less

Abstract: We present Stratosphere, an open-source software stack for parallel data analysis. Stratosphere brings together a unique set of features that allow the expressive, easy, and efficient programming of analytical applications at very large scale. Stratosphere's features include "in situ" data processing, a declarative query language, treatment of user-defined functions as first-class citizens, automatic program parallelization and optimization, support for iterative programs, and a scalable and efficient execution engine. Stratosphere covers a variety of "Big Data" use cases, such as data warehousing, information extraction and integration, data cleansing, graph analysis, and statistical analysis applications. In this paper, we present the overall system architecture design decisions, introduce Stratosphere through example queries, and then dive into the internal workings of the system's components that relate to extensibility, programming model, optimization, and query execution. We experimentally compare Stratosphere against popular open-source alternatives, and we conclude with a research outlook for the next years.

...read moreread less

491 citations

Book Chapter•DOI•

Executing SPARQL Queries over the Web of Linked Data

[...]

Olaf Hartig¹, Christian Bizer², Johann-Christoph Freytag¹•Institutions (2)

Humboldt University of Berlin¹, Free University of Berlin²

06 Nov 2009

TL;DR: An approach to execute SPARQL queries over the Web of Linked Data using an iterator-based pipeline to discover data that might be relevant for answering a query during the query execution itself and an extension of the iterator paradigm is proposed.

...read moreread less

Abstract: The Web of Linked Data forms a single, globally distributed dataspace. Due to the openness of this dataspace, it is not possible to know in advance all data sources that might be relevant for query answering. This openness poses a new challenge that is not addressed by traditional research on federated query processing. In this paper we present an approach to execute SPARQL queries over the Web of Linked Data. The main idea of our approach is to discover data that might be relevant for answering a query during the query execution itself. This discovery is driven by following RDF links between data sources based on URIs in the query and in partial results. The URIs are resolved over the HTTP protocol into RDF data which is continuously added to the queried dataset. This paper describes concepts and algorithms to implement our approach using an iterator-based pipeline. We introduce a formalization of the pipelining approach and show that classical iterators may cause blocking due to the latency of HTTP requests. To avoid blocking, we propose an extension of the iterator paradigm. The evaluation of our approach shows its strengths as well as the still existing challenges.

...read moreread less

386 citations

Journal Article•DOI•

Extensible query processing in starburst

[...]

Laura M. Haas¹, Johann-Christoph Freytag¹, Guy M. Lohman¹, Hamid Pirahesh¹•Institutions (1)

IBM¹

01 Jun 1989

TL;DR: The design of Starburst's query language processor is described and the ways in which the language processor can be extended to achieve the project's goals are discussed.

...read moreread less

Abstract: Today's DBMSs are unable to support the increasing demands of the various applications that would like to use a DBMS. Each kind of application poses new requirements for the DBMS. The Starburst project at IBM's Almaden Research Center aims to extend relational DBMS technology to bridge this gap between applications and the DBMS. While providing a full function relational system to enable sharing across applications, Starburst will also allow (sophisticated) programmers to add many kinds of extensions to the base system's capabilities, including language extensions (e.g., new datatypes and operations), data management extensions (e.g., new access and storage methods) and internal processing extensions (e.g., new join methods and new query transformations). To support these features, the database query language processor must be very powerful and highly extensible. Starburst's language processor features a powerful query language, rule-based optimization and query rewrite, and an execution system based on an extended relational algebra. In this paper, we describe the design of Starburst's query language processor and discuss the ways in which the language processor can be extended to achieve Starburst's goals.

...read moreread less

335 citations

Proceedings Article•DOI•

Quality-driven Integration of Heterogenous Information Systems

[...]

Felix Naumann¹, Ulf Leser², Johann-Christoph Freytag¹•Institutions (2)

Humboldt University of Berlin¹, Technical University of Berlin²

07 Sep 1999

TL;DR: A framework for multidatabase query processing that fully includes the quality of information in many facets, such as completeness, timeliness, accuracy, etc, is described.

...read moreread less

Abstract: Integrated access to information that is spread over multiple, distributed, and heterogeneous sources is an important problem in many scienti c and commercial domains. While much work has been done on query processing and choosing plans under cost criteria, very little is known about the important problem of incorporating the information quality aspect into query planning. In this paper we describe a framework for multidatabase query processing that fully includes the quality of information in many facets, such as completeness, timeliness, accuracy, etc. We seamlessly include information quality into a multidatabase query processor based on a view-rewriting mechanism. We model information quality at di erent levels to ultimately nd a set of high-quality queryanswering plans.

...read moreread less

243 citations

Proceedings Article•DOI•

A rule-based view of query optimization

[...]

Johann-Christoph Freytag¹•Institutions (1)

IBM¹

01 Dec 1987

TL;DR: This paper describes its operations by transformation rules which generate different QEPs from initial query specifications and hopes that the approach taken will contribute to the more general goal of a modular query optimizer as part of an extensible database management system.

...read moreread less

Abstract: The query optimizer is an important system component of a relational database management system (DBMS). It is the responsibility of this component to translate the user-submitted query - usually written in a non-procedural language - into an efficient query evaluation plan (QEP) which is then executed against the database. The research literature describes a wide variety of optimization strategies for different query languages and implementation environments. However, very little is known about how to design and structure the query optimization component to implement these strategies.This paper proposes a first step towards the design of a modular query optimizer. We describe its operations by transformation rules which generate different QEPs from initial query specifications. As we distinguish different aspects of the query optimization process, our hope is that the approach taken in this paper will contribute to the more general goal of a modular query optimizer as part of an extensible database management system.

...read moreread less

159 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Linked Data - the story so far

[...]

Christian Bizer¹, Tom Heath, Tim Berners-Lee²•Institutions (2)

Free University of Berlin¹, Massachusetts Institute of Technology²

01 Jul 2009-International Journal on Semantic Web and Information Systems

TL;DR: The authors describe progress to date in publishing Linked Data on the Web, review applications that have been developed to exploit the Web of Data, and map out a research agenda for the Linked data community as it moves forward.

...read moreread less

Abstract: The term “Linked Data” refers to a set of best practices for publishing and connecting structured data on the Web. These best practices have been adopted by an increasing number of data providers over the last three years, leading to the creation of a global data space containing billions of assertions— the Web of Data. In this article, the authors present the concept and technical principles of Linked Data, and situate these within the broader context of related technological developments. They describe progress to date in publishing Linked Data on the Web, review applications that have been developed to exploit the Web of Data, and map out a research agenda for the Linked Data community as it moves forward.

...read moreread less

5,113 citations

Journal Article•DOI•

QoS-aware middleware for Web services composition

[...]

Liangzhao Zeng¹, Boualem Benatallah, Anne H. H. Ngu², Marlon Dumas³, Jayant R. Kalagnanam¹, Henry Chang¹ - Show less +2 more•Institutions (3)

IBM¹, Texas State University², IEEE Computer Society³

01 May 2004-IEEE Transactions on Software Engineering

TL;DR: This paper presents a middleware platform which addresses the issue of selecting Web services for the purpose of their composition in a way that maximizes user satisfaction expressed as utility functions over QoS attributes, while satisfying the constraints set by the user and by the structure of the composite service.

...read moreread less

Abstract: The paradigmatic shift from a Web of manual interactions to a Web of programmatic interactions driven by Web services is creating unprecedented opportunities for the formation of online business-to-business (B2B) collaborations. In particular, the creation of value-added services by composition of existing ones is gaining a significant momentum. Since many available Web services provide overlapping or identical functionality, albeit with different quality of service (QoS), a choice needs to be made to determine which services are to participate in a given composite service. This paper presents a middleware platform which addresses the issue of selecting Web services for the purpose of their composition in a way that maximizes user satisfaction expressed as utility functions over QoS attributes, while satisfying the constraints set by the user and by the structure of the composite service. Two selection approaches are described and compared: one based on local (task-level) selection of services and the other based on global allocation of tasks to services using integer programming.

...read moreread less

2,872 citations

Proceedings Article•DOI•

Data integration: a theoretical perspective

[...]

Maurizio Lenzerini¹•Institutions (1)

Sapienza University of Rome¹

03 Jun 2002

TL;DR: The tutorial is focused on some of the theoretical issues that are relevant for data integration: modeling a data integration application, processing queries in data integration, dealing with inconsistent data sources, and reasoning on queries.

...read moreread less

Abstract: Data integration is the problem of combining data residing at different sources, and providing the user with a unified view of these data. The problem of designing data integration systems is important in current real world applications, and is characterized by a number of issues that are interesting from a theoretical point of view. This document presents on overview of the material to be presented in a tutorial on data integration. The tutorial is focused on some of the theoretical issues that are relevant for data integration. Special attention will be devoted to the following aspects: modeling a data integration application, processing queries in data integration, dealing with inconsistent data sources, and reasoning on queries.

...read moreread less

2,716 citations

[서평]「Applied Cryptography」

[...]

염흥렬

01 Apr 1997

TL;DR: The objective of this paper is to give a comprehensive introduction to applied cryptography with an engineer or computer scientist in mind on the knowledge needed to create practical systems which supports integrity, confidentiality, or authenticity.

...read moreread less

Abstract: The objective of this paper is to give a comprehensive introduction to applied cryptography with an engineer or computer scientist in mind. The emphasis is on the knowledge needed to create practical systems which supports integrity, confidentiality, or authenticity. Topics covered includes an introduction to the concepts in cryptography, attacks against cryptographic systems, key use and handling, random bit generation, encryption modes, and message authentication codes. Recommendations on algorithms and further reading is given in the end of the paper. This paper should make the reader able to build, understand and evaluate system descriptions and designs based on the cryptographic components described in the paper.

...read moreread less

2,188 citations

Book•

Linked Data: Evolving the Web into a Global Data Space

[...]

Tom Heath, Christian Bizer, Bebo White¹•Institutions (1)

SLAC National Accelerator Laboratory¹

02 Feb 2011

TL;DR: This Synthesis lecture provides readers with a detailed technical introduction to Linked Data, including coverage of relevant aspects of Web architecture, as the basis for application development, research or further study.

...read moreread less

Abstract: The World Wide Web has enabled the creation of a global information space comprising linked documents. As the Web becomes ever more enmeshed with our daily lives, there is a growing desire for direct access to raw data not currently available on the Web or bound up in hypertext documents. Linked Data provides a publishing paradigm in which not only documents, but also data, can be a first class citizen of the Web, thereby enabling the extension of the Web with a global data space based on open standards - the Web of Data. In this Synthesis lecture we provide readers with a detailed technical introduction to Linked Data. We begin by outlining the basic principles of Linked Data, including coverage of relevant aspects of Web architecture. The remainder of the text is based around two main themes - the publication and consumption of Linked Data. Drawing on a practical Linked Data scenario, we provide guidance and best practices on: architectural approaches to publishing Linked Data; choosing URIs and vocabularies to identify and describe resources; deciding what data to return in a description of a resource on the Web; methods and frameworks for automated linking of data sets; and testing and debugging approaches for Linked Data deployments. We give an overview of existing Linked Data applications and then examine the architectures that are used to consume Linked Data from the Web, alongside existing tools and frameworks that enable these. Readers can expect to gain a rich technical understanding of Linked Data fundamentals, as the basis for application development, research or further study.

...read moreread less

2,174 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse