scispace - formally typeset
Search or ask a question
Institution

University of Paderborn

EducationPaderborn, Nordrhein-Westfalen, Germany
About: University of Paderborn is a education organization based out in Paderborn, Nordrhein-Westfalen, Germany. It is known for research contribution in the topics: Control reconfiguration & Software. The organization has 6684 authors who have published 16929 publications receiving 323154 citations.


Papers
More filters
Journal ArticleDOI
TL;DR: This article presents a discussion on eight open challenges for data stream mining, which cover the full cycle of knowledge discovery and involve such problems as protecting data privacy, dealing with legacy systems, handling incomplete and delayed information, analysis of complex data, and evaluation of stream mining algorithms.
Abstract: Every day, huge volumes of sensory, transactional, and web data are continuously generated as streams, which need to be analyzed online as they arrive. Streaming data can be considered as one of the main sources of what is called big data. While predictive modeling for data streams and big data have received a lot of attention over the last decade, many research approaches are typically designed for well-behaved controlled problem settings, overlooking important challenges imposed by real-world applications. This article presents a discussion on eight open challenges for data stream mining. Our goal is to identify gaps between current research and meaningful applications, highlight open problems, and define new application-relevant research directions for data stream mining. The identified challenges cover the full cycle of knowledge discovery and involve such problems as: protecting data privacy, dealing with legacy systems, handling incomplete and delayed information, analysis of complex data, and evaluation of stream mining algorithms. The resulting analysis is illustrated by practical applications and provides general suggestions concerning lines of future research in data stream mining.

260 citations

Proceedings Article
16 Jan 2010
TL;DR: A new k-means clustering algorithm for data streams of points from a Euclidean space that provides a good alternative to BIRCH and StreamLS, in particular, if the number of cluster centers is large.
Abstract: We develop a new k-means clustering algorithm for data streams, which we call StreamKM++. Our algorithm computes a small weighted sample of the data stream and solves the problem on the sample using the k-means++ algorithm [1]. To compute the small sample, we propose two new techniques. First, we use a non-uniform sampling approach similar to the k-means++ seeding procedure to obtain small coresets from the data stream. This construction is rather easy to implement and, unlike other coreset constructions, its running time has only a low dependency on the dimensionality of the data. Second, we propose a new data structure which we call a coreset tree. The use of these coreset trees significantly speeds up the time necessary for the non-uniform sampling during our coreset construction. We compare our algorithm experimentally with two well-known streaming implementations (BIRCH [16] and StreamLS [4, 9]). In terms of quality (sum of squared errors), our algorithm is comparable with StreamLS and significantly better than BIRCH (up to a factor of 2). In terms of running time, our algorithm is slower than BIRCH. Comparing the running time with StreamLS, it turns out that our algorithm scales much better with increasing number of centers. We conclude that, if the first priority is the quality of the clustering, then our algorithm provides a good alternative to BIRCH and StreamLS, in particular, if the number of cluster centers is large. We also give a theoretical justification of our approach by proving that our sample set is a small coreset in low dimensional spaces.

257 citations

Journal ArticleDOI
TL;DR: In this article, the state of a qubit encoded in the polarization state was demonstrated from a telecom-wavelength photon to a solid-state quantum memory via 24.8 km of optical fibre.
Abstract: Quantum teleportation of the state of a qubit encoded in the polarization state is demonstrated from a telecom-wavelength photon to a solid-state quantum memory via 24.8 km of optical fibre. It is the longest distance ever reached in a teleportation experiment involving a quantum memory.

256 citations

Journal ArticleDOI
TL;DR: This paper discusses the multi-depot, multi-vehicle-type bus scheduling problem (MDVSP), involving multiple depots for vehicles and different vehicle types for timetabled trips, and uses time–space-based instead of connection-based networks for MDVSP modeling.

256 citations

Journal ArticleDOI
TL;DR: In this article, a preliminary phytolith classification scheme was used in soil phytochemical counting procedures to produce typical opal phytophytolith spectra for comparison. But the method was not suitable to describe and characterize rain forest and grassland vegetation.

255 citations


Authors

Showing all 6872 results

NameH-indexPapersCitations
Martin Karplus163831138492
Marco Dorigo10565791418
Robert W. Boyd98116137321
Thomas Heine8442324210
Satoru Miyano8481138723
Wen-Xiu Ma8342020702
Jörg Neugebauer8149130909
Thomas Lengauer8047734430
Gotthard Seifert8044526136
Reshef Tenne7452924717
Tim Meyer7454824784
Qiang Cui7129220655
Thomas Frauenheim7045117887
Walter Richtering6733214866
Marcus Elstner6720918960
Network Information
Related Institutions (5)
Eindhoven University of Technology
52.9K papers, 1.5M citations

95% related

Royal Institute of Technology
68.4K papers, 1.9M citations

93% related

Karlsruhe Institute of Technology
82.1K papers, 2.1M citations

93% related

École Polytechnique Fédérale de Lausanne
98.2K papers, 4.3M citations

91% related

Carnegie Mellon University
104.3K papers, 5.9M citations

91% related

Performance
Metrics
No. of papers from the Institution in previous years
YearPapers
2023131
2022242
20211,030
20201,010
2019948
2018967