Author

Daniel Puschmann

Bio: Daniel Puschmann is an academic researcher from the University of Surrey. The author has contributed to research in topics: Data stream mining & Raw data. The author has an h-index of 7 and has co-authored 9 publications receiving 478 citations.

Papers
Journal ArticleDOI
TL;DR: The CityPulse framework supports smart city service creation by means of a distributed system for semantic discovery, data analytics, and interpretation of large-scale (near-)real-time Internet of Things data and social media data streams to break away from silo applications and enable cross-domain data integration.
Abstract: Our world and our lives are changing in many ways. Communication, networking, and computing technologies are among the most influential enablers that shape our lives today. Digital data and connected worlds of physical objects, people, and devices are rapidly changing the way we work, travel, socialize, and interact with our surroundings, and they have a profound impact on different domains, such as healthcare, environmental monitoring, urban systems, and control and management applications, among several other areas. Cities currently face an increasing demand for providing services that can have an impact on people’s everyday lives. The CityPulse framework supports smart city service creation by means of a distributed system for semantic discovery, data analytics, and interpretation of large-scale (near-)real-time Internet of Things data and social media data streams. The goal is to break away from silo applications and enable cross-domain data integration. The CityPulse framework integrates multimodal, mixed-quality, uncertain, and incomplete data to create reliable, dependable information and continuously adapts data processing techniques to meet the quality-of-information requirements of end users. Unlike existing solutions, which mainly offer unified views of the data, the CityPulse framework is also equipped with powerful data analytics modules that perform intelligent data aggregation, event detection, quality assessment, contextual filtering, and decision support. This paper presents the framework, describes its components, and demonstrates how they interact to support easy development of custom-made applications for citizens. The benefits and the effectiveness of the framework are demonstrated in a use-case scenario implementation presented in this paper.
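To make the decision-support idea in this abstract concrete, the following minimal sketch shows how events from two city domains could be fused by location after quality filtering. The event fields, thresholds, and combine() rule are illustrative assumptions, not the CityPulse framework's actual API.

```python
# Illustrative sketch only: cross-domain event fusion in the spirit of the
# decision-support and contextual-filtering modules described above.
from dataclasses import dataclass

@dataclass
class CityEvent:
    domain: str      # e.g. "traffic", "air-quality" (assumed field names)
    location: str    # shared spatial key used for cross-domain joins
    severity: float  # normalised 0..1 score from event detection
    quality: float   # quality-of-information score from quality assessment

def combine(events, min_quality=0.6):
    """Fuse events from different domains that refer to the same location,
    keeping only observations with a sufficiently high quality score."""
    by_location = {}
    for e in events:
        if e.quality < min_quality:
            continue  # contextual filtering: drop unreliable observations
        by_location.setdefault(e.location, []).append(e)
    # Simple decision rule: flag locations where several domains report problems.
    return {loc: evs for loc, evs in by_location.items()
            if len({e.domain for e in evs}) > 1 and max(e.severity for e in evs) > 0.5}

alerts = combine([
    CityEvent("traffic", "junction-42", severity=0.8, quality=0.9),
    CityEvent("air-quality", "junction-42", severity=0.6, quality=0.7),
])
print(alerts)  # {'junction-42': [...]} -> candidate for a citizen-facing alert
```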

199 citations

Journal ArticleDOI
TL;DR: A survey of the requirements, solutions, and challenges in the area of information abstraction is provided, together with an efficient workflow for extracting meaningful information from raw sensor data based on the current state of the art in this area.
Abstract: The term Internet of Things (IoT) refers to the interaction and communication between billions of devices that produce and exchange data related to real-world objects (i.e. things). Extracting higher level information from the raw sensory data captured by the devices and representing this data as machine-interpretable or human-understandable information has several interesting applications. Deriving raw data into higher level information representations demands mechanisms to find, extract, and characterize meaningful abstractions from the raw data. This meaningful abstractions then have to be presented in a human and/or machine-understandable representation. However, the heterogeneity of the data originated from different sensor devices and application scenarios such as e-health, environmental monitoring, and smart home applications, and the dynamic nature of sensor data make it difficult to apply only one particular information processing technique to the underlying data. A considerable amount of methods from machine-learning, the semantic web, as well as pattern and data mining have been used to abstract from sensor observations to information representations. This paper provides a survey of the requirements and solutions and describes challenges in the area of information abstraction and presents an efficient workflow to extract meaningful information from raw sensor data based on the current state-of-the-art in this area. This paper also identifies research directions at the edge of information abstraction for sensor data. To ease the understanding of the abstraction workflow process, we introduce a software toolkit that implements the introduced techniques and motivates to apply them on various data sets.
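As a minimal sketch of the raw-data-to-abstraction workflow this survey describes, the snippet below segments a stream into windows, extracts simple features, and maps them to human-readable labels. The window size, features, and label thresholds are assumptions for illustration, not values prescribed by the paper.

```python
# Sketch of a simple abstraction workflow: segment -> extract features -> label.
import numpy as np

def abstract_stream(readings, window=10):
    labels = []
    for start in range(0, len(readings) - window + 1, window):
        segment = np.asarray(readings[start:start + window])
        mean, spread = segment.mean(), segment.std()
        # Map numeric features to a symbolic, human-understandable abstraction.
        if spread > 2.0:
            labels.append("unstable")
        elif mean > 25.0:
            labels.append("hot")
        else:
            labels.append("normal")
    return labels

temperatures = list(np.random.normal(24, 1, 50)) + list(np.random.normal(30, 1, 50))
print(abstract_stream(temperatures))  # e.g. ['normal', ..., 'hot', ...]
```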

139 citations

Proceedings ArticleDOI
01 Sep 2014
TL;DR: A framework for real-time semantic annotation of streaming IoT data to support dynamic integration into the Web using the Advanced Message Queuing Protocol (AMQP) will enable delivery of large volumes of data that can influence the performance of the smart city systems that use IoT data.
Abstract: Internet of Things is a generic term that refers to the interconnection of real-world services provided by smart objects and sensors that enable interaction with the physical world. Cities are also evolving into large interconnected ecosystems in an effort to improve the sustainability and operational efficiency of city services and infrastructure. However, it is often difficult to perform real-time analysis of the large amounts of heterogeneous data and sensory information provided by various sources. This paper describes a framework for real-time semantic annotation of streaming IoT data to support dynamic integration into the Web using the Advanced Message Queuing Protocol (AMQP). This will enable the delivery of large volumes of data, which can influence the performance of the smart city systems that use IoT data. We present an information model to represent summarisation and reliability of stream data. The framework is evaluated in terms of data size and average message exchange time using summarised and raw sensor data. Based on a statistical analysis, a detailed comparison between various sensor points is made to investigate the memory and computational cost of the stream annotation framework.
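For readers unfamiliar with AMQP, the following sketch shows what publishing an annotated sensor observation over AMQP looks like with the pika client. The broker address, exchange name, routing key, and annotation fields are assumptions for illustration; the paper's actual information model for summarisation and reliability is richer than this.

```python
# Minimal sketch: publish a semantically annotated observation over AMQP (pika).
import json
import pika

connection = pika.BlockingConnection(pika.ConnectionParameters(host="localhost"))
channel = connection.channel()
channel.exchange_declare(exchange="iot.annotated", exchange_type="topic")

observation = {
    "sensorId": "urn:sensor:traffic:42",     # hypothetical identifier
    "observedProperty": "vehicleCount",
    "value": 17,
    "unit": "vehicles/min",
    "timestamp": "2014-09-01T08:00:00Z",
    "reliability": 0.93,                     # stream-quality annotation
    "summary": "aabbcc",                     # placeholder for a summarised form
}

channel.basic_publish(
    exchange="iot.annotated",
    routing_key="traffic.city-centre.42",
    body=json.dumps(observation),
)
connection.close()
```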

108 citations

Journal ArticleDOI
TL;DR: This work proposes a method that determines how many different clusters can be found in a stream based on the data distribution, and demonstrates how the number of clusters in a real-world data stream can be determined by analyzing its data distribution.
Abstract: The emergence of the Internet of Things (IoT) has led to the production of huge volumes of real-world streaming data. We need effective techniques to process IoT data streams and to gain insights and actionable information from real-world observations and measurements. Most existing approaches are application or domain dependent. We propose a method which determines how many different clusters can be found in a stream based on the data distribution. After selecting the number of clusters, we use an online clustering mechanism to cluster the incoming data from the streams. Our approach remains adaptive to drifts by adjusting itself as the data changes. We benchmark our approach against state-of-the-art stream clustering algorithms on data streams with data drift. We show how our method can be applied in a use case scenario involving near real-time traffic data. Our results allow us to cluster, label, and interpret IoT data streams dynamically according to the data distribution. This enables adaptive online processing of large volumes of dynamic data based on the current situation. We show how our method adapts itself to the changes. We demonstrate how the number of clusters in a real-world data stream can be determined by analyzing the data distributions.
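The sketch below illustrates the general idea only: estimate a suitable number of clusters from an initial sample of the stream, then cluster arriving batches online so the centroids can drift with the data. Silhouette-based selection and MiniBatchKMeans are stand-ins chosen for illustration, not the exact method proposed in the paper.

```python
# Illustrative sketch: pick k from the data distribution, then cluster online.
import numpy as np
from sklearn.cluster import MiniBatchKMeans
from sklearn.metrics import silhouette_score

rng = np.random.default_rng(0)
initial_sample = np.vstack([rng.normal(c, 0.3, size=(200, 2)) for c in (0, 3, 6)])

# 1) Choose the number of clusters from an initial sample of the stream.
scores = {}
for k in range(2, 8):
    labels = MiniBatchKMeans(n_clusters=k, n_init=3, random_state=0).fit_predict(initial_sample)
    scores[k] = silhouette_score(initial_sample, labels)
best_k = max(scores, key=scores.get)

# 2) Cluster the incoming stream online; partial_fit lets the centroids adapt,
#    giving a degree of robustness to drift in the data.
model = MiniBatchKMeans(n_clusters=best_k, random_state=0)
for _ in range(100):                       # stand-in for reading stream batches
    batch = rng.normal(rng.choice([0, 3, 6]), 0.3, size=(32, 2))
    model.partial_fit(batch)

print(best_k, model.cluster_centers_.round(2))
```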

86 citations

Journal ArticleDOI
TL;DR: A framework for real-time semantic annotation and aggregation of data streams to support dynamic integration into the Web using the Advanced Message Queuing Protocol; the results suggest that, regardless of the segmentation approach used, it is desirable to find the optimal data aggregation parameters in order to reduce energy consumption and improve data aggregation quality.
Abstract: With the growing popularity of information and communications technologies and information sharing and integration, cities are evolving into large interconnected ecosystems by using smart objects and sensors that enable interaction with the physical world. However, it is often difficult to perform real-time analysis of the large amounts of heterogeneous data and sensory information provided by various sources. This paper describes a framework for real-time semantic annotation and aggregation of data streams to support dynamic integration into the Web using the Advanced Message Queuing Protocol (AMQP). We provide a comprehensive analysis of the effect of adaptive and nonadaptive window sizes in the segmentation of time series using the SensorSAX and symbolic aggregate approximation (SAX) approaches for data streams with different variations and sampling rates in real-time processing. The framework is evaluated over three parameters, namely the window size parameter of the SAX algorithm and the sensitivity level and minimum window size parameters of the SensorSAX algorithm, based on the average data aggregation and annotation time, CPU consumption, data size, and data reconstruction rate. Based on a statistical analysis, a detailed comparison between various sensor points is made to investigate the memory and computational cost of the stream-processing framework. Our results suggest that, regardless of the segmentation approach used, each geographically distinct sensory environment has a different level of dynamicity, and it is therefore desirable to find the optimal data aggregation parameters in order to reduce energy consumption and improve data aggregation quality.
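As background for the SAX segmentation referenced in this abstract, the following from-scratch sketch z-normalises a window, reduces it with piecewise aggregate approximation (PAA), and maps each segment mean to a symbol using Gaussian breakpoints. The window and alphabet sizes are illustrative, and SensorSAX's adaptive window selection is not shown.

```python
# Minimal SAX sketch: z-normalise, PAA, then symbolise with Gaussian breakpoints.
import numpy as np
from scipy.stats import norm

def sax(window, n_segments=4, alphabet="abcd"):
    x = np.asarray(window, dtype=float)
    x = (x - x.mean()) / (x.std() + 1e-12)                 # z-normalisation
    paa = x.reshape(n_segments, -1).mean(axis=1)           # PAA: per-segment means
    # Breakpoints that split the standard normal into equiprobable regions.
    breakpoints = norm.ppf(np.linspace(0, 1, len(alphabet) + 1)[1:-1])
    return "".join(alphabet[np.searchsorted(breakpoints, v)] for v in paa)

print(sax([1, 1, 2, 2, 8, 9, 9, 10]))  # 'aadd': low segments map to early letters
```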

36 citations


Cited by
Journal ArticleDOI
TL;DR: A combined IoT-based system for smart city development and urban planning using Big Data analytics is proposed, consisting of various types of sensor deployments, including smart home sensors, vehicular networking, weather and water sensors, smart parking sensors, and surveillance objects.

701 citations

Journal ArticleDOI
TL;DR: This article assesses the different machine learning methods that deal with the challenges in IoT data by considering smart cities as the main use case and presents a taxonomy of machine learning algorithms explaining how different techniques are applied to the data in order to extract higher level information.

690 citations

Journal ArticleDOI
TL;DR: This paper comprehensively presents a tutorial on three typical edge computing technologies, namely mobile edge computing, cloudlets, and fog computing, and the standardization efforts, principles, architectures, and applications of these three technologies are summarized and compared.

442 citations

Journal ArticleDOI
TL;DR: A review is conducted to map the research landscape of smart homes based on the Internet of Things into a coherent taxonomy, identifying the basic characteristics of this emerging field in the following aspects: the motivation for using IoT in smart home applications, open challenges hindering utilization, and recommendations to improve the acceptance and use of smart home IoT applications in the literature.

413 citations

Journal ArticleDOI
TL;DR: This paper provides a list of criteria for selecting machine learning tools for big data, along with an analysis of the advantages and drawbacks of three different processing paradigms and a comparison of engines that implement them, including MapReduce, Spark, Flink, Storm, and H2O.
Abstract: With an ever-increasing amount of options, the task of selecting machine learning tools for big data can be difficult. The available tools have advantages and drawbacks, and many have overlapping uses. The world’s data is growing rapidly, and traditional tools for machine learning are becoming insufficient as we move towards distributed and real-time processing. This paper is intended to aid the researcher or professional who understands machine learning but is inexperienced with big data. In order to evaluate tools, one should have a thorough understanding of what to look for. To that end, this paper provides a list of criteria for making selections along with an analysis of the advantages and drawbacks of each. We do this by starting from the beginning, and looking at what exactly the term “big data” means. From there, we go on to the Hadoop ecosystem for a look at many of the projects that are part of a typical machine learning architecture and an understanding of how everything might fit together. We discuss the advantages and disadvantages of three different processing paradigms along with a comparison of engines that implement them, including MapReduce, Spark, Flink, Storm, and H2O. We then look at machine learning libraries and frameworks, including Mahout, MLlib, and SAMOA, and evaluate them based on criteria such as scalability, ease of use, and extensibility. There is no single toolkit that truly embodies a one-size-fits-all solution, so this paper aims to help make decisions smoother by providing as much information as possible and quantifying what the tradeoffs will be. Additionally, throughout this paper, we review recent research in the field using these tools and talk about possible future directions for toolkit-based learning.
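To give a feel for the toolkit-based learning the survey compares, here is a small example using one of the libraries it discusses, Spark's MLlib (the DataFrame-based pyspark.ml API), run locally. The data and parameters are made up; this is only an illustration of the API surface, not an endorsement of any particular engine.

```python
# Illustrative example: k-means with Spark MLlib on a tiny local DataFrame.
from pyspark.sql import SparkSession
from pyspark.ml.feature import VectorAssembler
from pyspark.ml.clustering import KMeans

spark = SparkSession.builder.master("local[*]").appName("mllib-demo").getOrCreate()

df = spark.createDataFrame(
    [(0.0, 0.1), (0.2, 0.0), (9.0, 9.1), (9.2, 8.9)],
    ["x", "y"],
)
# Assemble raw columns into the "features" vector column MLlib estimators expect.
features = VectorAssembler(inputCols=["x", "y"], outputCol="features").transform(df)

model = KMeans(k=2, seed=1).fit(features)
print(model.clusterCenters())  # two centroids, roughly (0.1, 0.05) and (9.1, 9.0)

spark.stop()
```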

379 citations