Spark (mathematics)

About: Spark (mathematics) is a research topic. Over its lifetime, 7,304 publications have been published within this topic, receiving 63,322 citations.

Papers published on a yearly basis (chart; x-axis: year, 1968 to 2022)

Papers

Journal Article

A Methodology for Spark Parameter Tuning

Anastasios Gounaris¹, Jordi Torres²

Aristotle University of Thessaloniki¹, Polytechnic University of Catalonia²

19 May 2017-Big Data Research

TL;DR: An alternative systematic methodology for parameter tuning is proposed that can be easily applied to any computing infrastructure; it is shown to yield comparable if not better results than the initial one when applied to MN3, with observed speedups in the validating test case studies starting from 20%.

56 citations

Journal Article

Historical Advances in Spark Emission Spectroscopy

John P. Walters¹

University of Wisconsin-Madison¹

01 Jul 1969-Applied Spectroscopy

TL;DR: In this article, major developments in the understanding of the physics and chemistry of the atmospheric pressure spark discharge are presented and commented upon, including work in the areas of equipment, initial gap breakdown, spark channel formation, electrode sampling phenomena, sample propagation phenomena, excited state production, related plasma physics, and counter electrode phenomena.

Abstract: Major developments in the understanding of the physics and chemistry of the atmospheric pressure spark discharge are presented and commented upon. These include work in the areas of equipment, initial gap breakdown, spark channel formation, electrode sampling phenomena, sample propagation phenomena, excited state production, related plasma physics, and counter electrode phenomena.

56 citations

Proceedings Article

Large interactive visualization of density functions on big data infrastructure

Alexandre Perrot¹, Romain Bourqui¹, Nicolas Hanusse¹, Frédéric Lalanne¹, David Auber¹

L'Abri¹

25 Oct 2015

TL;DR: This paper presents a complete architecture which fully fits into the Big Data paradigm and so enables interactive visualization of heatmaps at ultra-scale and an adaptive GPU based method for kernel density estimation is proposed.

Abstract: Point set visualization is required in lots of visualization techniques. Scatter plots as well as geographic heat-maps are straightforward examples. Data analysts are now well trained to use such visualization techniques. The availability of larger and larger datasets raises the need to make these techniques scale as fast as the data grows. The Big Data Infrastructure offers the possibility to scale horizontally. Designing point set visualization methods that fit into that new paradigm is thus a crucial challenge. In this paper, we present a complete architecture which fully fits into the Big Data paradigm and so enables interactive visualization of heatmaps at ultra-scale. A new distributed algorithm for multi-scale aggregation of point set is given and an adaptive GPU based method for kernel density estimation is proposed. A complete prototype working with Hadoop, HBase, Spark and WebGL has been implemented. We give a benchmark of our solution on a dataset having more than 2 billion points.

56 citations

Journal Article

Feedback linearization of spark-ignition engines with continuously variable transmissions

Lino Guzzella¹, A.M. Schmid¹

ETH Zurich¹

01 Mar 1995-IEEE Transactions on Control Systems and Technology

TL;DR: This paper discusses a possible approach to the control of this type of drive-train structures for a specific operating condition ("high-power regime") using feedback linearization and a "kick-down"-controller.

Abstract: Replacing conventional gear-boxes with continuously variable transmissions (CVT's) can reduce the fuel-consumption of spark ignition engines significantly. A possible approach to the control of this type of drive-train structures for a specific operating condition ("high-power regime") is discussed in this paper. In the first part the plant dynamics are exactly linearized over the complete operating range using feedback linearization. Much attention is paid to the existence conditions for this nonlinear part. In the second part, as an application of the exact linearization approach, a "kick-down"-controller is designed. Simulations show that combining the two controllers yields good transient behavior and robustness of the closed-loop system.

55 citations

Proceedings Article

StreamApprox: approximate computing for stream analytics

Do Le Quoc¹, Ruichuan Chen², Pramod Bhatotia³, Christof Fetzer¹, Volker Hilt², Thorsten Strufe¹

Dresden University of Technology¹, Bell Labs², University of Edinburgh³

11 Dec 2017

TL;DR: An online stratified reservoir sampling algorithm to produce approximate output with rigorous error bounds is designed and can be applied to two prominent types of stream processing systems: (1) batched stream processing such as Apache Spark Streaming, and (2) pipelined stream processing such as Apache Flink.

Abstract: Approximate computing aims for efficient execution of workflows where an approximate output is sufficient instead of the exact output. The idea behind approximate computing is to compute over a representative sample instead of the entire input dataset. Thus, approximate computing, based on the chosen sample size, can make a systematic trade-off between the output accuracy and computation efficiency. Unfortunately, the state-of-the-art systems for approximate computing primarily target batch analytics, where the input data remains unchanged during the course of computation. Thus, they are not well-suited for stream analytics. This motivated the design of StreamApprox, a stream analytics system for approximate computing. To realize this idea, we designed an online stratified reservoir sampling algorithm to produce approximate output with rigorous error bounds. Importantly, our proposed algorithm is generic and can be applied to two prominent types of stream processing systems: (1) batched stream processing such as Apache Spark Streaming, and (2) pipelined stream processing such as Apache Flink. To showcase the effectiveness of our algorithm, we implemented StreamApprox as a fully functional prototype based on Apache Spark Streaming and Apache Flink. We evaluated StreamApprox using a set of microbenchmarks and real-world case studies. Our results show that Spark- and Flink-based StreamApprox systems achieve a speedup of 1.15× to 3× compared to the respective native Spark Streaming and Flink executions, with varying sampling fraction of 80% to 10%. Furthermore, we have also implemented an improved baseline in addition to the native execution baseline: a Spark-based approximate computing system leveraging the existing sampling modules in Apache Spark. Compared to the improved baseline, our results show that StreamApprox achieves a speedup of 1.1× to 2.4× while maintaining the same accuracy level.

55 citations
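
Illustration: the StreamApprox abstract above mentions online stratified reservoir sampling. The listing gives no algorithmic detail, so the following is only a minimal textbook-style sketch of the general idea, not the authors' algorithm: one fixed-size reservoir per stratum, filled with the classic reservoir sampling rule, plus a weighted sum estimate. The class name `StratifiedReservoir`, its methods, and the `capacity` parameter are hypothetical names chosen for this sketch, and the error bounds described in the abstract are not implemented.

```python
import random
from collections import defaultdict


class StratifiedReservoir:
    """Hypothetical helper: a fixed-size uniform sample per stratum of a stream."""

    def __init__(self, capacity, seed=None):
        self.capacity = capacity            # max items kept per stratum
        self.rng = random.Random(seed)      # private RNG for reproducibility
        self.samples = defaultdict(list)    # stratum -> sampled items
        self.seen = defaultdict(int)        # stratum -> items observed so far

    def add(self, stratum, value):
        """Process one stream item in O(1) time."""
        self.seen[stratum] += 1
        reservoir = self.samples[stratum]
        if len(reservoir) < self.capacity:
            # Reservoir not yet full: keep every item.
            reservoir.append(value)
        else:
            # Classic reservoir rule: the i-th item replaces a random slot
            # with probability capacity / i.
            j = self.rng.randrange(self.seen[stratum])
            if j < self.capacity:
                reservoir[j] = value

    def estimate_sum(self):
        """Estimate the stream total by scaling each stratum's sample sum."""
        total = 0.0
        for stratum, reservoir in self.samples.items():
            weight = self.seen[stratum] / len(reservoir)
            total += weight * sum(reservoir)
        return total
```

In this sketch a stratum would correspond to something like an input sub-stream or source; keeping a separate reservoir per stratum means a low-volume stratum is not crowded out of the sample by a high-volume one.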

Performance

Metrics

Papers: 7,304

Citations: 74,604

No. of papers in the topic in previous years
Year	Papers
2022	10
2021	429
2020	525
2019	661
2018	758
2017	683
