Spark (mathematics)

About: Spark (mathematics) is a research topic. Over its lifetime, 7,304 publications have been published on this topic, receiving 63,322 citations.

Papers published on a yearly basis

[Chart: number of papers published per year, 1968 to 2022]

Papers

Proceedings Article

A Comparative Survey of the HPC and Big Data Paradigms: Analysis and Experiments

Hamidreza Asaadi¹, Dounia Khaldi², Barbara Chapman²

Stony Brook University¹, University of Houston²

01 Sep 2016

TL;DR: This paper presents a data-supported, comparative survey of the main current HPC and Big Data programming interfaces, namely MPI, OpenMP, PGAS (OpenSHMEM), Spark, and Hadoop, and their software stacks and a comprehensive experimental study of these interfaces on a set of benchmarks.

Abstract: Many scientific data analytic applications need huge amounts of input, which can often consist of more than several TBs of data. This emphasizes the high I/O and processing/computational cost requirements of these algorithms. Tasks in these programs can induce more I/O operations than computations or the opposite. Hardware also includes nodes with large storage devices and/or nodes with sophisticated computational capabilities. To embrace the heterogeneity of the hardware systems in non-cloud and cloud environments, the issues of resource and job allocation in these environments need to be revisited. High-Performance Computing models, under the leadership of MPI (plus OpenMP) parallel APIs, have mostly met users' requirements in terms of high computational performance, while Big Data frameworks such as Spark have performed likewise in terms of high-level programming, resiliency and I/O handling. Therefore, in order to meet the specialized needs of scientists, there is a need for convergence between HPC and Big Data ecosystems. This paper presents a data-supported, comparative survey of the main current HPC and Big Data programming interfaces, namely MPI, OpenMP, PGAS (OpenSHMEM), Spark, and Hadoop, and their software stacks. A comprehensive experimental study of these interfaces on a set of benchmarks, namely reduction and I/O microbenchmarks, the StackExchange AnswersCount benchmark, and PageRank Benchmark has been performed on a single platform in order to achieve a fair comparison. These experiments lead to a thorough discussion about whether the envisioned convergence is needed or not, efficient or not, and whether it is the best solution to tackle future computational challenges.

23 citations

Journal Article

Automatically generated zonal models for building air flow simulation: principles and applications

Marjorie Musy¹, Frederick Winkelmann², Etienne Wurtz³, Anne Sergent

École Normale Supérieure¹, University of La Rochelle², Lawrence Berkeley National Laboratory³

01 Aug 2002-Building and Environment

TL;DR: In this article, the authors present a model-generating tool called GenSPARK, which constructs a zonal model of an entire building by assembling the appropriate modules, and solves the set of equations resulting from this construction to obtain the air flow and temperature distribution in the building.

23 citations

Proceedings Article

Sooting Tendencies in an Air-Forced Direct Injection Spark-Ignition (DISI) Engine

M. Matti Maricq¹, Ruben H. Munoz¹, Jialin Yang¹, Richard W. Anderson¹

Ford Motor Company¹

06 Mar 2000-SAE transactions

23 citations

Journal Article

An experimental and analytical examination of the combustion period for gas-fuelled spark ignition engine applications

Bade S. O. Shrestha¹, G. A. Karim¹

University of Calgary¹

01 Feb 2001

TL;DR: In this article, a predictive procedure is described for determining the effective time period needed to complete the energy release by combustion from the moment of flame initiation by a spark to the completion of flame propagation in a spark ignition engine while using a number of gaseous fuels and some of their mixtures.

Abstract: A predictive procedure is described for determining the effective time period needed to complete the energy release by combustion from the moment of flame initiation by a spark to the completion of flame propagation in a spark ignition engine while using a number of gaseous fuels and some of their mixtures. These predicted values of the combustion period when used in a relatively simple modelling procedure can produce predicted values of key engine performance parameters that compare well with the corresponding experimentally obtained values.

23 citations

Proceedings Article

Collaborative filtering recommendation algorithm based on Hadoop and Spark

Bartosz Kupisz¹, Olgierd Unold¹

Wrocław University of Technology¹

17 Mar 2015

TL;DR: The aim of this work was to develop and compare recommendation systems which use the item-based collaborative filtering algorithm, based on Hadoop and Spark, and the Tanimoto coefficient which provides the most precise results for the available data.

Abstract: The aim of this work was to develop and compare recommendation systems which use the item-based collaborative filtering algorithm, based on Hadoop and Spark. Data for the research were gathered from a real social portal whose users can express their preferences regarding the applications on offer. The Hadoop version was implemented with the use of the Mahout library, which was an element of the Hadoop ecosystem. The authors' original solution was implemented with the use of the Apache Spark platform and the Scala programming language. The applied similarity measure was the Tanimoto coefficient, which provides the most precise results for the available data. The initial assumptions were confirmed, as the solution based on the Apache Spark platform turned out to be more efficient.
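
For readers unfamiliar with the similarity measure named in this abstract, the following is a minimal sketch of the Tanimoto coefficient applied to item-based similarity. It is not the authors' Mahout or Spark/Scala implementation; the function names (`tanimoto`, `most_similar`) and the toy preference data are assumptions made only for illustration. For two items, the coefficient is the number of users who prefer both divided by the number of users who prefer at least one.

```python
from typing import Dict, List, Set, Tuple


def tanimoto(a: Set[str], b: Set[str]) -> float:
    """Tanimoto coefficient of two sets: |A & B| / |A | B|."""
    union = len(a | b)
    if union == 0:
        # Two empty sets: treat similarity as 0.0 (an assumption of this sketch).
        return 0.0
    return len(a & b) / union


def most_similar(item: str, prefs: Dict[str, Set[str]]) -> List[Tuple[str, float]]:
    """Rank all other items by Tanimoto similarity to `item`, highest first."""
    scores = [
        (other, tanimoto(prefs[item], users))
        for other, users in prefs.items()
        if other != item
    ]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


# Hypothetical data: each item maps to the set of users who expressed a preference.
prefs = {
    "app_a": {"u1", "u2", "u3"},
    "app_b": {"u2", "u3", "u4"},
    "app_c": {"u5"},
}

ranking = most_similar("app_a", prefs)
print(ranking)
```

In an item-based recommender, a ranking like this would typically be used to suggest items similar to those a user already prefers; a distributed version would compute the same pairwise quantity across partitions.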

23 citations

Performance Metrics

Papers: 7,304
Citations: 74,604

No. of papers in the topic in previous years
Year	Papers
2022	10
2021	429
2020	525
2019	661
2018	758
2017	683
