scispace - formally typeset
Search or ask a question
Topic

Spark (mathematics)

About: Spark (mathematics) is a research topic. Over the lifetime, 7304 publications have been published within this topic receiving 63322 citations.


Papers
More filters
Patent
01 Feb 1944
TL;DR: In this article, an object of the invention is to improve spark plug electrodes and the plugs in which they are incorporated. But this is not the main object of this paper, it is more related to spark plugs and electrodes therefor.
Abstract: This invention relates to spark plugs and electrodes therefor. An object of the invention is to improve spark plug electrodes and the plugs in which they are incorporated. Other objects of the invention will be apparent from the description and claims, In the drawing: Figure 1 shows a spark...

19 citations

Proceedings ArticleDOI
25 Mar 2017
TL;DR: The performance study shows that DFPS algorithm is more excellent than YAFIM, especially when the length of transactions is long, the number of items is large and the data is massive, and DFPS has an excellent scalability.
Abstract: Frequent Itemset Mining (FIM) is the most important and time-consuming step of association rules mining. With the increment of data scale, many efficient single-machine algorithms of FIM, such as FP-growth and Apriori, cannot accomplish the computing tasks within reasonable time. As a result of the limitation of single-machine methods, researchers presented some distributed algorithms based on MapReduce and Spark, such as PFP and YAFIM. Nevertheless, the heavy disk I/O cost at each MapReduce operation makes PFP not efficient enough. YAFIM needs to generate candidate frequent itemsets in each iterative step. It makes YAFIM time-consuming. And if the scale of data is large enough, YAFIM algorithm will not work due to the limitation of memory since the candidate frequent itemsets need to be stored in the memory. And the size of candidate itemsets is very large especially facing the massive data. In this work, we propose a distributed FP-growth algorithm based on Spark, we call it DFPS. DFPS partitions computing tasks in such a way that each computing node builds the conditional FP-tree and adopts a pattern fragment growth method to mine the frequent itemsets independently. DFPS doesn't need to pass messages between nodes during mining frequent itemsets. Our performance study shows that DFPS algorithm is more excellent than YAFIM, especially when the length of transactions is long, the number of items is large and the data is massive. And DFPS has an excellent scalability. The experimental results show that DFPS is more than 10 times faster than YAFIM for T10I4D100K dataset and Pumsb_star dataset.

18 citations

Journal ArticleDOI
TL;DR: In this article, a simulation process for spark ignition gasoline engines is proposed, based on a zero-dimensional spark ignition stochastic reactor model and three-dimensional computational fluid dynamics of the cold in-cylinder flow.
Abstract: A simulation process for spark ignition gasoline engines is proposed. The process is based on a zero-dimensional spark ignition stochastic reactor model and three-dimensional computational fluid dynamics of the cold in-cylinder flow. The cold flow simulations are carried out to analyse changes in the turbulent kinetic energy and its dissipation. From this analysis, the volume-averaged turbulent mixing time can be estimated that is a main input parameter for the spark ignition stochastic reactor model. The spark ignition stochastic reactor model is used to simulate combustion progress and to analyse auto-ignition tendency in the end-gas zone based on the detailed reaction kinetics. The presented engineering process bridges the gap between three-dimensional and zero-dimensional models and is applicable to various engine concepts, such as, port-injected and direct injection engines, with single and multiple spark plug technology. The modelling enables predicting combustion effects and estimating the risk of ...

18 citations

Proceedings ArticleDOI
16 Apr 2018
TL;DR: This paper provides an overview of the existing works dealing with efficient query answering, in the area of RDF data, using Apache Spark, and discusses on the characteristics and the key dimension of such systems.
Abstract: The explosion of the web and the abundance of linked data demand for effective and efficient methods for storage, management and querying. More specifically, the ever-increasing size and number of RDF data collections raises the need for efficient query answering, and dictates the usage of distributed data management systems for effectively partitioning and querying them. To this direction, Apache Spark is one of the most active big-data approaches, with more and more systems adopting it, for efficient, distributed data management. The purpose of this paper is to provide an overview of the existing works dealing with efficient query answering, in the area of RDF data, using Apache Spark. We discuss on the characteristics and the key dimension of such systems, we describe novel ideas in the area, and the corresponding drawbacks, and provide directions for future work.

18 citations


Network Information
Related Topics (5)
Software
130.5K papers, 2M citations
76% related
Combustion
172.3K papers, 1.9M citations
72% related
Cluster analysis
146.5K papers, 2.9M citations
72% related
Cloud computing
156.4K papers, 1.9M citations
71% related
Hydrogen
132.2K papers, 2.5M citations
69% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
202210
2021429
2020525
2019661
2018758
2017683