Author

Valentin Kravtsov

Bio: Valentin Kravtsov is an academic researcher from Technion – Israel Institute of Technology. The author has contributed to research in the topics of grid and grid computing. The author has an h-index of 7 and has co-authored 10 publications receiving 207 citations.

Papers
Journal ArticleDOI
01 Apr 2008
TL;DR: The DataMiningGrid system provides tools and services facilitating the grid-enabling of data mining applications without any intervention on the application side and critical features of the system include flexibility, extensibility, scalability, efficiency, conceptual simplicity and ease of use.
Abstract: The DataMiningGrid system has been designed to meet the requirements of modern and distributed data mining scenarios. Based on the Globus Toolkit and other open technology and standards, the DataMiningGrid system provides tools and services facilitating the grid-enabling of data mining applications without any intervention on the application side. Critical features of the system include flexibility, extensibility, scalability, efficiency, conceptual simplicity and ease of use. The system has been developed and evaluated on the basis of a diverse set of use cases from different sectors in science and technology. The DataMiningGrid software is freely available under Apache License 2.0.

88 citations

Journal ArticleDOI
TL;DR: The authors developed the DataMiningGrid system, which integrates a diverse set of programs and application scenarios within a single framework, and features scalability, flexible extensibility, sophisticated support for relevant standards and different users.
Abstract: As modern data mining applications increase in complexity, so too do their demands for resources. Grid computing is one of several emerging networked computing paradigms promising to meet the requirements of heterogeneous, large-scale, and distributed data mining applications. Despite this promise, there are still too many issues to be resolved before grid technology is commonly applied to large-scale data mining tasks. To address some of these issues, the authors developed the DataMiningGrid system. It integrates a diverse set of programs and application scenarios within a single framework, and features scalability, flexible extensibility, sophisticated support for relevant standards and different users.

30 citations

Proceedings ArticleDOI
15 Jun 2009
TL;DR: A simple yet powerful methodology for application-agnostic diagnosis and remediation of performance hot spots in elastic multi-tiered client/server applications, deployed as collections of black-box Virtual Machines (VMs).
Abstract: In this work we present a simple yet powerful methodology for application-agnostic diagnosis and remediation of performance hot spots in elastic multi-tiered client/server applications, deployed as collections of black-box Virtual Machines (VMs). Our novel out-of-band black-box performance management system, Network Analysis for Remediating Performance Bottlenecks (NAP), listens to the TCP/IP traffic on the virtual network interfaces of the VMs comprising an application and analyzes statistical properties of this traffic. From this analysis, which is application-independent and transparent to the VMs, NAP identifies performance bottlenecks that might affect application performance and derives remediation decisions that are most likely to alleviate the application performance degradation. We prototyped our solution for the Xen hypervisor and evaluated it using the popular Trade6 benchmark, which simulates a typical e-commerce application. Our results show that NAP successfully identifies performance bottlenecks in a complex multi-tier application setting, while incurring negligible performance overhead.
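The abstract does not give NAP's algorithm, but the core idea (inferring a bottleneck tier purely from network-level statistics, without instrumenting the VMs) can be illustrated with a hypothetical sketch. All names here are illustrative: we assume per-tier counters of requests in and responses out per sampling interval, and flag the tier whose backlog grows fastest.

```python
# Hypothetical sketch of black-box bottleneck detection in the spirit of NAP:
# each tier is observed only through network counters (requests_in,
# responses_out) per sampling interval; the tier whose request backlog
# grows fastest is reported as the likely bottleneck.

def find_bottleneck(tier_samples):
    """tier_samples: {tier_name: [(requests_in, responses_out), ...]}.

    Returns the tier with the largest average backlog growth per
    interval, or None if no tier's backlog is growing.
    """
    worst_tier, worst_growth = None, 0.0
    for tier, samples in tier_samples.items():
        # cumulative requests that entered the tier but were never answered
        backlog = sum(r_in - r_out for r_in, r_out in samples)
        growth = backlog / len(samples)  # average net queue growth per interval
        if growth > worst_growth:
            worst_tier, worst_growth = tier, growth
    return worst_tier
```

A remediation step would then, for example, scale out or migrate the flagged tier; the sketch deliberately stays application-independent, mirroring the black-box premise of the paper.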

28 citations

Proceedings ArticleDOI
09 Dec 2009
TL;DR: The concept of topology-aware grid applications is derived from parallelized computational models of complex systems that are executed on heterogeneous resources, either because they require specialized hardware for certain calculations, or because their parallelization is flexible enough to exploit such resources.
Abstract: The concept of topology-aware grid applications is derived from parallelized computational models of complex systems that are executed on heterogeneous resources, either because they require specialized hardware for certain calculations, or because their parallelization is flexible enough to exploit such resources. Here we describe two such applications: a multi-body simulation of stellar evolution, and an evolutionary algorithm that is used for reverse-engineering gene regulatory networks. We then describe the topology-aware middleware we have developed to facilitate the "modeling-implementing-executing" cycle of complex systems applications. The developed middleware allows topology-aware simulations to run on geographically distributed clusters with or without firewalls between them. Additionally, we describe advanced co-allocation and scheduling techniques that take the applications' topologies into account. Results are given based on running the topology-aware applications on the Grid'5000 infrastructure.

17 citations

Proceedings Article
01 Jan 2008
TL;DR: This work designs an architecture for a quasi-opportunistic supercomputer within the EU-supported project QosCosGrid, and presents the results obtained from studying and identifying the requirements a grid needs to meet in order to facilitate quasi-opportunistic supercomputing.
Abstract: Grids are becoming mission-critical components in research and industry, offering sophisticated solutions for leveraging large-scale computing and storage resources. Grid resources are usually shared among multiple organizations in an opportunistic manner. However, an opportunistic or "best effort" quality-of-service scheme may be inadequate in situations where a large number of resources must be allocated, and for applications that rely on static, stable execution environments. The goal of this work is to implement what we refer to as quasi-opportunistic supercomputing. A quasi-opportunistic supercomputer facilitates demanding parallel computing applications on the basis of massive, non-dedicated resources in grid computing environments. Within the EU-supported project QosCosGrid we are developing a quasi-opportunistic supercomputer. In this work we present the results obtained from studying and identifying the requirements a grid needs to meet in order to facilitate quasi-opportunistic supercomputing. Based on these requirements we have designed an architecture for a quasi-opportunistic supercomputer. The paper presents and discusses this architecture.

14 citations


Cited by
Proceedings Article
01 Jan 2003

1,212 citations

Proceedings ArticleDOI
26 Oct 2011
TL;DR: CloudScale is a system that automates fine-grained elastic resource scaling for multi-tenant cloud computing infrastructures that can achieve significantly higher SLO conformance than other alternatives with low resource and energy cost.
Abstract: Elastic resource scaling lets cloud systems meet application service level objectives (SLOs) with minimum resource provisioning costs. In this paper, we present CloudScale, a system that automates fine-grained elastic resource scaling for multi-tenant cloud computing infrastructures. CloudScale employs online resource demand prediction and prediction error handling to achieve adaptive resource allocation without assuming any prior knowledge about the applications running inside the cloud. CloudScale can resolve scaling conflicts between applications using migration, and integrates dynamic CPU voltage/frequency scaling to achieve energy savings with minimal effect on application SLOs. We have implemented CloudScale on top of Xen and conducted extensive experiments using a set of CPU- and memory-intensive applications (RUBiS, Hadoop, IBM System S). The results show that CloudScale can achieve significantly higher SLO conformance than other alternatives with low resource and energy cost. CloudScale is non-intrusive and light-weight, and imposes negligible overhead.
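The "prediction error handling" that the abstract mentions can be made concrete with a minimal, hypothetical sketch: the allocator grants the predicted demand plus a padding derived from recent under-estimation errors, so that transient mispredictions are less likely to cause SLO violations. The class and parameter names below are illustrative, not CloudScale's actual API.

```python
# Minimal sketch, assuming CloudScale-style error padding: allocations
# equal the predicted demand plus the worst recent under-estimation,
# so repeated under-prediction is quickly compensated.

class PaddedAllocator:
    def __init__(self, window=5):
        self.window = window
        self.errors = []  # recent (actual - predicted) prediction errors

    def allocate(self, predicted):
        # pad by the worst recent under-estimation; never pad negatively
        pad = max(self.errors, default=0)
        return predicted + max(pad, 0)

    def observe(self, predicted, actual):
        # record the signed prediction error, keeping a sliding window
        self.errors.append(actual - predicted)
        self.errors = self.errors[-self.window:]
```

In a real system the padding would decay and combine with the online demand predictor; the sketch only shows why tracking signed errors lets under-estimation be corrected without permanently over-provisioning.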

662 citations

Proceedings Article
24 Mar 1997
TL;DR: A fresh look at the nature of complexity in the building of computer-based systems, motivated by failure post-mortems whose causes range from hardware failures through software errors to major system-level mistakes.
Abstract: Every organisation, from the scale of whole countries down to small companies, has a list of system developments which have ended in various forms of disaster. The nature of the failures varies, but typical examples are cost overruns, timescale overruns and, sometimes, loss of life. The post-mortems of these systems reveal a wide range of reasons, all the way from hardware failures, through software errors, right up to major system-level mistakes. More importantly, a large number of these systems share one attribute: complexity. This paper presents a fresh look at the nature of complexity in the building of computer-based systems.

620 citations

Proceedings ArticleDOI
01 Oct 2010
TL;DR: This paper presents a novel PRedictive Elastic reSource Scaling (PRESS) scheme for cloud systems that unobtrusively extracts fine-grained dynamic patterns in application resource demands and adjusts their resource allocations automatically.
Abstract: Cloud systems require elastic resource allocation to minimize resource provisioning costs while meeting service level objectives (SLOs). In this paper, we present a novel PRedictive Elastic reSource Scaling (PRESS) scheme for cloud systems. PRESS unobtrusively extracts fine-grained dynamic patterns in application resource demands and adjusts their resource allocations automatically. Our approach leverages light-weight signal processing and statistical learning algorithms to achieve online predictions of dynamic application resource requirements. We have implemented the PRESS system on Xen and tested it using RUBiS and an application load trace from Google. Our experiments show that we can achieve good resource prediction accuracy with less than 5% over-estimation error and near-zero under-estimation error, and that elastic resource scaling can significantly reduce both resource waste and SLO violations.
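The abstract's "light-weight signal processing" for pattern extraction can be illustrated with a simplified, hypothetical sketch (PRESS itself uses signature extraction over FFT-derived patterns; here we substitute plain autocorrelation for brevity). The assumption is a demand signal with one dominant repeating period; we locate it and forecast by replaying the last observed cycle with a small padding against under-estimation.

```python
# Simplified sketch of pattern-based demand prediction in the spirit of
# PRESS: find the dominant period of the demand series by autocorrelation,
# then predict the next values by replaying the most recent full cycle.

def dominant_period(series, max_lag=None):
    """Return the lag in [2, max_lag] with the highest autocorrelation."""
    n = len(series)
    max_lag = max_lag or n // 2
    mean = sum(series) / n
    dev = [x - mean for x in series]

    def autocorr(lag):
        return sum(dev[i] * dev[i + lag] for i in range(n - lag))

    return max(range(2, max_lag + 1), key=autocorr)

def predict(series, horizon, padding=1.05):
    """Forecast `horizon` future values; `padding` inflates the replayed
    cycle slightly to bias against costly under-estimation."""
    period = dominant_period(series)
    return [series[-period + (i % period)] * padding for i in range(horizon)]
```

The padding factor mirrors the abstract's asymmetric error goal: a small amount of deliberate over-estimation in exchange for near-zero under-estimation.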

591 citations

Proceedings Article
01 Jan 2013
TL;DR: AGILE uses wavelets to provide a medium-term resource demand prediction with enough lead time to start up new application server instances before performance falls short, and it uses dynamic VM cloning to reduce application startup times.
Abstract: Dynamically adjusting the number of virtual machines (VMs) assigned to a cloud application to keep up with load changes and interference from other users typically requires detailed application knowledge and an ability to know the future, neither of which is readily available to infrastructure service providers or application owners. The result is that systems either need to be over-provisioned (costly), or risk missing their performance Service Level Objectives (SLOs) and having to pay penalties (also costly). AGILE deals with both issues: it uses wavelets to provide a medium-term resource demand prediction with enough lead time to start up new application server instances before performance falls short, and it uses dynamic VM cloning to reduce application startup times. Tests using RUBiS and Google cluster traces show that AGILE can predict varying resource demands over the medium term with up to a 3.42× better true-positive rate and 0.34× the false-positive rate of existing schemes. Given a target SLO violation rate, AGILE can efficiently handle dynamic application workloads, reducing both penalties and user dissatisfaction.
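To make the wavelet-based prediction idea concrete, here is an illustrative sketch (not AGILE's actual code) of one step of the Haar wavelet transform, the simplest discrete wavelet: a signal is split into a coarse approximation (pairwise averages) and detail coefficients (pairwise differences). A wavelet-based predictor in this spirit would extrapolate each scale separately and reconstruct the forecast; smooth medium-term trends live in the approximation, while short-lived fluctuations stay in the details.

```python
# Illustrative one-level Haar wavelet decomposition and its inverse.
# The approximation carries the slow trend; the details carry the
# fast fluctuations that a medium-term predictor can treat separately.

def haar_step(signal):
    """One Haar decomposition level; len(signal) must be even."""
    approx = [(signal[i] + signal[i + 1]) / 2 for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / 2 for i in range(0, len(signal), 2)]
    return approx, detail

def haar_inverse(approx, detail):
    """Reconstruct the original signal from one decomposition level."""
    out = []
    for a, d in zip(approx, detail):
        out.extend([a + d, a - d])
    return out
```

Repeating `haar_step` on the approximation yields the multi-scale view that makes medium-term lead times possible: the coarsest scale changes slowly, so it can be extrapolated further ahead than the raw signal.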

267 citations