
Showing papers by "Roger Impey published in 2011"


Journal ArticleDOI
23 Dec 2011
TL;DR: A system for creating a single dynamic batch environment spanning multiple IaaS clouds of different types is presented, and it is shown that the system can exploit academic and commercial cloud sites in a transparent fashion.
Abstract: The emergence of academic and commercial Infrastructure-as-a-Service (IaaS) clouds is opening access to new resources for the HEP community. In this paper we describe a system we have developed for creating a single dynamic batch environment spanning multiple IaaS clouds of different types (e.g. Nimbus, OpenNebula, Amazon EC2). A HEP user interacting with the system submits a job description file with a pointer to their VM image. VM images can either be created by users directly or provided to the users. We have created a new software component called Cloud Scheduler that detects waiting jobs and boots the required user VM on any one of the available cloud resources. As the user VMs appear, they are attached to the job queues of a central Condor job scheduler, which then submits the jobs to the VMs. The number of VMs available to the user is expanded and contracted dynamically depending on the number of user jobs. We present the motivation and design of the system with particular emphasis on Cloud Scheduler. We show that the system provides the ability to exploit academic and commercial cloud sites in a transparent fashion.
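The core of the workflow the abstract describes — detect waiting jobs, boot the required user VM on whichever cloud has capacity — can be sketched as a simple placement pass. This is an illustrative sketch only; the class and function names below are assumptions, not Cloud Scheduler's actual code.

```python
from dataclasses import dataclass, field

@dataclass
class Job:
    """A queued batch job and the user VM image it requires."""
    job_id: int
    vm_image: str  # pointer to the user-supplied VM image

@dataclass
class Cloud:
    """One IaaS cloud with a fixed number of VM slots."""
    name: str
    capacity: int
    booted: list = field(default_factory=list)

    def free_slots(self) -> int:
        return self.capacity - len(self.booted)

def schedule(waiting_jobs, clouds):
    """One pass of the scheduling loop: for each waiting job, boot its
    VM image on the first cloud with a free slot.  Returns a list of
    (job, cloud name) placements; unplaced jobs remain queued."""
    placements = []
    for job in list(waiting_jobs):          # iterate over a copy
        for cloud in clouds:
            if cloud.free_slots() > 0:
                cloud.booted.append(job.vm_image)   # "boot" the user VM
                placements.append((job, cloud.name))
                waiting_jobs.remove(job)
                break
    return placements
```

With two clouds offering one and two slots and four waiting jobs, a single pass places three jobs and leaves one queued; repeated passes, plus a symmetric teardown step when queues drain, give the dynamic expansion and contraction the abstract describes.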

11 citations


Posted Content
TL;DR: In this article, it is shown that distributed Infrastructure-as-a-Service (IaaS) compute clouds can be used effectively for the analysis of high energy physics data: a user prepares an analysis virtual machine (VM) and submits batch jobs to a central scheduler.
Abstract: We show that distributed Infrastructure-as-a-Service (IaaS) compute clouds can be used effectively for the analysis of high energy physics data. We have designed a distributed cloud system that works with any application using large input data sets requiring a high throughput computing environment. The system uses IaaS-enabled science and commercial clusters in Canada and the United States. We describe the process in which a user prepares an analysis virtual machine (VM) and submits batch jobs to a central scheduler. The system boots the user-specific VM on one of the IaaS clouds, runs the jobs and returns the output to the user. The user application accesses a central database for calibration data during execution. Similarly, the input data is stored centrally and streamed by the running application. The system can efficiently run one hundred simultaneous jobs and should scale to many hundreds and possibly thousands of user jobs.
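The per-job data flow in this design — input streamed from a central location, with calibration constants looked up in a central database during processing — can be sketched as follows. The function names and the dict standing in for the calibration database are illustrative assumptions, not the paper's actual interfaces.

```python
def stream_chunks(source, chunk_size=3):
    """Yield the input in fixed-size chunks, standing in for an
    application that streams its data set from a central location
    instead of staging a full copy onto the VM."""
    for i in range(0, len(source), chunk_size):
        yield source[i:i + chunk_size]

def run_job(events, calibration_db):
    """Process streamed (channel, raw value) events, consulting the
    central calibration 'database' (here just a dict of per-channel
    gains) as each event is handled."""
    total = 0.0
    for chunk in stream_chunks(events):
        for channel, raw in chunk:
            total += raw * calibration_db[channel]  # apply calibration
    return total
```

Because each job only ever holds one chunk in memory and fetches calibration on demand, many such jobs can run concurrently against the same central services, which is what lets the system scale to hundreds of simultaneous user jobs.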

6 citations


Proceedings ArticleDOI
30 Sep 2011
TL;DR: It is shown that Repoman removes the burden of image management from users while simplifying the deployment of user-specific virtual machines.
Abstract: With broader use of IaaS science clouds, the management of multiple Virtual Machine (VM) images is becoming increasingly daunting for the user. In a typical workflow, users work on a prototype VM, then clone and upload it in preparation for building a virtual cluster of identical instances. We describe and benchmark a novel VM image repository (Repoman), which can be used to clone, update, manage, store and distribute VM images to multiple clouds. Users authenticate against Repoman’s simple REST API using X.509 grid proxy certificates. The lightweight Repoman CLI client tool has minimal Python dependencies and can be installed in seconds using standard Python tools. We show that Repoman removes the burden of image management from users while simplifying the deployment of user-specific virtual machines.
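A client for an image repository with a simple REST API, as described above, reduces to constructing per-user, per-image URLs and issuing authenticated HTTP requests against them. The sketch below only builds the requests; the endpoint paths and class name are illustrative assumptions, not Repoman's documented API.

```python
class ImageRepoClient:
    """Minimal sketch of a REST client for a VM image repository in the
    style of Repoman.  A real client would send these requests with an
    X.509 grid proxy certificate for authentication; here we only
    construct the (method, URL) pairs."""

    def __init__(self, base_url: str, user: str):
        self.base_url = base_url.rstrip("/")
        self.user = user

    def image_url(self, image_name: str) -> str:
        # Hypothetical endpoint layout: images are namespaced by owner.
        return f"{self.base_url}/api/images/{self.user}/{image_name}"

    def download_request(self, image_name: str):
        # GET fetches an image, e.g. when a cloud site pulls it for boot.
        return ("GET", self.image_url(image_name))

    def upload_request(self, image_name: str):
        # PUT uploads a cloned/updated image back to the repository.
        return ("PUT", self.image_url(image_name))
```

Keeping the client this thin is consistent with the abstract's point that the CLI tool has minimal Python dependencies and installs in seconds with standard Python tools.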

6 citations