Modelling Resilience in Cloud-Scale Data Centres
Abstract
The trend for cloud computing has initiated a race towards data centres (DC) of an ever-increasing size. The largest DCs now contain many hundreds of thousands of virtual machine (VM) services. Given the finite lifespan of hardware, such large DCs are subject to frequent hardware failure events that can lead to disruption of service. To counter this, multiple redundant copies of task threads may be distributed around a DC to ensure that individual hardware failures do not cause entire jobs to fail. Here, we present results demonstrating the resilience of different job scheduling algorithms in a simulated DC with hardware failure. We use a simple model of jobs distributed across a hardware network to demonstrate the relationship between resilience and additional communication costs of different scheduling methods.
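The trade-off described in the abstract can be illustrated with a toy Monte Carlo simulation. This is a minimal sketch under stated assumptions, not the authors' model: the two placement policies (`clustered`, `spread`), the rack/server layout, the independent failure probabilities `p_server` and `p_rack`, and the cross-rack pair count used as a communication-cost proxy are all hypothetical choices made for illustration.

```python
import random
from itertools import combinations


def place_replicas(num_tasks, replicas, num_racks, servers_per_rack, strategy, rng):
    """Map each task to a list of (rack, server) slots holding its copies."""
    placement = {}
    for task in range(num_tasks):
        if strategy == "clustered":
            # Hypothetical policy: all copies share one rack (cheap to coordinate).
            rack = rng.randrange(num_racks)
            servers = rng.sample(range(servers_per_rack), replicas)
            placement[task] = [(rack, server) for server in servers]
        elif strategy == "spread":
            # Hypothetical policy: every copy goes to a different rack.
            racks = rng.sample(range(num_racks), replicas)
            placement[task] = [(rack, rng.randrange(servers_per_rack)) for rack in racks]
        else:
            raise ValueError(f"unknown strategy: {strategy}")
    return placement


def communication_cost(placement):
    """Assumed cost proxy: number of replica pairs located in different racks."""
    return sum(
        1
        for copies in placement.values()
        for (rack_a, _), (rack_b, _) in combinations(copies, 2)
        if rack_a != rack_b
    )


def job_survives(placement, p_server, p_rack, num_racks, servers_per_rack, rng):
    """A job survives if every task keeps at least one live copy."""
    dead_racks = {r for r in range(num_racks) if rng.random() < p_rack}
    dead_servers = {
        (r, s)
        for r in range(num_racks)
        for s in range(servers_per_rack)
        if rng.random() < p_server
    }

    def alive(copy):
        return copy[0] not in dead_racks and copy not in dead_servers

    return all(any(alive(c) for c in copies) for copies in placement.values())


def survival_rate(strategy, p_server, p_rack, trials=500, seed=0,
                  num_tasks=20, replicas=3, num_racks=8, servers_per_rack=10):
    """Fraction of simulated jobs that survive one round of random failures."""
    rng = random.Random(seed)
    survived = 0
    for _ in range(trials):
        placement = place_replicas(
            num_tasks, replicas, num_racks, servers_per_rack, strategy, rng
        )
        if job_survives(placement, p_server, p_rack, num_racks, servers_per_rack, rng):
            survived += 1
    return survived / trials


if __name__ == "__main__":
    for strategy in ("clustered", "spread"):
        rate = survival_rate(strategy, p_server=0.05, p_rack=0.05)
        print(strategy, "survival rate:", rate)
```

In this sketch, the `clustered` policy always has a cross-rack cost of zero but loses every copy of a task when its rack fails, whereas `spread` pays for cross-rack coordination in exchange for tolerating single-rack failures. The parameter values are arbitrary and would need to be calibrated against a real DC topology before drawing conclusions.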
Original language | English |
---|---|
Title of host publication | 23rd European Modeling and Simulation Symposium (EMSS 2011) |
Subtitle of host publication | Proceedings of a meeting held 12-14 September 2011, Rome, Italy. Held at the International Mediterranean and Latin American Modeling Multiconference |
Editors | Agostino Bruzzone, Miquel Piera, Francesco Longo, Priscilla Elfrey, Michael Affenzeller, Osman Balci |
Publisher | University of Genoa Press |
Pages | 299-307 |
Number of pages | 9 |
ISBN (Print) | 9788890372445 |
Publication status | Published - Mar 2014 |
Event | 23rd European Modeling & Simulation Symposium (EMSS-2011), Rome, Italy. Duration: 12 Sept 2011 → 14 Sept 2011 |
Conference
Conference | 23rd European Modeling & Simulation Symposium (EMSS-2011) |
---|---|
Country/Territory | Italy |
City | Rome |
Period | 12/09/11 → 14/09/11 |
Keywords
- cloud computing
- cloud middleware
- network topology
- resilience
- simulation
Projects
Projects associated with 'Modelling Resilience in Cloud-Scale Data Centres':
- 2 Finished
- Cloud computing for large scale complex IT systems.
  Cliff, D. (Principal Investigator)
  1/10/10 → 1/04/14
  Project: Research
- LSCITS-RPV2: LARGE SCALE COMPLEX IT SYSTEMS INITIATIVE
  Cliff, D. (Principal Investigator)
  1/07/07 → 1/07/13
  Project: Research