Deakin University
Browse

File(s) not publicly available

Reliability-aware distributed computing scheduling policy

chapter
posted on 2015-01-01, 00:00 authored by Jemal AbawajyJemal Abawajy, M M Hassan
One of the primary issues associated with the efficient and effective utilization of distributed computing is resource management and scheduling. As distributed computing resource failure is a common occurrence, the issue of deploying support for integrated scheduling and fault-tolerant approaches becomes paramount importance. To this end, we propose a fault-tolerant dynamic scheduling policy that loosely couples dynamic job scheduling with job replication scheme such that jobs are efficiently and reliably executed. The novelty of the proposed algorithm is that it uses passive replication approach under high system load and active replication approach under low system loads. The switch between these two replication methods is also done dynamically and transparently. Performance evaluation of the proposed fault-tolerant scheduler and a comparison with similar fault-tolerant scheduling policy is presented and shown that the proposed policy performs better than the existing approach.

History

Title of book

Algorithms and architectures for parallel processing : ICA3PP international workshops and symposiums, Zhangjiajie, China, November 18-20, 2015, proceedings

Volume

9532

Series

Lecture notes in computer science

Chapter number

57

Pagination

627 - 632

Publisher

Springer International Publishing

Place of publication

Cham, Switzerland

ISSN

0302-9743

ISBN-13

978-3-319-27160-6

Language

eng

Notes

Presented at ICA3PP international workshops and symposiums. Zhangjiajie, China, November 18-20, 2015

Publication classification

B Book chapter; B1 Book chapter

Copyright notice

2015, Springer

Extent

77

Editor/Contributor(s)

W Guojun, A Zomaya, G Perez, K Li