Abawajy, Jemal and Hassan, Mohammad Mehedi 2015, Reliability-aware distributed computing scheduling policy. In Wang, Guojun, Zomaya, Albert, Perez, Gregorio Martinez and Li, Kenli (ed), Algorithms and architectures for parallel processing : ICA3PP international workshops and symposiums, Zhangjiajie, China, November 18-20, 2015, proceedings, Springer International Publishing, Cham, Switzerland, pp.627-632, doi: 10.1007/978-3-319-27161-3_57.
Resource management and scheduling are among the primary issues in using distributed computing efficiently and effectively. Because resource failures are common in distributed computing, integrating scheduling with fault-tolerance support is of paramount importance. To this end, we propose a fault-tolerant dynamic scheduling policy that loosely couples dynamic job scheduling with a job replication scheme so that jobs are executed efficiently and reliably. The novelty of the proposed algorithm is that it uses a passive replication approach under high system load and an active replication approach under low system load. The switch between these two replication methods is performed dynamically and transparently. We evaluate the performance of the proposed fault-tolerant scheduler and compare it with a similar fault-tolerant scheduling policy; the results show that the proposed policy performs better than the existing approach.
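The load-dependent switch between replication modes described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: the load threshold of 0.7, the replica count, and all names (`Replication`, `Plan`, `choose_mode`, `plan_job`) are hypothetical and chosen only for illustration. The sketch assumes that active replication runs all replicas concurrently, while passive replication runs one primary and keeps the remaining replicas as standby backups.

```python
from dataclasses import dataclass
from enum import Enum


class Replication(Enum):
    ACTIVE = "active"    # all replicas run concurrently
    PASSIVE = "passive"  # one primary runs; backups start only after a failure


@dataclass
class Plan:
    mode: Replication
    running: list   # nodes that start executing the job immediately
    standby: list   # nodes held in reserve as backups


def choose_mode(load: float, threshold: float = 0.7) -> Replication:
    """Pick a replication mode from the current system load (0.0 to 1.0).

    The 0.7 threshold is an illustrative assumption, not a value from the paper.
    """
    return Replication.PASSIVE if load >= threshold else Replication.ACTIVE


def plan_job(nodes: list, load: float, replicas: int = 2,
             threshold: float = 0.7) -> Plan:
    """Assign up to `replicas` nodes to a job according to the chosen mode."""
    chosen = nodes[:replicas]
    mode = choose_mode(load, threshold)
    if mode is Replication.ACTIVE:
        # Low load: spare capacity exists, so every replica runs at once.
        return Plan(mode, running=chosen, standby=[])
    # High load: run a single primary and keep the rest as standby backups.
    return Plan(mode, running=chosen[:1], standby=chosen[1:])
```

Because `plan_job` re-evaluates the load on every call, the mode can change from one job to the next without the submitter being aware of it, which is one simple way to read "dynamically and transparently".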
Presented at ICA3PP international workshops and symposiums. Zhangjiajie, China, November 18-20, 2015
Field of Research
080501 Distributed and Grid Systems
Socio Economic Objective
970108 Expanding Knowledge in the Information and Computing Sciences