File(s) not publicly available
Reliability-aware distributed computing scheduling policy
One of the primary issues associated with the efficient and effective utilization of distributed computing is resource management and scheduling. As distributed computing resource failure is a common occurrence, the issue of deploying support for integrated scheduling and fault-tolerant approaches becomes paramount importance. To this end, we propose a fault-tolerant dynamic scheduling policy that loosely couples dynamic job scheduling with job replication scheme such that jobs are efficiently and reliably executed. The novelty of the proposed algorithm is that it uses passive replication approach under high system load and active replication approach under low system loads. The switch between these two replication methods is also done dynamically and transparently. Performance evaluation of the proposed fault-tolerant scheduler and a comparison with similar fault-tolerant scheduling policy is presented and shown that the proposed policy performs better than the existing approach.
History
Title of book
Algorithms and architectures for parallel processing : ICA3PP international workshops and symposiums, Zhangjiajie, China, November 18-20, 2015, proceedingsVolume
9532Series
Lecture notes in computer scienceChapter number
57Pagination
627 - 632Publisher
Springer International PublishingPlace of publication
Cham, SwitzerlandPublisher DOI
ISSN
0302-9743ISBN-13
978-3-319-27160-6Language
engNotes
Presented at ICA3PP international workshops and symposiums. Zhangjiajie, China, November 18-20, 2015Publication classification
B Book chapter; B1 Book chapterCopyright notice
2015, SpringerExtent
77Editor/Contributor(s)
W Guojun, A Zomaya, G Perez, K LiUsage metrics
Categories
No categories selectedKeywords
Licence
Exports
RefWorks
BibTeX
Ref. manager
Endnote
DataCite
NLM
DC