Openly accessible

Exploiting operating system services to efficiently checkpoint parallel applications in GENESIS

Rough, Justin and Goscinski, Andrzej 2002, Exploiting operating system services to efficiently checkpoint parallel applications in GENESIS, in 2002 5th International Conference on Algorithms and Architectures for Parallel Processing : ICA3PP/2002 : Beijing, China, October 23-25, 2002, IEEE Xplore, Piscataway, N.J., pp. 261-268.

Attached Files
Name Description MIMEType Size Downloads
goscinski-exploitingoperatingsystem-2002.pdf Published version application/pdf 460.55KB 54

Title Exploiting operating system services to efficiently checkpoint parallel applications in GENESIS
Author(s) Rough, Justin (ORCID iD: orcid.org/0000-0003-1742-0127)
Goscinski, Andrzej
Conference name International Conference on Algorithms and Architectures for Parallel Processing (5th : 2002 : Beijing, China)
Conference location Beijing, China
Conference dates 23 - 25 Oct. 2002
Title of proceedings 2002 5th International Conference on Algorithms and Architectures for Parallel Processing : ICA3PP/2002 : Beijing, China, October 23-25, 2002
Editor(s) Zhou, Wanlei
Chi, Xue-bin
Goscinski, Andrzej
Li, Guo-jie
Publication date 2002
Conference series International Conference on Algorithms and Architectures for Parallel Processing
Start page 261
End page 268
Total pages 8
Publisher IEEE Xplore
Place of publication Piscataway, N.J.
Summary Recent research efforts of parallel processing on non-dedicated clusters have focused on high execution performance, parallelism management, transparent access to resources, and making clusters easy to use. However, as a collection of independent computers used by multiple users, clusters are susceptible to failure. This paper shows the development of a coordinated checkpointing facility for the GENESIS cluster operating system. This facility was developed by exploiting existing operating system services. High performance and low overheads are achieved by allowing the processes of a parallel application to continue executing during the creation of checkpoints, while maintaining low demands on cluster resources by using coordinated checkpointing.
Notes This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.
ISBN 0769515126
9780769515120
Language eng
Field of Research 080307 Operating Systems
Socio Economic Objective 970108 Expanding Knowledge in the Information and Computing Sciences
HERDC Research category E1 Full written paper - refereed
Copyright notice ©2002, IEEE
Free to Read? Yes
Persistent URL http://hdl.handle.net/10536/DRO/DU:30004682

Document type: Conference Paper
Collections: School of Information Technology
Open Access Collection
Unless expressly stated otherwise, the copyright for items in DRO is owned by the author, with all rights reserved.

Every reasonable effort has been made to ensure that permission has been obtained for items included in DRO. If you believe that your rights have been infringed by this repository, please contact drosupport@deakin.edu.au.

Citation counts: Cited 1 time in TR Web of Science; cited 0 times in Scopus
Access Statistics: 536 Abstract Views, 54 File Downloads
Created: Mon, 07 Jul 2008, 09:40:10 EST
