Deakin University
Browse

File(s) under permanent embargo

A comparative study at the logical level of centralised and distributed recovery in clusters

journal contribution
posted on 2005-01-01, 00:00 authored by Andrew Scott Maloney, Andrzej GoscinskiAndrzej Goscinski
Cluster systems are becoming more prevalent in today’s computer society and users are beginning to request that these systems be reliable. Currently, most clusters have been designed to provide high performance at the cost of providing little to no reliability. To combat this, this report looks at how a recovery facility, based on either a centralised or distributed approach could be implemented into a cluster that is supported by a check pointing facility. This recovery facility can then recover failed user processes by using checkpoints of the processes that have been taken during failure free execution.

History

Journal

Lecture notes in computer science

Volume

LNCS 3719

Pagination

118 - 124

Publisher

Springer, Berlin

Location

Berlin, Germany

ISSN

0302-9743

eISSN

1611-3349

Language

eng

Publication classification

C1 Refereed article in a scholarly journal

Copyright notice

2005, Springer-Verlag Berlin Heidelberg

Usage metrics

    Research Publications

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC