File(s) under permanent embargo
Fault detection service architecture for grid computing systems
The ability to tolerate failures while effectively exploiting grid computing resources in a scalable and transparent manner must be an integral part of grid computing infrastructure. Hence, a fault-detection service is a necessary prerequisite to fault tolerance and fault recovery in grid computing. To this end, we present a scalable fault-detection service architecture. The proposed fault-detection system provides services that monitor user applications, grid middleware, and the dynamically changing state of a collection of distributed resources. It reports summaries of this information to the appropriate agents on demand, or instantaneously in the event of failures.
Journal: Lecture notes in computer science
Volume: 3044
Pagination: 107 - 115
Publisher: Springer-Verlag
Location: Berlin, Germany
ISSN: 0302-9743
eISSN: 1611-3349
Language: eng
Publication classification: C1 Refereed article in a scholarly journal
Copyright notice: 2004, Springer-Verlag