Deakin University
Browse

A practical cross-datacenter fault-tolerance algorithm in the cloud storage system

Version 2 2024-06-03, 11:49
Version 1 2017-07-26, 15:35
journal contribution
posted on 2017-06-01, 00:00 authored by Yuxia Cheng, X Yu, W Chen, R Chang, Yang Xiang
The fault-tolerance property in most cloud storage systems are designed within the scale of a single datacenter. The single datacenter as a whole may be unreachable or crashed due to severe problems, such as broken network links, power supply interruptions, and natural disasters, etc. Therefore, the design of an effective cross-datacenter fault-tolerant storage system is important to protect data security in the cloud. However, building a cross-datacenter fault-tolerant system faces great challenges, such as high latency, low throughput, high costs of bandwidth resources between datacenters. In this paper, we propose a practical cross-datacenter fault-tolerant (CDFT) algorithm in the cloud storage system. Our fault-tolerant algorithm design considers the difficult tradeoffs among fault tolerance, latency, throughput, network and storage costs. We propose the Domain Fault Codes (DFC) and the topology-aware scheduling techniques, which can tolerate the whole datacenter breakdown. We implemented the DFC-CDFT algorithm in a prototype cloud storage system. The experimental results showed that the proposed DFC-CDFT algorithm can effectively recover data blocks from the single datacenter failure while achieves low storage and bandwidth costs.

History

Journal

Cluster Computing

Volume

20

Issue

2

Pagination

1801 - 1813

Publisher

Springer New York LLC

Location

New York, N.Y.

ISSN

1386-7857

eISSN

1573-7543

Language

eng

Publication classification

C Journal article; C1 Refereed article in a scholarly journal

Copyright notice

2017, Springer Science+Business Media New York