Experience report: Anomaly detection of cloud application operations using log and cloud metric correlation analysis

Farshchi, M; Schneider, Jean-Guy; Weber, I; Grundy, John

File(s) under permanent embargo

conference contribution

posted on 2016-01-13, 00:00 authored by M Farshchi, Jean-Guy Schneider, I Weber, John Grundy

Failure of application operations is one of the main causes of system-wide outages in cloud environments. This particularly applies to DevOps operations, such as backup, redeployment, upgrade, customized scaling, and migration, which are exposed to frequent interference from other concurrent operations, configuration changes, and resource failures. However, current practices fail to provide reliable assurance that these kinds of operations execute correctly. In this paper, we present an approach to this problem that adopts a regression-based analysis technique to find the correlation between an operation's activity logs and the effect of the operation's activities on cloud resources. The correlation model is then used to derive assertion specifications, which can be used for runtime verification of running operations and their impact on resources. We evaluated our proposed approach on Amazon EC2 with 22 rounds of rolling upgrade operations while other types of operations were running and random faults were injected. Our experiment shows that our approach successfully raised alarms for 115 randomly injected faults, with a precision of 92.3%.
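The general idea in the abstract (fit a regression between log activity and a resource metric, then turn the fitted model into a runtime assertion) can be illustrated with a minimal sketch. This is not the authors' implementation: the single-predictor least-squares fit, the three-standard-deviation tolerance, the function names `fit_line` and `derive_assertion`, and the sample data are all assumptions made for illustration.

```python
from statistics import mean, pstdev


def fit_line(xs, ys):
    """Ordinary least squares for y = a + b*x with a single predictor."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("predictor has no variance")
    b = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) / sxx
    a = y_bar - b * x_bar
    return a, b


def derive_assertion(xs, ys, k=3.0):
    """Build a checker that flags metric values far from the fitted line.

    The tolerance of k residual standard deviations is an assumed rule,
    not one taken from the paper.
    """
    a, b = fit_line(xs, ys)
    residuals = [y - (a + b * x) for x, y in zip(xs, ys)]
    tolerance = k * pstdev(residuals)

    def holds(log_count, metric_value):
        # True when the observed metric matches what the logs predict.
        return abs(metric_value - (a + b * log_count)) <= tolerance

    return holds


# Hypothetical training data: per time window, the number of
# "terminate instance" log events and the observed change in the
# count of running instances.
log_counts = [0, 1, 2, 3, 4, 5]
metric_deltas = [0.1, -0.9, -2.1, -3.0, -4.1, -4.9]

check = derive_assertion(log_counts, metric_deltas)
print(check(2, -2.0))  # observation consistent with the fitted model
print(check(2, 3.0))   # observation far from the model: raise an alarm
```

In this sketch an alarm corresponds to the checker returning `False` for a window. A model closer to the paper's setting would presumably use several log event types as predictors rather than one.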

Pagination

24 - 34

Publisher DOI

https://doi.org/10.1109/ISSRE.2015.7381796

ISBN-13

9781509004065

Publication classification

E Conference publication; E1.1 Full written paper - refereed

Copyright notice

2015, IEEE

Title of proceedings

2015 IEEE 26th International Symposium on Software Reliability Engineering, ISSRE 2015

Keywords

Science & Technology; Technology; Computer Science, Software Engineering; Engineering, Electrical & Electronic; Computer Science; Engineering; Cloud application operations; DevOps; Cloud monitoring; anomaly detection; error detection; log analysis
