Towards Proactive Fault Management of Enterprise Systems
Author | Jia, Rui |
Author | Abdelwahed, Sherif |
Author | Erradi, Abdelkarim |
Available date | 2023-04-10T09:10:05Z |
Publication Date | 2015 |
Publication Name | Proceedings - 2015 International Conference on Cloud and Autonomic Computing, ICCAC 2015 |
Resource | Scopus |
Abstract | This paper introduces a model-based approach for autonomic fault management of computing systems. The proposed approach can recover a system from common faults while minimizing the impact on the system's quality of service and reducing potential revenue loss. When a fault occurs, the approach identifies the fault type and uses a predictive control algorithm to compute the optimal recovery action, the one with minimum impact on performance and operating cost. The paper introduces the formal setting of the model-based fault management approach and the underlying predictive control algorithm. The approach has been validated on a testbed against simulated faults, including memory leaks and network congestion. Simulation results show that the approach enabled effective automatic recovery from these faults with minimal impact on system performance. © 2015 IEEE. |
Language | en |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Subject | Autonomic Computing; Fault Tolerance; Model-based Control; Self-healing |
Type | Conference Paper |
Pagination | 21-32 |
This item appears in the following Collection(s)
- Computer Science & Engineering [2402 items]