Author: Jia, Rui
Author: Abdelwahed, Sherif
Author: Erradi, Abdelkarim
Available date: 2023-04-10T09:10:05Z
Publication Date: 2015
Publication Name: Proceedings - 2015 International Conference on Cloud and Autonomic Computing, ICCAC 2015
Resource: Scopus
URI: http://dx.doi.org/10.1109/ICCAC.2015.18
URI: http://hdl.handle.net/10576/41817
Abstract: This paper introduces a model-based approach for autonomic fault management of computing systems. The proposed approach can recover a system from common faults while minimizing the impact on the system's quality of service and reducing potential revenue loss. When faults occur, the approach identifies the fault types and uses a predictive control algorithm to compute the optimal recovery action, the one with minimum impact on performance and operating cost. The paper introduces the formal settings of the model-based fault management approach and the underlying predictive control algorithm. The fault management approach has been verified on a testbed with respect to simulated faults, including memory leaks and network congestion. Simulation results show that the approach enabled effective automatic recovery from these faults with minimal impact on system performance. © 2015 IEEE.
Language: en
Publisher: Institute of Electrical and Electronics Engineers Inc.
Subject: Autonomic Computing
Subject: Fault Tolerance
Subject: Model-based Control
Subject: Self-healing
Title: Towards Proactive Fault Management of Enterprise Systems
Type: Conference Paper
Pagination: 21-32


Files in this item

There are no files associated with this item.
