Authors :
Muvva Venkatesh; Mohammed Taha; Dr. K. Rajitha; Dr. K. Sreekala
Volume/Issue :
Volume 11 - 2026, Issue 5 - May
Google Scholar :
https://tinyurl.com/3n8tdj4y
Scribd :
https://tinyurl.com/3mxuxczu
DOI :
https://doi.org/10.38124/ijisrt/26May1147
Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.
Abstract :
Cloud computing platforms such as AWS, Microsoft Azure, Google Cloud Platform (GCP), and Cloudflare provide
critical infrastructure for modern applications. Despite high availability guarantees, service outages continue to occur due
to various factors including infrastructure failures, network disruptions, and configuration errors. These incidents
significantly impact service reliability and user experience. This paper proposes a multi-cloud incident analysis framework
based on a normalized dataset constructed using a standardized 14-column reliability schema. The framework introduces
a COREM (Cloud Outage Risk Evaluation Model) weighted scoring algorithm to quantify and compare the risk associated
with incidents across different cloud providers. The model evaluates incidents based on severity, duration, impact, and
frequency. The system enables cross-provider comparative analysis and visualization of outage trends, helping to identify
reliability patterns and high-risk services. The proposed approach improves transparency in cloud reliability assessment
and supports better decision-making for multi-cloud deployment strategies.
Keywords :
Cloud Computing, Incident Analysis, Reliability Assessment, Multi-Cloud, COREM Model, Risk Scoring, Outage Analysis Calibration Explainable Artificial Intelligence.
References :
- Unyi, A., et al. “Explainable Graph Neural Network-Based Fault Forecasting for Cloud Service Debugging.” 2025 International Conference on Cloud Computing and Artificial Intelligence, 2025.
- Cinque, M., et al. “Resilience Evaluation Metrics for Cloud Services.” Future Generation Computer Systems, vol. 95, 2019, pp. 964–977.
- Zhang, Y., et al. “Root Cause Analysis of Failures in Large-Scale Cloud Systems.” Proceedings of the USENIX Symposium on Networked Systems Design and Implementation (NSDI), 2018.
- Gupta, A., et al. “An Empirical Study of Cloud Outages and Their Causes.” IEEE International Conference on Cloud Engineering (IC2E), 2017.
- Baset, S. A., et al. “Towards Understanding Cloud Service Level Agreements and Reliability.” IEEE International Conference on Cloud Computing, 2016.
- Dean, J. “Designs, Lessons and Advice from Building Large Distributed Systems.” Keynote Presentation, ACM Symposium on Operating Systems Principles (SOSP), 2014.
- Bailis, P., et al. “Coordination Avoidance in Database Systems.” Proceedings of the VLDB Endowment, vol. 8, no. 3, 2014, pp. 185–196.
- Birke, R., et al. “Predictive Modeling for Failure Prediction in Cloud Systems.” IEEE Transactions on Cloud Computing, vol. 2, no. 3, 2014, pp. 290–303.
- Kandula, S., et al. “Detailed Analysis of Data Center Failures in Cloud Computing Environments.” Proceedings of the ACM SIGCOMM Conference, 2009.
- Fox, A., et al. “Above the Clouds: A Berkeley View of Cloud Computing.” University of California, Berkeley Technical Report, 2009.
Cloud computing platforms such as AWS, Microsoft Azure, Google Cloud Platform (GCP), and Cloudflare provide
critical infrastructure for modern applications. Despite high availability guarantees, service outages continue to occur due
to various factors including infrastructure failures, network disruptions, and configuration errors. These incidents
significantly impact service reliability and user experience. This paper proposes a multi-cloud incident analysis framework
based on a normalized dataset constructed using a standardized 14-column reliability schema. The framework introduces
a COREM (Cloud Outage Risk Evaluation Model) weighted scoring algorithm to quantify and compare the risk associated
with incidents across different cloud providers. The model evaluates incidents based on severity, duration, impact, and
frequency. The system enables cross-provider comparative analysis and visualization of outage trends, helping to identify
reliability patterns and high-risk services. The proposed approach improves transparency in cloud reliability assessment
and supports better decision-making for multi-cloud deployment strategies.
Keywords :
Cloud Computing, Incident Analysis, Reliability Assessment, Multi-Cloud, COREM Model, Risk Scoring, Outage Analysis Calibration Explainable Artificial Intelligence.