CONAN: Diagnosing Batch Failures for Cloud Systems
- Creator: Li, Liqun , Zhang, Xu , Zhang, Dongmei , He, Shilin , Kang, Yu , Zhango, Hongyu , Ma, Mingyu , Dang, Yingnong , Xu, Zhangwei , Rajmohan, Saravan , Lin, Qingwei
- Resource Type: conference paper
- Date: 2023
Fast Outage Analysis of Large-Scale Production Clouds with Service Correlation Mining
- Creator: Wang, Yaohui , Li, Guozheng , Xu, Zhangwei , Zhao, Pu , Qiao, Bo , Li, Liqun , Zhang, Xu , Lin, Qingwei , Wang, Zijian , Kang, Yu , Zhou, Yangfan , Zhang, Hongyu , Gao, Feng , Sun, Jeffrey , Yang, Li , Lee, Pochian
- Resource Type: conference paper
- Date: 2021
Fighting the Fog of War: Automated Incident Detection for Cloud Systems
- Creator: Li, Liqun , Zhang, Xu , Gao, Feng , Yang, Li , Lin, Qingwei , Rajmohan, Saravanakumar , Xu, Zhangwei , Zhang, Dongmei , Zhao, Xin , Zhang, Hongyu , Kang, Yu , Zhao, Pu , Qiao, Bo , He, Shilin , Lee, Pochian , Sun, Jeffrey
- Resource Type: conference paper
- Date: 2021
How long will it take to mitigate this incident for online service systems?
- Creator: Wang, Weijing , Chen, Junjie , Xu, Zhangwei , Dang, Yingnong , Zhang, Dongmei , Yang, Lin , Zhang, Hongyu , Zhao, Pu , Qiao, Bo , Kang, Yu , Lin, Qingwei , Rajmohan, Saravanakumar , Gao, Feng
- Resource Type: conference paper
- Date: 2021
How incidental are the incidents? Characterizing and prioritizing incidents for large-scale online service systems
- Creator: Chen, Junjie , Zhang, Shu , He, Xiaoting , Lin, Qingwei , Zhang, Hongyu , Hao, Dan , Kang, Yu , Gao, Feng , Xu, Zhangwei , Dang, Yingnong , Zhang, Dongmei
- Resource Type: conference paper
- Date: 2020
How to mitigate the incident? An effective troubleshooting guide recommendation technique for online service systems
- Creator: Jiang, Jiajun , Lu, Weihai , Chen, Junjie , Lin, Qingwei , Zhao, Pu , Kang, Yu , Zhang, Hongyu , Xiong, Yingfei , Gao, Feng , Xu, Zhangwei , Dang, Yingnong , Zhang, Dongmei
- Resource Type: conference paper
- Date: 2020
Identifying linked incidents in large-scale online service systems
- Creator: Chen, Yujun , Yang, Xian , Dong, Hang , He, Xiaoting , Zhang, Hongyu , Lin, Qingwei , Chen, Junjie , Zhao, Pu , Kang, Yu , Gao, Feng , Xu, Zhangwei , Zhang, Dongmei
- Resource Type: conference paper
- Date: 2020
Towards Intelligent Incident Management: Why We Need It and How We Make It
- Creator: Chen, Zhuangbin , Kang, Yu , Li, Liqun , Zhang, Xu , Zhang, Hongyu , Xu, Hui , Zhou, Yangfan , Yang, Li , Sun, Jeffrey , Xu, Zhangwei , Dang, Yingnong , Gao, Feng , Zhao, Pu , Qiao, Bo , Lin, Qingwei , Zhang, Dongmei , Lyu, Michael R.
- Resource Type: conference paper
- Date: 2020
An Empirical Investigation of Incident Triage for Online Service Systems
- Creator: Chen, Junjie , He, Xiaoting , Lin, Qingwei , Xu, Yong , Zhang, Hongyu , Hao, Dan , Gao, Feng , Xu, Zhangwei , Dang, Yingnong , Zhang, Dongmei
- Resource Type: conference paper
- Date: 2019
Continuous incident triage for large-scale online service systems
- Creator: Chen, Junjie , He, Xiaoting , Lin, Qingwei , Zhang, Hongyu , Hao, Dan , Gao, Feng , Xu, Zhangwei , Dang, Yingnong , Zhang, Dongnei
- Resource Type: conference paper
- Date: 2019
Outage Prediction and Diagnosis for Cloud Service Systems
- Creator: Chen, Yujun , Zhang, Hongyu , Xu, Zhangwei , Dang, Yingnong , Yang, Xian , Lin, Qingwei , Zhang, Dongmei , Dong, Hang , Xu, Yong , Li, Hao , Kang, Yu , Gao, Feng
- Resource Type: conference paper
- Date: 2019