An Empirical Study on Crash Recovery Bugs in Large-Scale Distributed Systems
In large-scale distributed systems, node crashes are inevitable, and can happen at any time. As such, distributed systems are usually designed to be resilient to these node crashes via vari-ous crash recovery mechanisms, such as write-ahead logging in HBase and hinted handoffs in Cassandra. However, faults in crash recovery mechanisms and their implementations can in-troduce intricate crash recovery bugs, and lead to severe conse-quences.
In this paper, we present CREB, the most comprehensive study on 103 Crash REcovery Bugs from four popular open-source distributed systems, including ZooKeeper, Hadoop MapReduce, Cassandra and HBase. For all the studied bugs, we analyze their root causes, triggering conditions, bug impacts and fixing. Through this study, we obtain many interesting find-ings that can open up new research directions for combating crash recovery bugs.
Thu 8 NovDisplayed time zone: Guadalajara, Mexico City, Monterrey change
13:30 - 15:00 | Software Maintenance IIResearch Papers / Journal-First at Horizons 10-11 Chair(s): Emerson Murphy-Hill North Carolina State University | ||
13:30 22mTalk | Automating Change-level Self-admitted Technical Debt Determination Journal-First Meng Yan , Xin Xia Monash University, Emad Shihab Concordia University, David Lo Singapore Management University, Jianwei Yin , Xiaohu Yang DOI | ||
13:52 22mTalk | Large-Scale Study of Substitutability in the Presence of Effects Research Papers Jackson Maddox Iowa State University, USA, Yuheng Long Iowa State University, Hridesh Rajan Iowa State University | ||
14:15 22mTalk | An Empirical Study on Crash Recovery Bugs in Large-Scale Distributed Systems Research Papers Yu Gao Institute of Software, Chinese Academy of Sciences, Wensheng Dou Institute of Software, Chinese Academy of Sciences, Feng Qin Ohio State University, USA, Chushu Gao Institute of Software, Chinese Academy of Sciences, Dong Wang Institute of Software at Chinese Academy of Sciences, China, Jun Wei State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences, Beijing, Ruirui Huang Alibaba Group, China, Li Zhou Alibaba Group, China, Yongming Wu Alibaba Group, China | ||
14:37 22mTalk | Complementing Global and Local Contexts in Representing API Descriptions to Improve API Retrieval Tasks Research Papers Thanh Nguyen Iowa State University, Ngoc Tran , Hung Phan , Trong Nguyen Iowa State University, USA, Linh Truong , Trong Nguyen Iowa State University, USA, Hoan Anh Nguyen Iowa State University, USA, Tien N. Nguyen University of Texas at Dallas |