- This event has passed.
Maximizing Error Injection Realism for Chaos Engineering with System Calls
May 27, 2021 @ 16:00 - 17:00
Long Zhang (KTH) will present his work at the KTH Software Research Meetup, May 27, 16h, on zoom
Some of the perturbation models for chaos engineering are based on a random strategy such as ChaosMonkey. However, realistic perturbations could also come from errors that have naturally happened in production. I would like to share more about how to improve the realism for chaos engineering experiments.
During the talk, I would like to share our research work on chaos engineering for system call invocations. I will present a novel fault injection framework for system call invocation errors, called Phoebe. Phoebe is unique as follows: First, Phoebe enables developers to have full observability of system call invocations. Second, Phoebe generates error models that are realistic in the sense that they mimic errors that naturally happen in production. Third, Phoebe is able to automatically conduct experiments to systematically assess the reliability of applications with respect to system call invocation errors in production. We evaluate the effectiveness and runtime overhead of Phoebe on two real-world applications in a production environment. The results show that Phoebe successfully generates realistic error models and is able to detect important reliability weaknesses with respect to system call invocation errors.
The corresponding research paper has been accepted by IEEE Transactions on Dependable and Secure Computing: https://doi.org/10.1109/TDSC.2021.3069715
Long Zhang is now a Ph.D. student in computer science at KTH Royal Institute of Technology, Sweden. His research work focuses on self-healing software, chaos engineering, and antifragile systems. Long received his BE degree and ME degree in software engineering from Harbin Institute of Technology, China. Before his Ph.D. study, Long was hired by Tencent as a software developer and project manager, who was responsible for university-enterprise cooperation projects design and development.