Systems | Development | Analytics | API | Testing

Chaos Engineering

7 Steps to Execute Chaos Engineering

We’ve all heard about the significant WhatsApp breakdowns that have happened in the recent past, during which the app was unavailable for the public for an hour. However, from a technical standpoint, WhatsApp returned in less than an hour. What would have enabled the engineers at WhatsApp to quickly restore the services? Technically speaking, the team experienced an extremely stressful production failure because of this.

Resilience and chaos testing with SteadyBit and k6, with Benjamin Wilms (k6 Office Hours #61)

What is the difference between resilience testing and chaos testing? SteadyBit CEO and co-founder Benjamin Wilms talks about how to integrate SteadyBit with k6 for both resilience and chaos, and why he prefers "chaos testing" to "chaos engineering".

Optimize Your API Gateway with Chaos Engineering

As engineers and architects, we automatically build resilience into platforms as far as possible. But what about the unknown failures? What about the unknown behavior of your platform? The philosopher, Socrates, once said “You don’t know what you don’t know”. What if I could tell you there is a way to turn these unknowns into knowns – a way to understand how your platform will behave to specific failure events…

Why is everyone talking about chaos engineering? with Vince Huang (k6 Office Hours #56)

Why is everyone talking about chaos engineering anyway? How different is it from testing? What does it have to do with performance? In this k6 Office Hours, k6 Technical Program Manager Vince Huang joins Developer Advocates Nicole van der Hoeven and Paul Balogh to talk about these topics and more.

There's more than Performance Testing - Chaos Engineering with k6 and Steadybit

Software development is entirely different today than it was a few years ago. Back then, we usually had a big monolith running on our own hardware. We mainly did performance tests to see if the hardware resources were sufficient to handle the load. Today, we develop software in a distributed environment with multiple services which may even run on different cloud platforms. With performance testing, we try to identify performance and resilience issues in these kinds of environments.

A Practical Guide to Chaos Engineering

Modern systems built on cloud technologies and microservices architecture have a lot of dependencies on the internet, infrastructure, and services that you do not have control over. We cannot control or avoid failures in distributed systems, but we can control the impact radius of the failure and optimize the time to recover and restore the systems. This can be achieved only by exercising as many failures as we can in the test lab, thus achieving confidence in the system’s resilience., says Jitendra Nath Lella, Senior Architect, Delivery, Cigniti Technologies.

Embracing Failure With Gremlin Chaos Engineering

In this episode of Kongcast, I spoke with Jason Yee, director of advocacy at Gremlin, about the concept of chaos engineering, why even the best engineers can’t control everything, and tools and tactics to help build app resiliency. Check out the transcript and video from our conversation below, and be sure to subscribe to get email alerts for the latest new episodes.

Chaos testing with k6, Prometheus, and Grafana (Schrödinger's Pokémon)

In this presentation, Nicole van der Hoeven talks about what chaos engineering is, what it has to do with testing, and how to run chaos experiments and load tests in the same script, using k6. She also talks about how to set up Grafana and Prometheus for observability during chaos testing.

How to get started with chaos engineering (k6 Office Hours)

Chaos engineering and performance testing ultimately have the same goal: improving software reliability. But chaos engineering hasn't really been picked up by many testers yet. Nicole shares her experiences in running her first chaos experiments using k6 and New Relic. k6 Office Hours is a weekly livestream hosted by Nicole van der Hoeven and Simon Aronsson to talk about performance and reliability testing, best practices, and all things k6.