I have been reading a lot lately about the principles of chaos engineering and how breaking things on purpose can actually make a system much more resilient in the long run.
My team wants to start injecting some random faults into our microservices architecture to see if our self-healing capabilities are actually working as advertised in the documentation.
we specifically want to corrupt files for chaos engineering tests to simulate a failing disk or a noisy network connection between our primary and secondary data centers. It’s an exciting way to build more robust and reliable software.