In the context of extensive applications, it’s not uncommon to see hundreds of microservices interacting with each other, which can quickly become...
Latest posts
Introduction to service mesh – Implementing Traffic Management, Security, and Observability with Istio-1
Imagine being in a bustling city with a complex network of roads and highways. You’re driving your car from one side of the city to the other. In...
Revisiting the Blog App – Implementing Traffic Management, Security, and Observability with Istio
Since we discussed the Blog App previously, let’s look at the services and their interactions again: Figure 15.1 – The Blog App and its...
Code development – The Role of AI in DevOps
This area is where we see the most significant impact of generative AI and other AI technologies. AI revolutionizes code development by automating...
Setting up the baseline – Implementing Traffic Management, Security, and Observability with Istio
To ensure continuity with the previous chapters, let’s start by creating a service account for Terraform so that we can interact with our GCP...
Technical requirements – Implementing Traffic Management, Security, and Observability with Istio
In the previous chapter, we covered site reliability engineering (SRE) and how it has helped manage production environments using DevOps practices....
Alerting with Grafana – The Role of AI in DevOps
The recent developments in artificial intelligence (AI) with the launch of generative AI using ChatGPT have taken the tech industry by storm. It...
Running distributed applications in production – Understanding Key Performance Indicators (KPIs) for Your Production Service
So far, we’ve been discussing KPIs for running an application in production, taking inspiration from SRE principles. Now, let’s understand how we...
Disaster recovery, RTO, and RPO – Understanding Key Performance Indicators (KPIs) for Your Production Service
Disaster recovery is a comprehensive strategy that’s designed to ensure an organization’s resilience in the face of unexpected, disruptive events,...
Error budgets – Understanding Key Performance Indicators (KPIs) for Your Production Service
As defined by Liz Fong-Jones and Seth Vargo, error budgets represent “a quantitative measure shared between product and SRE teams to balance...