Traditionally, software testing has taken more of a manual approach because most developers don’t want software testing as a full-time profession....
Category: Traffic mirroring
Introduction to service mesh – Implementing Traffic Management, Security, and Observability with Istio-2
In the context of extensive applications, it’s not uncommon to see hundreds of microservices interacting with each other, which can quickly become...
Code development – The Role of AI in DevOps
This area is where we see the most significant impact of generative AI and other AI technologies. AI revolutionizes code development by automating...
Running distributed applications in production – Understanding Key Performance Indicators (KPIs) for Your Production Service
So far, we’ve been discussing KPIs for running an application in production, taking inspiration from SRE principles. Now, let’s understand how we...
Disaster recovery, RTO, and RPO – Understanding Key Performance Indicators (KPIs) for Your Production Service
Disaster recovery is a comprehensive strategy that’s designed to ensure an organization’s resilience in the face of unexpected, disruptive events,...
Error budgets – Understanding Key Performance Indicators (KPIs) for Your Production Service
As defined by Liz Fong-Jones and Seth Vargo, error budgets represent “a quantitative measure shared between product and SRE teams to balance...
Alerting with Grafana – Implementing Traffic Management, Security, and Observability with Istio
To initiate the alerting process, it’s crucial to establish clear criteria. Given the limited volume at hand, simulating an accurate SLO breach can...
SLAs – Understanding Key Performance Indicators (KPIs) for Your Production Service
SLAs According to Google, SLAs are “formal or implicit agreements with your users that outline the repercussions of meeting (or failing to meet)...
SLOs – Understanding Key Performance Indicators (KPIs) for Your Production Service
SLOs Google’s definition of SLOs states that they “establish a target level for the reliability of your service.” They specify the percentage of...
Understanding the importance of reliability – Understanding Key Performance Indicators (KPIs) for Your Production Service-2
In summary, software reliability is not just a technical concern; it has wide-reaching implications for user satisfaction, business success, and...