Creating an SRE team

If you wanted to build an SRE team at your company, how would you go about it? How would you structure it?


SRE Resources

The following is a list of SRE resources I’m finding useful. I will update it as I find more. The good news is that most of the books (including all 3 of the Google SRE books) are available for free download at https://landing.google.com/sre/books.


SLI, SLO and SLA

What are SLIs, SLOs and SLAs? 

Service Level Indicators (SLIs) are metrics that you choose to measure the health and performance of your services. Service Level Objectives (SLOs) are the desired targets for those indicators. Service Level Agreements (SLAs) build on this and include the consequences of not meeting those targets. All are fundamental to Site Reliability Engineering.

In this post, I’ll try to explain each in more detail, show how they relate to each other, and give some examples of each.
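
As a quick illustration of how the three fit together, here is a minimal Python sketch. It assumes a request-based availability SLI (successful requests divided by total requests). The function names, the 99.9% SLO target, and the 99.5% SLA threshold are all hypothetical values chosen for the example, not recommendations.

```python
def availability_sli(good_requests: int, total_requests: int) -> float:
    """SLI: the measured fraction of requests that succeeded."""
    if total_requests == 0:
        # Assumption for this sketch: no traffic counts as fully available.
        return 1.0
    return good_requests / total_requests


def meets_slo(sli: float, slo_target: float) -> bool:
    """SLO: the internal target we want the SLI to stay at or above."""
    return sli >= slo_target


def breaches_sla(sli: float, sla_threshold: float) -> bool:
    """SLA: the contractual threshold; falling below it has consequences."""
    return sli < sla_threshold


# Hypothetical targets: the SLO is deliberately stricter than the SLA,
# so that we notice a problem before the customer-facing agreement is broken.
SLO_TARGET = 0.999
SLA_THRESHOLD = 0.995

sli = availability_sli(good_requests=99_800, total_requests=100_000)
print(f"SLI = {sli:.4f}")
print("Meets SLO:", meets_slo(sli, SLO_TARGET))
print("Breaches SLA:", breaches_sla(sli, SLA_THRESHOLD))
```

In this made-up measurement the service misses its SLO but has not yet breached the SLA, which is the usual reason for setting the SLO tighter than the SLA.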


SRE vs DevOps

I’m really enjoying the Seeking SRE book. Chapter 12 covers SRE vs DevOps as a community-sourced compare-and-contrast discussion.

My favorite description is from Thomas Limoncelli, who suggested that:

DevOps engineers focus on the SDLC pipeline with occasional responsibilities for production operations. SREs focus on production operations with occasional responsibilities for the SDLC pipeline.
