SREcon19 Americas
Monday, March 25 • 10:30am - 11:00am
Case Study: Implementing SLOs for a New Service

Implementing service level objectives (SLOs) effectively is hard, especially for a service that is new to your engineering and product organizations and that combines a request-driven subsystem with a storage subsystem.

In this talk, I will discuss our experience defining and measuring service level indicators (SLIs) and objectives for our Ceph Object Storage service. I will describe our approach to specifying SLIs, along with the tradeoffs and implementation decisions we made when measuring several types of SLIs, including availability, latency, and durability.

I will also share the lessons we learned and the benefits we gained from our implementation. You will understand why SLOs are crucial for site reliability engineers and service users, and you will get some tips on implementing them for either a request-driven or a storage system.

Arnaud Lawson

Arnaud is a Senior Site Reliability Engineer at Squarespace in New York, where—among other things—he has led the productionization of Ceph as a storage backend used by many Squarespace services.

Monday March 25, 2019 10:30am - 11:00am EDT
Grand Ballroom ABC