What's Up Time? The Intro to Site Reliability Engineering Workshop

Help your team minimize risk while speeding up deployment

Resources

When was the last time your team broke something in production? Why did it happen? What was impacted? Wouldn’t it be great if there was a set of practices you could implement to avoid such a headache?

Join thoughtbot Senior Developer and SRE expert, Clarissa Borges, for a free and live virtual workshop all about Site Reliability Engineering (SRE). SRE leverages operations data and software engineering to automate IT operations tasks, accelerate software delivery, and minimize IT risk. Clarissa will walk you through the fundamental SRE principles and guide you through exercises designed to teach you how to create SRE standards like Service Level Objectives (SLOs) for your own organization.

This workshop will include:

  • Overview of SRE concepts and practices
  • Introduction to the Service Level Objectives Lifecycle
  • Interactive exercises to help you apply SRE practices to your use cases
  • A sneak peek of thoughtbot’s observability infrastructure

This workshop is ideal for:

  • Engineering leaders looking to mature their team’s processes and product reliability
  • Developers curious about monitoring, observability and SRE and how they can help your team
  • Organizations that have multiple developer teams deploying frequently
  • Anyone frustrated by outages
Meet your workshop leader
  • Clarissa Lima Borges, Senior Developer

    Clarissa is a Senior Developer and Consultant on thoughtbot’s platform engineering team, where she has made significant contributions to implementing Site Reliability Engineering (SRE) methodologies. She drives efforts to follow observability, monitoring, and incident response best practices using Prometheus and Grafana. Clarissa actively mentors fellow developers on SRE methodologies and shares her insights, presenting talks and giving workshops, both internally and externally.