Course curriculum

  • 1

    Workshop Introduction

    • Course Introduction

    • Course Goal

    • Get Certified

    • Course Completion Certificate Sample

  • 2

    Module 1: SRE Anti-Patterns

  • 3

    Module 2: SLO is the Proxy for Customer Happiness

    • Exercise - How do you establish distributed ecosystems?

    • 2.1 What has changed with SLO?

    • Case story - Kudos Engineering

    • 2.2 Identifying System boundaries for setting SLIs is critical

    • 2.3 How do you use Error Budgets beyond the velocity versus stability debate?

    • Video - SLI/SLOs Deep Dive David Blank-Edelman (10:01)

      FREE PREVIEW
    • Case story - Home Depot’s SLO Journey

    • Case story - Cloud SLA Example

    • Module 2 Quiz

  • 4

    Module 3: Building Secure and Reliable Systems

    • 3.1 Building Secure and Reliable systems

    • Discussion - Non-Abstract Large Scale Design

    • 3.2 Non-Abstract Large Scale Design

    • 3.3 Designing for the changing Architecture and distributed ecosystem

    • 3.4 Fault tolerant Design

    • 3.5 Designing for Security

    • 3.6 Designing for Resiliency, Scalability, Performance, Availability, Reliability

    • Video - Building Secure & Reliable Systems Heather Adkins (12:23)

      FREE PREVIEW
    • 3.7 Data Security and Privacy

    • Case Story - Chrome Security Team

    • Module 3 Quiz

  • 5

    Module 4: Full Stack Observability

    • 4.1 Modern Apps are Complex & Unpredictable

    • Discussion - How do you instrument Full Stack Observability?

    • 4.2 Pillars of Observability

    • Video - OpenTelemetry Constance Caramanolis (14:12)

      FREE PREVIEW
    • Video - Prometheus and Zipkin Tom Wilkie (3:44)

    • Case Story - Planet Labs

    • Discussion - How do you bake Observability in your code?

    • Module 4 Quiz

  • 6

    Module 5: Using Platform Engineering and AIOps

    • 5.1 Taking a Platform Centric View

    • 5.2 How do you use AIOps to improve Resiliency?

    • 5.3 How can DataOps help you in the journey?

    • 5.4 A simple recipe to implement AIOps

    • Video - AI in Ops Stylianos Kampakis (7:15)

      FREE PREVIEW
    • Case Story - FedEx

    • 5.5 Indicative measurement of AIOps

    • Case Story - 3M

    • Discussion - Discuss how AIOps can help in your operations based on the FedEx Case Story

    • Module 5 Quiz

  • 7

    Module 6: SRE and Incident Response Management

    • 6.1 SRE Key Responsibilities towards incident response

    • 6.2 DevOps & SRE and ITSM (new vs. old ways)

    • 6.3 OODA and SRE Incident Response

    • 6.4 SRE and CLR (closed loop remediation)

    • 6.5 Swarming – Food for Thought

    • 6.6 AI/ML for better Incident Management

    • Discussion - Discuss Swarming versus Traditional NOC/3-Layer Support

    • Video - Runbook Automation Damon Edwards (16:02)

    • Case Story - HCL Incident Improvement Journey

    • Module 6 Quiz

  • 8

    Module 7: Chaos Engineering

    • 7.1 Navigating Complexity

    • 7.2 Chaos Engineering Defined

    • 7.3 Quick Facts

    • 7.4 Chaos Monkey Origin Story

    • 7.5 Who is adopting Chaos Engineering?

    • 7.6 Myths of Chaos

    • Discussion - Instrumenting Gremlin

    • 7.7 Chaos Engineering Experiments

    • 7.8 GameDay Exercises

    • 7.9 Security Chaos Engineering

    • Video - Practical Chaos Engineering Adrian Hornsby (16:02)

      FREE PREVIEW
    • 7.10 Chaos Engineering Resources

    • Module 7 Quiz

  • 9

    Module 8: SRE is the Purest Form of DevOps

    • 8.1 Key Principles of SRE

    • 8.2 SREs help increase Reliability across the spectrum

    • 8.3 Metrics for Success

    • 8.4 SRE Execution models

    • 8.5 Culture and Behavioral Skills are key

    • 8.6 Implementation Roadmap

    • Discussion - Discuss a real SRE Case Story

    • Discussion - Why are SREs huge collaborators?

    • Exercise - Discuss NALSD learning from Module 3

    • 8.7 Case Study: Transformation after implementing SRE practices

    • Case Story - Airbnb

    • Module 8 Quiz

  • 10

    Resources

    • Catchpoint - SRE Report 2021

    • Why SRE Documents Matter

    • SRE Foundation Blueprint

    • SRE Practitioner Blueprint

    • SRE Practitioner Exam Requirement

    • SRE Practitioner Certification Sample

    • A Typology of Organisational Cultures

    • Quiz Answers

  • 11

    Course Feedback

    • Feedback - Your feedback is important to us. Let us know how you feel about this course.