SRE Learning Materials



Site Reliability Engineering (SRE) is a relatively young discipline that has grown quickly in recent years. SRE teams are responsible for the reliability, performance, and efficiency of complex systems. To become an SRE, you need a solid grounding in computer science fundamentals and a deep knowledge of distributed systems, networking, and cloud infrastructure. This post lists the learning materials available at the School of SRE that can help you build those skills.

Online Courses

The School of SRE offers several self-paced online courses on SRE topics. Each combines lectures, hands-on labs, and quizzes to reinforce what you learn. Available courses include “Introduction to SRE”, “Distributed Systems”, “Cloud Infrastructure”, “Networking”, and “Monitoring and Alerting”. They suit learners who prefer a structured experience and want to earn a certificate on completion.


Books & Courses

  • (Book) Site Reliability Engineering
  • (Book) The Site Reliability Workbook
  • (Book) Building Secure and Reliable Systems
  • (Course) Intro to DevOps
  • (Course) Google Cloud Platform for Systems Operations
  • (Course) Measuring and Managing Reliability

Operating Systems

  • (Course) Introduction to Operating Systems
  • (Course) Advanced Operating Systems


  • (Tutorial) Ansible
  • (Course) Terraform

Distributed Systems

  • (Tutorial) Introduction to Distributed Systems Design


  • (Book) Understanding Linux Network Internals

Programming Languages


  • (Book) Learn Python 3 the Hard Way
  • (Course) Developing Scalable Apps in Python


  • (Book) The Go Programming Language
  • (Webinar) Go Language for Ops and Site Reliability Engineering
  • (Hands On)

Production Web App

  • (Tutorial) Building for Production: Web Applications
  • (Book) Production-Ready Microservices

Monitoring and Logging

  • (Course) Monitoring and Alerting with Prometheus
  • (Book) Prometheus: Up & Running

Continuous Integration | Continuous Delivery

  • (Course) Continuous Deliver Better Software


  • (Course) Docker for DevOps

Web Servers


  • (Course) Nginx Fundamentals

Cluster Management


  • (Tutorial) Kubernetes Bootcamp
  • (Course) Scalable Microservices with Kubernetes
  • (Tutorial) Kubernetes Tutorial for Beginners


Amazon AWS

  • (Tutorial) Amazon AWS


  • Post-Mortem Template



DevOps | SRE Roadmap

  • DevOps Roadmap

SRE Interview