article thumbnail

Site reliability engineering: 5 things you need to know

Dynatrace

What is site reliability engineering? Site reliability engineering (SRE) is the practice of applying software engineering principles to operations and infrastructure processes to help organizations create highly reliable and scalable software systems. SRE bridges the gap between Dev and Ops teams.

article thumbnail

Site reliability engineering: 5 things to you need to know

Dynatrace

Site reliability engineering (SRE) is the practice of applying software engineering principles to operations and infrastructure processes to help organizations create highly reliable and scalable software systems. ” According to Google, “SRE is what you get when you treat operations as a software problem.”

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

What is a Site Reliability Engineer (SRE)?

Dotcom-Montior

A site reliability engineer, or SRE, is a role that that encompasses aspects of both software engineering and operations/infrastructure. The term site reliability engineering first came into existence at Google in 2003 when a site reliability team was created. What are Some Common SRE Responsibilities?

article thumbnail

The Back-to-Basics Readings of 2012 - All Things Distributed

All Things Distributed

Harris, Alex Ho, Rolf Neugebauer, Ian Pratt, Andrew Warfield, in the Proceedings of the 19th ACM Symposium on Operating Systems Principles, October 19-22, 2003, Bolton Landing, NY USA. Sep 7 - Adaptive load sharing in homogeneous distributed systems , D Eager, ED Lazowska and J Zahorjan - IEEE transactions on software engineering, 1986.