Site reliability engineering: 5 things you need to know

Dynatrace

As a discipline, SRE focuses on improving software system reliability across key categories including availability, performance, latency, efficiency, capacity, and incident response. SRE applies DevOps principles to developing systems and software that help increase site reliability and performance.

Trending Sources

What is a Site Reliability Engineer (SRE)?

Dotcom-Monitor

The term site reliability engineering first came into existence at Google in 2003 when a site reliability team was created. that are required to keep the software deployments live are running efficiently. Performance. At that time, the team was made up of software engineers. What Does a Site Reliability Engineer Do? Monitoring.

Supercomputing Predictions: Custom CPUs, CXL3.0, and Petalith Architectures

Adrian Cockcroft

on Myths and Legends of High Performance Computing  — it’s a somewhat light-hearted look at some of the same issues by the leader of the team that built the Fugaku system I mention below. Jack Dongarra talked about the scores, and pointed out the low efficiency on some important workloads.

Why Waits Alone Are Not Enough

SQL Performance

"Waitstats helps us identify performance-related counters. But waits information by itself isn't enough to accurately diagnose performance problems. The queues component of our methodology comes from Performance Monitor counters, which provide a view of system performance from a resource standpoint."

HammerDB Best Practice for PostgreSQL Performance and Scalability

HammerDB

This post gives a HOWTO guide on system configuration for achieving top levels of performance with the HammerDB PostgreSQL TPC-C test. Firstly for system choice a 2 socket system is optimal for PostgreSQL OLTP performance at the time of writing. Make sure that your CPU is configured for optimal performance. Setting cpu: 0.

The Back-to-Basics Readings of 2012 - All Things Distributed

All Things Distributed

Jul 4 - Leases: An efficient fault-tolerant mechanism for distributed file cache consistency , Gray, Cary, and David Cheriton, Vol. Harris, Alex Ho, Rolf Neugebauer, Ian Pratt, Andrew Warfield, in the Proceedings of the 19th ACM Symposium on Operating Systems Principles, October 19-22, 2003, Bolton Landing, NY USA. Gray and David R.