article thumbnail

Site reliability done right: 5 SRE best practices that deliver on business objectives

Dynatrace

As a result, site reliability has emerged as a critical success metric for many organizations. Uptime Institute’s 2022 Outage Analysis report found that over 60% of system outages resulted in at least $100,000 in total losses, up from 39% in 2019. More than one in seven outages cost more than $1 million. availability.

article thumbnail

Dynatrace Managed release notes version 1.230

Dynatrace

This eliminates certain timeout issues with Windows Server 2019 at boot. Custom events for alerting using the Build tab and advanced query mode now apply the same metric dimension limits that are applied to Code -tab-based configurations. Real user traffic has been added to the world map on the browser monitor details page.

Metrics 205
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

Netflix at AWS re:Invent 2019

The Netflix TechBlog

Netflix shares how Amazon EC2 Auto Scaling allows its infrastructure to automatically adapt to changing traffic patterns in order to keep its audience entertained and its costs on target. Wednesday?—?December In this session, we present our human-centric design principles that enable the autonomy our engineers enjoy.

AWS 100
article thumbnail

Netflix at AWS re:Invent 2019

The Netflix TechBlog

Netflix shares how Amazon EC2 Auto Scaling allows its infrastructure to automatically adapt to changing traffic patterns in order to keep its audience entertained and its costs on target. Wednesday?—?December In this session, we present our human-centric design principles that enable the autonomy our engineers enjoy.

AWS 100
article thumbnail

Netflix at AWS re:Invent 2019

The Netflix TechBlog

Netflix shares how Amazon EC2 Auto Scaling allows its infrastructure to automatically adapt to changing traffic patterns in order to keep its audience entertained and its costs on target. In 2019, Netflix moved thousands of container hosts to bare metal. Wednesday?—?December

AWS 37
article thumbnail

Reducing High CPU on MySQL: a Case Study

Percona

The customer had noticed a linear high CPU usage in one of their MySQL instances and was not able to figure out why as there was no much traffic hitting the app. With that we decided to look into MySQL metrics. 2019-02-22T18:26:25 Creating triggers. 2019-02-22T18:27:14 Created triggers OK. Creating new table.

C++ 99
article thumbnail

Building Netflix’s Distributed Tracing Infrastructure

The Netflix TechBlog

We took a hybrid head-based sampling approach that allows for recording 100% of traces for a specific and configurable set of requests, while continuing to randomly sample traffic per the policy set at ingestion point. This allowed us to increase total storage capacity without adding a new Cassandra node to the existing cluster.