article thumbnail

Site reliability done right: 5 SRE best practices that deliver on business objectives

Dynatrace

Keeping pace with modern digital transformation requires ensuring that applications are responsive, resilient, and always available amid increased complexity. Uptime Institute’s 2022 Outage Analysis report found that over 60% of system outages resulted in at least $100,000 in total losses, up from 39% in 2019.

article thumbnail

Netflix at AWS re:Invent 2019

The Netflix TechBlog

December 2 1pm-2pm CMP 326-R Capacity Management Made Easy with Amazon EC2 Auto Scaling Vadim Filanovsky , Senior Performance Engineer & Anoop Kapoor, AWS Abstract :Amazon EC2 Auto Scaling offers a hands-free capacity management experience to help customers maintain a healthy fleet, improve application availability, and reduce costs.

AWS 100
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

Netflix at AWS re:Invent 2019

The Netflix TechBlog

December 2 1pm-2pm CMP 326-R Capacity Management Made Easy with Amazon EC2 Auto Scaling Vadim Filanovsky , Senior Performance Engineer & Anoop Kapoor, AWS Abstract :Amazon EC2 Auto Scaling offers a hands-free capacity management experience to help customers maintain a healthy fleet, improve application availability, and reduce costs.

AWS 100
article thumbnail

ScyllaDB Trends – How Users Deploy The Real-Time Big Data Database

Scalegrid

ScyllaDB offers significantly lower latency which allows you to process a high volume of data with minimal delay. percentile latency is up to 11X better than Cassandra on AWS EC2 bare metal. This number is more inline with our recent 2019 Open Source Database Trends Report where 56.9% The remaining 13.0%

Big Data 187
article thumbnail

The Netflix Cosmos Platform

The Netflix TechBlog

It supports both high throughput services that consume hundreds of thousands of CPUs at a time, and latency-sensitive workloads where humans are waiting for the results of a computation. The subsystems all communicate with each other asynchronously via Timestone, a high-scale, low-latency priority queuing system. Warm capacity.

article thumbnail

Keeping Netflix Reliable Using Prioritized Load Shedding

The Netflix TechBlog

The computation is done as a first step so that it is available for the rest of the request lifecycle. Those two metrics are approximate indicators of failures and latency. Requests with higher priority will retry more aggressively than lower ones, also increasing streaming availability.

Traffic 252
article thumbnail

Connecting MongoDB to Ruby with Self-Signed Certificates for SSL

Scalegrid

The connection options available for the driver are documented here , and the options we will need are: :ssl. First, find and copy your MongoDB connection string from the cluster details page on the ScaleGrid console: The CA certificate file is also available for download from the cluster details page. ssl_verify. ssl_ca_cert.

C++ 100