
Migrating Critical Traffic At Scale with No Downtime - Part 1

The Netflix TechBlog

The second phase involves migrating the traffic over to the new systems in a manner that mitigates the risk of incidents while continually monitoring and confirming that we are meeting crucial metrics tracked at multiple levels. It provides a good read on the availability and latency ranges under different production conditions.

Traffic 339

Interpreting A/B test results: false negatives and power

The Netflix TechBlog

Martin Tingley with Wenjing Zheng, Simon Ejdemyr, Stephanie Lane, and Colin McFarland. This is the fourth post in a multi-part series on how Netflix uses A/B tests to inform decisions and continuously innovate on our products. Need to catch up? Have a look at Part 1 (Decision Making at Netflix), Part 2 (What is an A/B Test?), ...

Testing 202

Seamlessly Swapping the API backend of the Netflix Android app

The Netflix TechBlog

To prepare ourselves for a big change in the tech stack of our endpoint, we decided to track metrics around the time taken to respond to queries. After some consultation with our backend teams, we determined the most effective way to group these metrics was by UI screen. For the migration, testing was a first-class citizen.

Latency 233

Bring Your Own Cloud (BYOC) vs. Dedicated Hosting at ScaleGrid

Scalegrid

Deploying your application and database on the same VPC also provides the lowest possible latency path. Use Follower Clusters to keep two independent database systems (of the same type) in sync so you can analyze, optimize, and test app performance for MySQL, PostgreSQL, and MongoDB® databases. Expert Tip. Security Groups. No problem.

Cloud 242

COVID-19 Hazard Analysis using STPA

Adrian Cockcroft

There are many possible failure modes, and each exercises a different aspect of resilience. A learning organization, disaster recovery testing, game days, and chaos engineering tools are all important components of a resilient system. Is the model of the controlled process looking at the right metrics and behaving safely?

Service level objectives: 5 SLOs to get started

Dynatrace

Certain SLOs can help organizations get started on measuring and delivering metrics that matter. Fitness app: The fitness app should offer a response time of less than 500 milliseconds for exercise tracking and data recording. This SLO enables a smooth and uninterrupted exercise-tracking experience.
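
As a rough illustration of how a response-time SLO like the one above might be checked, here is a minimal Python sketch. Only the 500-millisecond threshold comes from the excerpt; the function names, the sample data, and the 99% compliance target are assumptions made for this example and are not Dynatrace's method or API.

```python
from typing import Sequence

# From the excerpt: responses should take less than 500 milliseconds.
SLO_THRESHOLD_MS = 500.0


def slo_compliance(latencies_ms: Sequence[float],
                   threshold_ms: float = SLO_THRESHOLD_MS) -> float:
    """Return the fraction of requests strictly faster than the threshold."""
    if not latencies_ms:
        raise ValueError("need at least one latency sample")
    good = sum(1 for ms in latencies_ms if ms < threshold_ms)
    return good / len(latencies_ms)


def meets_slo(latencies_ms: Sequence[float],
              target: float = 0.99,  # hypothetical target, not from the excerpt
              threshold_ms: float = SLO_THRESHOLD_MS) -> bool:
    """Return True when the compliant fraction reaches the target."""
    return slo_compliance(latencies_ms, threshold_ms) >= target


if __name__ == "__main__":
    # Made-up latency samples in milliseconds, for illustration only.
    samples = [100.0, 200.0, 600.0, 499.0]
    print(slo_compliance(samples))
    print(meets_slo(samples))
```

In practice the target percentage and the evaluation window would come from the team's own SLO definition; the sketch only shows the arithmetic of counting requests under the threshold.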

Latency 179

Service level objective examples: 5 SLO examples for faster, more reliable apps

Dynatrace

Certain service-level objective examples can help organizations get started on measuring and delivering metrics that matter. Fitness app: The fitness app should offer a response time of less than 500 milliseconds for exercise tracking and data recording. This SLO enables a smooth and uninterrupted exercise-tracking experience.

Traffic 173