article thumbnail

Migrating Critical Traffic At Scale with No Downtime?—?Part 1

The Netflix TechBlog

The first phase involves validating functional correctness, scalability, and performance concerns and ensuring the new systems’ resilience before the migration. These include Quality-of-Experience(QoE) measurements at the customer device level, Service-Level-Agreements (SLAs), and business-level Key-Performance-Indicators(KPIs).

Traffic 339
article thumbnail

Interpreting A/B test results: false negatives and power

The Netflix TechBlog

We then used simple thought exercises based on flipping coins to build intuition around false positives and related concepts such as statistical significance, p-values, and confidence intervals. As a result, if the test treatment results in a small reduction in the latency metric, it’s hard to successfully identify?

Testing 202
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

Seamlessly Swapping the API backend of the Netflix Android app

The Netflix TechBlog

For each route we migrated, we wanted to make sure we were not introducing any regressions: either in the form of missing (or worse, wrong) data, or by increasing the latency of each endpoint. Being able to canary a new route let us verify latency and error rates were within acceptable limits. This meant that data that was static (e.g.

Latency 233
article thumbnail

Bring Your Own Cloud (BYOC) vs. Dedicated Hosting at ScaleGrid

Scalegrid

While this is a good way to get a rough estimate, your monthly cloud costs will indeed vary based on the amount of backups performed and your data transfer activity. Deploying your application and database on the same VPC also provides the lowest possible latency path. ScaleGrid BYOC Pricing: $232/month. Reserved Instances. Expert Tip.

Cloud 242
article thumbnail

Service level objectives: 5 SLOs to get started

Dynatrace

Service level objectives (SLOs) provide a powerful framework for measuring and maintaining software performance, reliability, and user satisfaction. SLOs are a valuable tool for organizations to ensure the health and performance of their applications. This SLO enables a smooth and uninterrupted exercise-tracking experience.

Latency 176
article thumbnail

Service level objective examples: 5 SLO examples for faster, more reliable apps

Dynatrace

Service level objectives (SLOs) provide a powerful framework for measuring and maintaining software performance, reliability, and user satisfaction. Teams can build on these SLO examples to improve application performance and reliability. This SLO enables a smooth and uninterrupted exercise-tracking experience.

Traffic 174
article thumbnail

COVID-19 Hazard Analysis using STPA

Adrian Cockcroft

There are many possible failure modes, and each exercises a different aspect of resilience. Consider how controls could degrade over time, including change management, performance audits and how incident reviews could surface anomalies and problems with the system design. Memorial at Hawaiian native burial ground, Kapalua, Maui.