Migrating Critical Traffic At Scale with No Downtime - Part 1

The Netflix TechBlog

Shyam Gala, Javier Fernandez-Ivern, Anup Rokkam Pratap, Devang Shah

Hundreds of millions of customers tune into Netflix every day, expecting an uninterrupted and immersive streaming experience. This approach has a handful of benefits.

Migrating Netflix to GraphQL Safely

The Netflix TechBlog

The control group’s traffic utilized the legacy Falcor stack, while the experiment population leveraged the new GraphQL client and was directed to the GraphQL Shim. To determine customer impact, we could compare various metrics such as error rates, latencies, and time to render.

Trending Sources

Site reliability done right: 5 SRE best practices that deliver on business objectives

Dynatrace

The Dynatrace 2022 Global CIO Report found that 71% of top IT executives say the explosion of data produced by cloud-native technology stacks is beyond human ability to manage, and more than three-quarters say their IT environment changes once every minute or less. Visibility and automation are two of the most important SRE tools.

Build and operate multicloud FaaS with enhanced, intelligent end-to-end observability

Dynatrace

As companies accelerate digital transformation, they implement modern cloud technologies like serverless functions. According to Flexera, serverless functions are the number one technology evaluated by enterprises and one of the top five cloud technologies in use at enterprises. And serverless support is a core capability.

Rebuilding Netflix Video Processing Pipeline with Microservices

The Netflix TechBlog

This architecture shift greatly reduced the processing latency and increased system resiliency. We expanded pipeline support to serve our studio/content-development use cases, which had different latency and resiliency requirements as compared to the traditional streaming use case. divide the input video into small chunks 2.

SLOs done right: how DevOps teams can build better service-level objectives

Dynatrace

Monitors signals: The first attribute of a good SLO is the ability to monitor the four “golden signals”: latency, traffic, error rates, and resource saturation. Bridges the business gap: Good SLOs should also help bridge the gap between chief executive officers’ concerns and those of chief technology officers.

Dynatrace Managed turnkey Premium High Availability for globally distributed data centers (Early Adopter)

Dynatrace

The network latency between cluster nodes should be around 10 ms or less. We’re therefore proud to announce Premium High Availability for Dynatrace Managed to address exactly this need in a completely turnkey manner without any external load balancing or replication technologies. Minimized cross-data center network traffic.