
Seamlessly Swapping the API backend of the Netflix Android app

The Netflix TechBlog

Over the course of this post, we will talk about our approach to this migration, the strategies that we employed, and the tools we built to support this. […] service with a composable JavaScript API that made downstream microservice calls, replacing the old Java API. Java…Script? It was a Node.js […] Replay Testing: Enter replay testing.


Data Reprocessing Pipeline in Asset Management Platform @Netflix

The Netflix TechBlog

Existing data was updated to be backward compatible without impacting running production traffic. The data sharding strategy in Elasticsearch was updated to provide low search latency (as described in the blog post). New Cassandra reverse indices were designed to support different sets of queries.




Building Netflix’s Distributed Tracing Infrastructure

The Netflix TechBlog

If we had an ID for each streaming session, then distributed tracing could easily reconstruct session failure by providing service topology, retry and error tags, and latency measurements for all service calls. We chose Open-Zipkin because it had better integrations with our Spring Boot-based Java runtime environment.


Migrating Netflix to GraphQL Safely

The Netflix TechBlog

The three strategies we will discuss today are AB Testing, Replay Testing, and Sticky Canaries. A single API team maintained both the Java implementation of the Falcor framework and the API Server. Let’s discuss the three testing strategies in further detail. To launch Phase 2 safely, we used Replay Testing and Sticky Canaries.


A Management Maturity Model for Performance

Alex Russell

This is a complex topic, but to borrow from a recent post, web performance expands access to information and services by reducing latency and variance across interactions in a session, with a particular focus on the tail of the distribution (P75+). Consistent performance matters just as much as low average latency.


Data Movement in Netflix Studio via Data Mesh

The Netflix TechBlog

Operational Reporting is a reporting paradigm specialized in covering high-resolution, low-latency data sets, serving detailed day-to-day activities and processes of a business domain. CDC events can also be sent to Data Mesh via a Java Client Producer Library.


DBLog: A Generic Change-Data-Capture Framework

The Netflix TechBlog

Nonetheless, we found a number of limitations that could not satisfy our requirements, e.g. stalling the processing of log events until a dump is complete, missing the ability to trigger dumps on demand, or implementations that block write traffic by using table locks. Blocking write traffic by locking tables. Writing events to any output.
