Supporting Diverse ML Systems at Netflix

The Netflix TechBlog

Example use case: Content Knowledge Graph. Our knowledge graph of the entertainment world encodes relationships between titles, actors, and other attributes of a film or series, supporting all aspects of business at Netflix. In other cases, it is more convenient to share the results via a low-latency API.

Systems 226

Netflix at AWS re:Invent 2019

The Netflix TechBlog

Netflix shares how Amazon EC2 Auto Scaling allows its infrastructure to automatically adapt to changing traffic patterns in order to keep its audience entertained and its costs on target. Netflix runs dozens of stateful services on AWS under strict sub-millisecond tail-latency requirements, which brings unique challenges. Wednesday, December

AWS 100

Trending Sources

Why Traditional Monitoring Isn’t Enough for Modern Web Applications

Dotcom-Monitor

Users who rely on websites for their fundamental needs or entertainment will not tolerate even a few seconds of delay. Certain metrics must be considered for a user to have a hassle-free experience. Network latency. Network latency can be affected by factors such as connection time and Wi-Fi usage.

Migrating Critical Traffic At Scale with No Downtime: Part 2

The Netflix TechBlog

Behind these perfect moments of entertainment is a complex mechanism, with numerous gears and cogs working in harmony. By collecting and analyzing key performance metrics of the service over time, we can assess the impact of the new changes and determine if they meet the availability, latency, and performance requirements.

Traffic 279

Working at Netflix 2017

Brendan Gregg

A latency outlier issue that happened every 15 minutes. I'm also monitoring various chatrooms and metrics, and will jump in when needed. MSR analysis on an instance to show it was running at a lower clock rate. Analyzing slab memory growth on an instance with containers. Getting flame graphs to work in a new environment.

Java 75