Latency, Metrics, Traffic and Video - Technology Performance Pulse

Implementing service-level objectives to improve software quality

Dynatrace

DECEMBER 27, 2022

By implementing service-level objectives, teams can avoid collecting and checking a huge amount of metrics for each service. First, it helps to understand that applications and all the services and infrastructure that support them generate telemetry data based on traffic from real users. So how can teams start implementing SLOs?

Software

Software Software Benchmarking Latency

Seamlessly Swapping the API backend of the Netflix Android app

The Netflix TechBlog

SEPTEMBER 8, 2020

As an example, to render the screen shown here, the app sends a query that looks like this: paths: ["videos", 80154610, "detail"] A path starts from a root object , and is followed by a sequence of keys that we want to retrieve the data for. Instead, it is part of a different path : [videos, <id>, similars].

Latency

Latency Cache Java Traffic

Data Reprocessing Pipeline in Asset Management Platform @Netflix

The Netflix TechBlog

MARCH 10, 2023

Existing data got updated to be backward compatible without impacting the existing running production traffic. Data Sharding strategy in elasticsearch is updated to provide low search latency (as described in blog post) Design of new Cassandra reverse indices to support different sets of queries.

Media

Media Traffic Processing Design

Migrating Critical Traffic At Scale with No Downtime?—?Part 2

The Netflix TechBlog

JUNE 13, 2023

Migrating Critical Traffic At Scale with No Downtime — Part 2 Shyam Gala , Javier Fernandez-Ivern , Anup Rokkam Pratap , Devang Shah Picture yourself enthralled by the latest episode of your beloved Netflix series, delighting in an uninterrupted, high-definition streaming experience. This is where large-scale system migrations come into play.

Traffic

Traffic Metrics Systems Strategy

Building Netflix’s Distributed Tracing Infrastructure

The Netflix TechBlog

OCTOBER 19, 2020

Investigating a video streaming failure consists of inspecting all aspects of a member account. If we had an ID for each streaming session then distributed tracing could easily reconstruct session failure by providing service topology, retry and error tags, and latency measurements for all service calls.

Infrastructure

Infrastructure Transportation Storage Open Source

Percentiles don’t work: Analyzing the distribution of response times for web services

Adrian Cockcroft

JANUARY 29, 2023

There is no way to model how much more traffic you can send to that system before it exceeds it’s SLA. I presented this analysis of response time distributions talk in 2016 — at Microxchg in Berlin ( video ). Mu is the mean of each component, the latency. I’ve been thinking about this for a long time.

Lambda

Lambda Latency Cache C++

Migrating Netflix to GraphQL Safely

The Netflix TechBlog

JUNE 14, 2023

We could also swap out the implementation of a field from GraphQL Shim to Video API with federation directives. So, we relied on higher-level metrics-based testing: AB Testing and Sticky Canaries. To determine customer impact, we could compare various metrics such as error rates, latencies, and time to render.

Traffic

Traffic Latency Cache Metrics

Netflix Video Quality at Scale with Cosmos Microservices

The Netflix TechBlog

NOVEMBER 2, 2021

Moorthy and Zhi Li Introduction Measuring video quality at scale is an essential component of the Netflix streaming pipeline. Perceptual quality measurements are used to drive video encoding optimizations , perform video codec comparisons , carry out A/B testing and optimize streaming QoE decisions to mention a few.

Media

Media Innovation Metrics Latency

Service level objectives: 5 SLOs to get started

Dynatrace

JUNE 1, 2023

Certain SLOs can help organizations get started on measuring and delivering metrics that matter. With this objective, the app ensures that users experience real-time feedback and immediate updates when logging workouts, recording sets and reps, or tracking performance metrics. Latency primarily focuses on the time spent in transit.

Latency

Latency Website Traffic Virtualization

Service level objective examples: 5 SLO examples for faster, more reliable apps

Dynatrace

JUNE 1, 2023

Certain service-level objective examples can help organizations get started on measuring and delivering metrics that matter. With this objective, the app ensures that users experience real-time feedback and immediate updates when logging workouts, recording sets and reps, or tracking performance metrics. The Apdex score of 0.85

Traffic

Traffic Latency Website Virtualization

Edgar: Solving Mysteries Faster with Observability

The Netflix TechBlog

SEPTEMBER 2, 2020

Edgar captures 100% of interesting traces , as opposed to sampling a small fixed percentage of traffic. Tracing as a foundation Logs, metrics, and traces are the three pillars of observability. Telltale provides Edgar with latency benchmarks that indicate if the individual trace’s latency is abnormal for this given service.

Latency

Latency Transportation Engineering Traffic

How We Optimized Performance To Serve A Global Audience

Smashing Magazine

AUGUST 3, 2023

It increases our visibility and enables us to draw a steady stream of organic (or “free”) traffic to our site. While paid marketing strategies like Google Ads play a part in our approach as well, enhancing our organic traffic remains a major priority. The higher our organic traffic, the more profitable we become as a company.

Performance

Performance Cache Traffic Metrics

Why Telcos Need a Real-Time Analytics Strategy

VoltDB

JUNE 9, 2017

Watch the Video of our Telco Expert. downgrading the streaming rate on video). These systems drive deep insights into network engineering and planning, capacity management, customer retention, and other key metrics. There is a class of real-time data that has not been used to benefit telco insights.

Analytics

Analytics Strategy Network Games

Why Telcos Need a Real-Time Analytics Strategy

VoltDB

JUNE 9, 2017

downgrading the streaming rate on video). These systems drive deep insights into network engineering and planning, capacity management, customer retention, and other key metrics. For passive services like email, this wouldn’t be as necessary. Will a subscriber really know that an email that arrives five seconds late is “late”?

Analytics

Analytics Strategy Network Games

Automating chaos experiments in production

The Morning Paper

JULY 4, 2019

Moreover, just like an A/B test, we’ll be collecting metrics while the experiment is underway and performing statistical analysis at the end to interpret the results. Two failure modes we focus on are a service becoming slower (increase in response latency) or a service failing outright (returning errors).

Latency

Latency Engineering Metrics Traffic

Page Simulator

The Netflix TechBlog

NOVEMBER 12, 2019

Page Simulation for Better Offline Metrics at Netflix by David Gevorkyan , Mehmet Yilmaz , Ajinkya More , Gaurav Agrawal , Richard Wellington , Vivek Kaushal , Prasanna Padmanabhan , Justin Basilico At Netflix, we spend a lot of effort to make it easy for our members to find content they will love.

Metrics

Metrics Government Systems Testing

Why Traditional Monitoring Isn’t Enough for Modern Web Applications

Dotcom-Montior

MAY 12, 2020

Modern web applications and pages, such as single-page applications, that put the user experience at its utmost priority are expected to be available 24/7, anywhere in the world, usable on any screen size, secure, flexible, scalable and be ready to meet traffic spikes on demand. Network latency. Network Latency. Connection time.

Monitoring

Monitoring Entertainment Hardware Latency

Page Simulator

The Netflix TechBlog

NOVEMBER 12, 2019

Page Simulation for Better Offline Metrics at Netflix by David Gevorkyan , Mehmet Yilmaz , Ajinkya More , Gaurav Agrawal , Richard Wellington , Vivek Kaushal , Prasanna Padmanabhan , Justin Basilico At Netflix, we spend a lot of effort to make it easy for our members to find content they will love.

Metrics

Metrics Government Systems Testing

Page Simulator

The Netflix TechBlog

NOVEMBER 12, 2019

Page Simulation for Better Offline Metrics at Netflix by David Gevorkyan , Mehmet Yilmaz , Ajinkya More , Gaurav Agrawal , Richard Wellington , Vivek Kaushal , Prasanna Padmanabhan , Justin Basilico At Netflix, we spend a lot of effort to make it easy for our members to find content they will love.

Metrics

Metrics Government Systems Testing

HTTP/3: Performance Improvements (Part 2)

Smashing Magazine

AUGUST 22, 2021

Because we are dealing with network protocols here, we will mainly look at network aspects, of which two are most important: latency and bandwidth. Latency can be roughly defined as the time it takes to send a packet from point A (say, the client) to point B (the server). Two-way latency is often called round-trip time (RTT).

Performance

Performance Network Latency Servers

HTTP/3 From A To Z: Core Concepts (Part 1)

Smashing Magazine

AUGUST 9, 2021

You’ve probably heard things like: “HTTP/3 is much faster than HTTP/2 when there is packet loss”, or “HTTP/3 connections have less latency and take less time to set up”, and probably “HTTP/3 can send data more quickly and can send more resources in parallel”. TLS, TCP, and QUIC handshake durations ( Large preview ).

Transportation

Transportation Internet Internet Network

Front-End Performance Checklist 2021

Smashing Magazine

JANUARY 11, 2021

LogRocket tracks key metrics, incl. Getting Ready: Planning And Metrics Performance culture, Core Web Vitals, performance profiles, CrUX, Lighthouse, FID, TTI, CLS, devices. Assets Optimizations Brotli, AVIF, WebP, responsive images, AV1, adaptive media loding, video compression, web fonts, Google fonts. Table Of Contents.

Performance

Performance Cache Media Metrics

Front-End Performance Checklist 2020 [PDF, Apple Pages, MS Word]

Smashing Magazine

JANUARY 6, 2020

Getting Ready: Planning And Metrics. Getting Ready: Planning And Metrics. You need a business stakeholder buy-in, and to get it, you need to establish a case study, or a proof of concept using the Performance API on how speed benefits metrics and Key Performance Indicators ( KPIs ) they care about. Table Of Contents. Quick Wins.

Performance

Performance Cache Network Metrics

Front-End Performance Checklist 2019 [PDF, Apple Pages, MS Word]

Smashing Magazine

JANUARY 7, 2019

Getting Ready: Planning And Metrics. Getting Ready: Planning And Metrics. You need a business stakeholder buy-in, and to get it, you need to establish a case study on how speed benefits metrics and Key Performance Indicators ( KPIs ) they care about. Table Of Contents. Setting Realistic Goals. Defining The Environment.

Performance

Performance Cache Metrics Network

Technology Performance Pulse

Implementing service-level objectives to improve software quality

Seamlessly Swapping the API backend of the Netflix Android app

Trending Sources

Data Reprocessing Pipeline in Asset Management Platform @Netflix

Migrating Critical Traffic At Scale with No Downtime?—?Part 2

Building Netflix’s Distributed Tracing Infrastructure

Percentiles don’t work: Analyzing the distribution of response times for web services

Migrating Netflix to GraphQL Safely

Netflix Video Quality at Scale with Cosmos Microservices

Service level objectives: 5 SLOs to get started

Service level objective examples: 5 SLO examples for faster, more reliable apps

Edgar: Solving Mysteries Faster with Observability

How We Optimized Performance To Serve A Global Audience

Why Telcos Need a Real-Time Analytics Strategy

Why Telcos Need a Real-Time Analytics Strategy

Automating chaos experiments in production

Page Simulator

Why Traditional Monitoring Isn’t Enough for Modern Web Applications

Page Simulator

Page Simulator

HTTP/3: Performance Improvements (Part 2)

HTTP/3 From A To Z: Core Concepts (Part 1)

Front-End Performance Checklist 2021

Front-End Performance Checklist 2020 [PDF, Apple Pages, MS Word]

Front-End Performance Checklist 2019 [PDF, Apple Pages, MS Word]

Stay Connected