Latency, Software and Transportation - Technology Performance Pulse

Building Netflix’s Distributed Tracing Infrastructure

The Netflix TechBlog

OCTOBER 19, 2020

If we had an ID for each streaming session then distributed tracing could easily reconstruct session failure by providing service topology, retry and error tags, and latency measurements for all service calls. Our trace data collection agent transports traces to Mantis job cluster via the Mantis Publish library. What’s next?

Infrastructure

Infrastructure Transportation Storage Open Source

Towards a Reliable Device Management Platform

The Netflix TechBlog

AUGUST 30, 2021

By Benson Ma , Alok Ahuja Introduction At Netflix, hundreds of different device types, from streaming sticks to smart TVs, are tested every day through automation to ensure that new software releases continue to deliver the quality of the Netflix experience that our customers enjoy. In this blog post, we will focus on the latter feature set.

Latency

Latency Traffic Transportation Hardware

Expanding the Cloud – The Second AWS GovCloud (US) Region, AWS GovCloud (US-East)

All Things Distributed

NOVEMBER 12, 2018

The AWS GovCloud (US-East) Region is located in the eastern part of the United States, providing customers with a second isolated Region in which to run mission-critical workloads with lower latency and high availability. By using AWS, they have been able to reduce the time to build, test, and scale software from weeks to hours.

AWS

AWS Healthcare Cloud Government

Plan Your Multi Cloud Strategy

Scalegrid

MARCH 22, 2024

They can also bolster uptime and limit latency issues or potential downtimes. It’s important to ensure the bells and whistles of any software-as-a-service (SaaS) they offer can support where you aim to take your business, keeping your strategy tight and on track.

Strategy

Strategy Cloud Government Innovation

Unlocking Enterprise systems using voice

All Things Distributed

MARCH 12, 2018

The availability of large scale voice training data, the advances made in software with processing engines such as Caffe, MXNet and Tensorflow, and the rise of massively parallel compute engines with low-latency memory access, such as the Amazon EC2 P3 instances have made voice processing at scale a reality.

Systems

Systems AWS Games Transportation

Edgar: Solving Mysteries Faster with Observability

The Netflix TechBlog

SEPTEMBER 2, 2020

This difference has substantial technological implications, from the classification of what’s interesting to transport to cost-effective storage (keep an eye out for later Netflix Tech Blog posts addressing these topics). Distributed tracing is the process of generating, transporting, storing, and retrieving traces in a distributed system.

Latency

Latency Transportation Engineering Traffic

Snap: a microkernel approach to host networking

The Morning Paper

NOVEMBER 10, 2019

It’s been clear for a while that software designed explicitly for the data center environment will increasingly want/need to make different design trade-offs to e.g. general-purpose systems software that you might install on your own machines. The desire for CPU efficiency and lower latencies is easy to understand. Enter Google!

Network

Network Transportation Latency Entertainment

Edge Authentication and Token-Agnostic Identity Propagation

The Netflix TechBlog

FEBRUARY 9, 2021

enum Source { NONE = 0 ; COOKIE = 1 ; COOKIE_INSECURE = 2; MSL = 3 ; PARTNER_TOKEN = 4 ; … } enum PassportAuthenticationLevel { LOW = 1 ; // untrusted transport HIGH = 2 ; // secure tokens over TLS HIGHEST = 3 ; // MSL or user credentials } Downstream applications can use these values to make Authorization and/or user experience decisions.

Architecture

Architecture Latency Servers Website

Välkommen till Stockholm – An AWS Region is coming to the Nordics

All Things Distributed

APRIL 4, 2017

This enables customers to serve content to their end users with low latency, giving them the best application experience. In 2011, AWS opened a Point of Presence (PoP) in Stockholm to enable customers to serve content to their end users with low latency. As well as AWS Regions, we also have 24 AWS Edge Network Locations in Europe.

AWS

AWS Airlines Latency Games

RPCValet: NI-driven tail-aware balancing of µs-scale RPCs

The Morning Paper

MAY 19, 2019

Last week we learned about the [increased tail-latency sensitivity of microservices based applications with high RPC fan-outs. Seer uses estimates of queue depths to mitigate latency spikes on the order of 10-100ms, in conjunction with a cluster manager. So what we have here is a glimpse of the limits for low-latency RPCs under load.

Latency

Latency Hardware Network Architecture

Why I hate MPI (from a performance analysis perspective)

John McCalpin

AUGUST 1, 2018

This will typically include environment variables that can influence the behavior of the MPI runtime, and might include environment variables that can influence the behavior of the lower-level shared-memory transport and/or network hardware interfaces. The processor hardware available to support shared-memory transport.

Hardware

Hardware Transportation Performance Latency

Transforming enterprise integration with reactive streams

O'Reilly Software

MARCH 7, 2018

Software today is not typically a single program—something that is executed by an operator or user, producing a result to that person—but rather a service : something that runs for the benefit of its consumers, a provider of value. enum Transport {. Let’s dive into this concept for a bit. The most common programming task in the world.

Transportation

Transportation Java Programming Architecture

HTTP/3: Performance Improvements (Part 2)

Smashing Magazine

AUGUST 22, 2021

Because we are dealing with network protocols here, we will mainly look at network aspects, of which two are most important: latency and bandwidth. Latency can be roughly defined as the time it takes to send a packet from point A (say, the client) to point B (the server). Two-way latency is often called round-trip time (RTT).

Performance

Performance Network Latency Servers

Talk Video: Welcome to the Jungle (60 min version + Q&A)

Sutter's Mill

JUNE 21, 2012

One of the slides I omitted to shorten this version of the talk highlighted that there are actually two issues when you go from “Disjoint (tightly coupled)” to “Disjoint (loosely coupled)”: reliability and latency , and both are important. (I I also mentioned this in the original WttJ article this is based on; just search for “reliability.”).

Latency

Latency Transportation Hardware Education

HTTP/3: Practical Deployment Options (Part 3)

Smashing Magazine

SEPTEMBER 6, 2021

Finally, not inlining resources has an added latency cost because the file needs to be requested. As such, many firewall vendors currently recommend blocking QUIC until they can update their software. First, in part 1 , we discussed that HTTP/3 was needed mainly because of the new underlying QUIC transport protocol.

Network

Network Servers Cache Traffic

Software Testing Trends 2021 – What can we expect?

Testsigma

FEBRUARY 12, 2021

The implementation of emerging technologies has helped improve the process of software development, testing, design and deployment. Any organization recruits experienced testing agencies to comply with their specifications for software testing. Here is the list of software testing trends you need to look out for in 2021.

Artificial Intelligence

Artificial Intelligence Software Software IoT

Front-End Performance Checklist 2021

Smashing Magazine

JANUARY 11, 2021

Estimated Input Latency tells us if we are hitting that threshold, and ideally, it should be below 50ms. Designed for the modern web, it responds to actual congestion, rather than packet loss like TCP does, it is significantly faster , with higher throughput and lower latency — and the algorithm works differently.

Performance

Performance Cache Media Metrics

Front-End Performance Checklist 2020 [PDF, Apple Pages, MS Word]

Smashing Magazine

JANUARY 6, 2020

Estimated Input Latency tells us if we are hitting that threshold, and ideally, it should be below 50ms. Designed for the modern web, it responds to actual congestion, rather than packet loss like TCP does, it is significantly faster , with higher throughput and lower latency — and the algorithm works differently.

Performance

Performance Cache Network Metrics

Front-End Performance Checklist 2019 [PDF, Apple Pages, MS Word]

Smashing Magazine

JANUARY 7, 2019

Estimated Input Latency tells us if we are hitting that threshold, and ideally, it should be below 50ms. Ah, and don’t use JPEG-XR on the web — "the processing of decoding JPEG-XRs software-side on the CPU nullifies and even outweighs the potentially positive impact of byte size savings, especially in the context of SPAs".

Performance

Performance Cache Metrics Network

Technology Performance Pulse

Building Netflix’s Distributed Tracing Infrastructure

Towards a Reliable Device Management Platform

Trending Sources

Expanding the Cloud – The Second AWS GovCloud (US) Region, AWS GovCloud (US-East)

Plan Your Multi Cloud Strategy

Unlocking Enterprise systems using voice

Edgar: Solving Mysteries Faster with Observability

Snap: a microkernel approach to host networking

Edge Authentication and Token-Agnostic Identity Propagation

Välkommen till Stockholm – An AWS Region is coming to the Nordics

RPCValet: NI-driven tail-aware balancing of µs-scale RPCs

Why I hate MPI (from a performance analysis perspective)

Transforming enterprise integration with reactive streams

HTTP/3: Performance Improvements (Part 2)

Talk Video: Welcome to the Jungle (60 min version + Q&A)

HTTP/3: Practical Deployment Options (Part 3)

Software Testing Trends 2021 – What can we expect?

Front-End Performance Checklist 2021

Front-End Performance Checklist 2020 [PDF, Apple Pages, MS Word]

Front-End Performance Checklist 2019 [PDF, Apple Pages, MS Word]

Stay Connected