article thumbnail

Building Netflix’s Distributed Tracing Infrastructure

The Netflix TechBlog

Reconstructing a streaming session was a tedious and time consuming process that involved tracing all interactions (requests) between the Netflix app, our Content Delivery Network (CDN), and backend microservices. Our trace data collection agent transports traces to Mantis job cluster via the Mantis Publish library.

article thumbnail

Towards a Reliable Device Management Platform

The Netflix TechBlog

When a new hardware device is connected, the Local Registry detects and collects a set of information about it, such as networking information and ESN. Fault Tolerance If the underlying KafkaConsumer crashes due to ephemeral system or network events, it should be automatically restarted. million elements.

Latency 213
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

Plan Your Multi Cloud Strategy

Scalegrid

They can also bolster uptime and limit latency issues or potential downtimes. Adopting open-source standards and tools like Kubernetes lays the groundwork for creating adaptable and transportable solutions that promote application deployment and management in various cloud environments.

Strategy 130
article thumbnail

Snap: a microkernel approach to host networking

The Morning Paper

Snap: a microkernel approach to host networking Marty et al., This paper describes the networking stack, Snap , that has been running in production at Google for the last three years+. I’m jumping ahead a bit here, but the component of Snap which provides the transport and communications stack is called Pony Express.

Network 92
article thumbnail

Edgar: Solving Mysteries Faster with Observability

The Netflix TechBlog

This difference has substantial technological implications, from the classification of what’s interesting to transport to cost-effective storage (keep an eye out for later Netflix Tech Blog posts addressing these topics). Distributed tracing is the process of generating, transporting, storing, and retrieving traces in a distributed system.

Latency 296
article thumbnail

Edge Authentication and Token-Agnostic Identity Propagation

The Netflix TechBlog

A few years ago, we decided to address this complexity by spinning up a new initiative, and eventually a new team, to move the complex handling of user and device authentication, and various security protocols and tokens, to the edge of the network, managed by a set of centralized services, and a single team.

article thumbnail

HTTP/3 From A To Z: Core Concepts (Part 1)

Smashing Magazine

You’ve probably heard things like: “HTTP/3 is much faster than HTTP/2 when there is packet loss”, or “HTTP/3 connections have less latency and take less time to set up”, and probably “HTTP/3 can send data more quickly and can send more resources in parallel”. HTTP/2 versus HTTP/3 protocol stack comparison ( Large preview ). What Is QUIC?