Cache, Conference and Latency - Technology Performance Pulse

Consistent caching mechanism in Titus Gateway

The Netflix TechBlog

NOVEMBER 3, 2022

We introduce a caching mechanism in the API gateway layer, allowing us to offload processing from singleton leader elected controllers without giving up strict data consistency and guarantees clients observe. We started seeing increased response latencies and leader servers running at dangerously high utilization.

Cache

Cache Latency Traffic Systems

USENIX SREcon APAC 2022: Computing Performance: What's on the Horizon

Brendan Gregg

FEBRUARY 28, 2023

My personal opinion is that I don't see a widespread need for more capacity given horizontal scaling and servers that can already exceed 1 Tbyte of DRAM; bandwidth is also helpful, but I'd be concerned about the increased latency for adding a hop to more memory. Or even on a plane.

Performance

Performance Latency Cache Virtualization

This spring: High-Performance and Low-Latency C++ (Stockholm) and ACCU (Bristol)

Sutter's Mill

FEBRUARY 13, 2017

Tue-Thu Apr 25-27: High-Performance and Low-Latency C++ (Stockholm). On April 25-27, I’ll be in Stockholm (Kista) giving a three-day seminar on “High-Performance and Low-Latency C++.” If you’re interested in attending, please check out the links, and I look forward to meeting and re-meeting many of you there.

Latency

Latency C++ Hardware Performance

Percentiles don’t work: Analyzing the distribution of response times for web services

Adrian Cockcroft

JANUARY 29, 2023

The mean and percentile measurements hide this structure, but the rest of this post will show how the structure can be measured and analyzed so that you can figure out a useful model of your system, understand what is driving the long tail of latencies and come up with better SLAs and measures of capacity.
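
As a rough illustration of the point in this excerpt (not taken from the linked post), the Python sketch below builds a made-up two-mode latency sample, for instance cache hits near 2 ms and cache misses near 50 ms, and shows that the mean lands in a range where no request actually falls, while a coarse histogram exposes both modes. All numbers and names are assumptions chosen for the example.

```python
import statistics
from collections import Counter

# Hypothetical sample: 80 fast responses (about 2 ms) and 20 slow ones (about 50 ms).
fast = [2.0 + 0.1 * (i % 5) for i in range(80)]
slow = [50.0 + (i % 4) for i in range(20)]
latencies_ms = fast + slow

# Summary statistics each collapse the sample to a single number.
mean = statistics.fmean(latencies_ms)
median = statistics.median(latencies_ms)
p99 = statistics.quantiles(latencies_ms, n=100)[98]  # 99th percentile cut point

# A coarse histogram (10 ms wide bins, keyed by lower bound) keeps the shape.
bins = Counter(int(v // 10) * 10 for v in latencies_ms)
bins = dict(sorted(bins.items()))

print(f"mean={mean:.2f} ms  median={median:.2f} ms  p99={p99:.2f} ms")
for low, count in bins.items():
    print(f"[{low:>3}, {low + 10:>3}) ms  {count:>3}  {'#' * (count // 2)}")
```

The mean sits between the two clusters and the median and p99 each describe only one of them; only the binned view shows that there are two distinct populations to model.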

Lambda

Lambda Latency Cache C++

Current status, needs, and challenges in Heterogeneous and Composable Memory from the HCM workshop (HPCA’23)

ACM Sigarch

MAY 31, 2023

There are three common mechanisms to access remote memory: modifying applications, modifying virtual memory, and hardware-level cache coherence support. ... even lowered the latency by introducing a multi-headed device that collapses switches and memory controllers. The recently announced CXL3.0 ...

Latency

Latency Hardware Cache Architecture

Supercomputing Predictions: Custom CPUs, CXL3.0, and Petalith Architectures

Adrian Cockcroft

JANUARY 20, 2023

Figure: Summary of types of applications for supercomputers. We spent most of the conference walking around the expo and talking to people, rather than attending technical sessions, and asked several people why other vendors weren’t copying Fugaku, and what improvements in architecture were on the horizon.

Architecture

Architecture Latency Benchmarking AWS

How To Add eBPF Observability To Your Product

Brendan Gregg

JULY 2, 2021

biolatency: disk I/O latency histogram (heat map). cachestat: file system cache statistics (line charts). runqlat: CPU scheduler latency (heat map). If you really want to do this and have the time, you certainly can (you'll probably wind up at tracing conferences and bumping into me: See you at Linux Plumber's or the Tracing Summit!)

Latency

Latency Cache Energy Systems

A thorough introduction to bpftrace

Brendan Gregg

AUGUST 18, 2019

For example, iostat(1), or a monitoring agent, may tell you your average disk latency, but not the distribution of this latency. For smaller environments, it can be of more use helping eliminate latency outliers. Block I/O latency as a histogram. This traces block I/O, and shows latency as a power-of-2 histogram.
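
To make the idea of a power-of-2 latency histogram concrete, here is a small Python sketch that buckets samples so that each bucket covers [2^k, 2^(k+1)). It is not bpftrace code; the sample values and function names are hypothetical and only illustrate why a histogram shows outliers that an average hides.

```python
from collections import Counter

def log2_bucket(value_us: int) -> int:
    """Return the lower bound of the power-of-2 bucket that contains value_us."""
    if value_us <= 0:
        return 0
    return 1 << (value_us.bit_length() - 1)

def log2_histogram(samples_us):
    """Count samples per power-of-2 bucket, ordered by bucket lower bound."""
    counts = Counter(log2_bucket(v) for v in samples_us)
    return dict(sorted(counts.items()))

def render(hist):
    """Format the histogram as text rows with a bar scaled to the largest bucket."""
    peak = max(hist.values())
    rows = []
    for low, count in hist.items():
        high = low * 2 if low else 1
        bar = "@" * max(1, round(20 * count / peak))
        rows.append(f"[{low}, {high})".ljust(16) + f"{count:>6} |{bar}")
    return "\n".join(rows)

# Hypothetical block I/O latencies in microseconds: mostly fast, two slow outliers.
samples = [90, 110, 120, 130, 250, 300, 4100, 9000]
hist = log2_histogram(samples)

print(f"average: {sum(samples) / len(samples):.1f} us")
print(render(hist))
```

With these made-up samples the average is pulled well above what most requests experienced, while the bucketed view separates the fast majority from the two outliers.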

Latency

Latency C++ Cache Programming

A persistent problem: managing pointers in NVM

The Morning Paper

DECEMBER 8, 2019

At the start of November I was privileged to attend the HPTS (High Performance Transaction Systems) conference in Asilomar. On the last morning of the conference Daniel Bittman presented some of the work being done in the context of the Twizzler OS project to explore new programming models for NVM. The Twizzler programming model.

Hardware

Hardware Programming Media Storage

Trade-offs under pressure: heuristics and observations of teams resolving internet service outages (Part II)

The Morning Paper

JANUARY 23, 2020

1:00pm Eastern Standard Time: the Personalisation / Homepage Team for Etsy are in a conference room kicking off a lunch-and-learn session on the personalised feed feature on the Etsy.com homepage. 1:18pm: a key observation was made that an API call to populate the homepage sidebar saw a huge jump in latency.

Internet

Internet Internet Cache Engineering

A Decade of Dynamo: Powering the next wave of high-performance, internet-scale applications

All Things Distributed

OCTOBER 2, 2017

The success of our early results with the Dynamo database encouraged us to write Amazon's Dynamo whitepaper and share it at the 2007 ACM Symposium on Operating Systems Principles (SOSP conference), so that others in the industry could benefit.

Internet

Internet Internet AWS Performance

On HTTPS and Hard Questions

Tim Kadlec

AUGUST 14, 2018

The area he was in was served by satellite internet access, and experienced significant latency (a floor of 506 milliseconds) and packet loss (between 50% and 80% was typical). To counter this, the school he was visiting set up their own local caching server. But, as he explains, this approach falls apart when HTTPS gets involved.

Cache

Cache Mobile Servers Latency

HTTP/3: Performance Improvements (Part 2)

Smashing Magazine

AUGUST 22, 2021

Because we are dealing with network protocols here, we will mainly look at network aspects, of which two are most important: latency and bandwidth. Latency can be roughly defined as the time it takes to send a packet from point A (say, the client) to point B (the server). Two-way latency is often called round-trip time (RTT).

Performance

Performance Network Latency Servers

How To Add eBPF Observability To Your Product

Brendan Gregg

JULY 2, 2021

Here are the top ten tools you can run and present as a generic BPF observability dashboard, along with suggested visualizations (table columns: Tool, Shows, Visualization). biolatency: disk I/O latency histogram (heat map). cachestat: file system cache statistics (line charts). runqlat: CPU scheduler latency (heat map).

Open Source

Open Source Latency Cache Energy

Revisiting “Serverless Architectures”

The Symphonia

MAY 22, 2018

I was a little restricted in my thinking the first time around and I’ve come to see FaaS as something not quite stateless, since caching state in a Lambda instance that might stick around for 5 hours is a perfectly reasonable idea. I also rewrote the section on Startup Latency since Cold Starts are one of the big “FUD” areas of Serverless.

Serverless

Serverless Architecture Lambda Azure

Invited Talk at SuperComputing 2016!

John McCalpin

OCTOBER 16, 2016

“Memory Bandwidth and System Balance in HPC Systems”. If you are planning to attend the SuperComputing 2016 conference in Salt Lake City next month, be sure to reserve a spot on your calendar for my talk on Wednesday afternoon (4:15pm-5:00pm).

Architecture

Architecture Systems Technology Technology

HTTP/3 From A To Z: Core Concepts (Part 1)

Smashing Magazine

AUGUST 9, 2021

You may have read some blog posts or heard conference talks on this topic and think you know the answers. However, many other devices are sitting between the client and the server that also have their own TCP code on board (examples include firewalls, load balancers, routers, caching servers, proxies, etc.). Robin Marx.

Transportation

Transportation Internet Internet Network
