Cache, Conference, Latency and Network - Technology Performance Pulse

USENIX SREcon APAC 2022: Computing Performance: What's on the Horizon

Brendan Gregg

FEBRUARY 28, 2023

My personal opinion is that I don't see a widespread need for more capacity given horizontal scaling and servers that can already exceed 1 Tbyte of DRAM; bandwidth is also helpful, but I'd be concerned about the increased latency for adding a hop to more memory. Or even on a plane.

Performance

Performance Latency Cache Virtualization

USENIX SREcon APAC 2022: Computing Performance: What's on the Horizon

Brendan Gregg

FEBRUARY 28, 2023

My personal opinion is that I don't see a widespread need for more capacity given horizontal scaling and servers that can already exceed 1 Tbyte of DRAM; bandwidth is also helpful, but I'd be concerned about the increased latency for adding a hop to more memory. Or even on a plane. It was a great privilege.

Performance

Performance Latency Cache Virtualization

Percentiles don’t work: Analyzing the distribution of response times for web services

Adrian Cockcroft

JANUARY 29, 2023

The mean and percentile measurements hide this structure, but the rest of this post will show how the structure can be measured and analyzed so that you can figure out a useful model of your system, understand what is driving the long tail of latencies and come up with better SLAs and measures of capacity.

Lambda

Lambda Latency Cache C++

Consistent caching mechanism in Titus Gateway

The Netflix TechBlog

NOVEMBER 3, 2022

We introduce a caching mechanism in the API gateway layer, allowing us to offload processing from singleton leader elected controllers without giving up strict data consistency and guarantees clients observe. We started seeing increased response latencies and leader servers running at dangerously high utilization.

Cache

Cache Latency Traffic Systems

A thorough introduction to bpftrace

Brendan Gregg

AUGUST 18, 2019

For example, iostat(1), or a monitoring agent, may tell you your average disk latency, but not the distribution of this latency. For smaller environments, it can be of more use helping eliminate latency outliers. Block I/O latency as a histogram. This traces block I/O, and shows latency as a power-of-2 histogram.

Latency

Latency C++ Cache Programming

This spring: High-Performance and Low-Latency C++ (Stockholm) and ACCU (Bristol)

Sutter's Mill

FEBRUARY 13, 2017

Tue-Thu Apr 25-27: High-Performance and Low-Latency C++ (Stockholm). On April 25-27, I’ll be in Stockholm (Kista) giving a three-day seminar on “High-Performance and Low-Latency C++.” If you’re interested in attending, please check out the links, and I look forward to meeting and re-meeting many of you there.

Latency

Latency C++ Hardware Performance

Supercomputing Predictions: Custom CPUs, CXL3.0, and Petalith Architectures

Adrian Cockcroft

JANUARY 20, 2023

Most of the top supercomputers are similar to Frontier, they use AMD or Intel CPUs, with GPU accelerators, and Cray Slingshot or Infiniband networks in a Dragonfly+ configuration. The four categories still make sense: kernel managed network sockets, user mode message passing libraries, coherent memory interfaces, and on-chip communication.

Architecture

Architecture Latency Benchmarking AWS

HTTP/3: Performance Improvements (Part 2)

Smashing Magazine

AUGUST 22, 2021

As we will see, QUIC and HTTP/3 indeed have great web performance potential, but mainly for users on slow networks. If your average visitor is on a fast cabled or cellular network, they probably won’t benefit from the new protocols all that much. Two-way latency is often called round-trip time (RTT). Congestion Control.

Performance

Performance Network Latency Servers

A persistent problem: managing pointers in NVM

The Morning Paper

DECEMBER 8, 2019

At the start of November I was privileged to attend HPTS (the High Performance Transaction Systems) conference in Asilomar. On the last morning of the conference Daniel Bittman presented some of the work being done in the context of the Twizzler OS project to explore new programming models for NVM. The Twizzler programming model.

Hardware

Hardware Programming Media Storage

A Decade of Dynamo: Powering the next wave of high-performance, internet-scale applications

All Things Distributed

OCTOBER 2, 2017

The success of our early results with the Dynamo database encouraged us to write Amazon's Dynamo whitepaper and share it at the 2007 ACM Symposium on Operating Systems Principles (SOSP conference), so that others in the industry could benefit.

Internet

Internet Internet AWS Performance

HTTP/3 From A To Z: Core Concepts (Part 1)

Smashing Magazine

AUGUST 9, 2021

You may have read some blog posts or heard conference talks on this topic and think you know the answers. It also, however, takes a full network round trip to complete before anything else can be done on a connection. and lower), this typically takes two network round trips. Robin Marx. 2021-08-09T11:00:00+00:00.

Transportation

Transportation Internet Internet Network

Technology Performance Pulse

USENIX SREcon APAC 2022: Computing Performance: What's on the Horizon

USENIX SREcon APAC 2022: Computing Performance: What's on the Horizon

Trending Sources

Percentiles don’t work: Analyzing the distribution of response times for web services

Consistent caching mechanism in Titus Gateway

A thorough introduction to bpftrace

This spring: High-Performance and Low-Latency C++ (Stockholm) and ACCU (Bristol)

Supercomputing Predictions: Custom CPUs, CXL3.0, and Petalith Architectures

HTTP/3: Performance Improvements (Part 2)

A persistent problem: managing pointers in NVM

A Decade of Dynamo: Powering the next wave of high-performance, internet-scale applications

HTTP/3 From A To Z: Core Concepts (Part 1)

Stay Connected