article thumbnail

USENIX LISA2021 Computing Performance: On the Horizon

Brendan Gregg

This was a chance to talk about other things I've been working on, such as the present and future of hardware performance. I also wrote about these topics in detail for my recent [Systems Performance 2nd Edition] book. Note that my predictions in this talk may be wrong, but they should be thought provoking. Ford, et al., “TCP

article thumbnail

A Brief Guide of xPU for AI Accelerators

ACM Sigarch

The new DGX-2, which achieves 2 petaFLOPS in the system, combines 16 fully interconnected GPUS with 10x the deep learning performance. HPU: Holographic Processing Unit (HPU) is the specific hardware of Microsoft’s Hololens. SPU: Stream Processing Unit (SPU) is related to the specialized hardware to process the data streams of video.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

The Return of the Frame Pointers

Brendan Gregg

The problem is that this system has a default libc that has been compiled without frame pointers, so any stack walking stops at the libc layer, producing a partial stack that's missing the application frames. and we may have been flying close to the edge of hardware cache warmth, where adding a bit more instructions caused a big drop.

Java 145
article thumbnail

The Speed of Time

Brendan Gregg

CLI tools The Cassandra systems were EC2 virtual machine (Xen) instances. As a Xen guest, this profile was gathered using perf(1) and the kernel's software cpu-clock soft interrupts, not the hardware NMI. Microbenchmark os::javaTimeMillis() on both systems. Running this on the two systems saw similar results.

Speed 126
article thumbnail

AWS EC2 Virtualization 2017: Introducing Nitro

Brendan Gregg

Hardware virtualization for cloud computing has come a long way, improving performance using technologies such as VT-x, SR-IOV, VT-d, NVMe, and APICv. The latest AWS hypervisor, Nitro, uses everything to provide a new hardware-assisted hypervisor that is easy to use and has near bare-metal performance. I'd expect between 0.1%

article thumbnail

USENIX SREcon APAC 2022: Computing Performance: What's on the Horizon

Brendan Gregg

This talk originated from my updates to [Systems Performance 2nd Edition], and this was the first time I've given this talk in person! CXL in a way allows a custom memory controller to be added to a system, to increase memory capacity, bandwidth, and overall performance. Ford, et al., “TCP

article thumbnail

Safari 16.4 Is An Admission

Alex Russell

In leaner years (2012-2015), a single Fall release was all we'd get. From outright misstatements about a competitor's security, to claims that performance differences in hardware show Safari is faster, to [geographic brinksmanship](/2022/02/minimum-standards/), the confident bluster hasn't gone down particularly well.

Energy 79