article thumbnail

The Back-to-Basics Readings of 2012 - All Things Distributed

All Things Distributed

Werner Vogels weblog on building scalable and robust distributed systems. The Back-to-Basics Readings of 2012. By Werner Vogels on 18 December 2012 10:00 PM. I am pretty sure some if not all of these papers deserved to be elected to the hall of fame of best papers in distributed systems. All Things Distributed.

article thumbnail

Observability engineering: Getting Prometheus metrics right for Kubernetes with Dynatrace and Kepler

Dynatrace

For busy site reliability engineers, ensuring system reliability, scalability, and overall health is an imperative that’s getting harder to achieve in ever-expanding, cloud-native, container-based environments. To get a more granular look into telemetry data, many analysts rely on custom metrics using Prometheus.

Metrics 180
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

Stuff The Internet Says On Scalability For March 22nd, 2019

High Scalability

Let them dogfood the software patch. skamille : I worry that the cloud is just moving us back to a world of proprietary software. µs of replication latency on lossy Ethernet, which is faster than or comparable to specialized replication systems that use programmable switches, FPGAs, or RDMA.". We achieve 5.5

Internet 134
article thumbnail

USENIX LISA2021 Computing Performance: On the Horizon

Brendan Gregg

I also wrote about these topics in detail for my recent [Systems Performance 2nd Edition] book. TCP Extensions for Multipath Operation with Multiple Addresses,” [link] Mar 2020 - [Gregg 20] Brendan Gregg, “Systems Performance: Enterprise and the Cloud, Second Edition,” Addison-Wesley, 2020 - [Hruska 20] Joel Hruska, “Intel Demos PCIe 5.0

article thumbnail

Back-to-Basics Weekend Reading - Staged Event-Driven Architecture

All Things Distributed

Werner Vogels weblog on building scalable and robust distributed systems. By Werner Vogels on 17 August 2012 07:00 PM. I am in São Paolo, Brazil for the 2012 AWS Latin America Summit and for The Next Web Latin America conference. Several of the principles from this paper have made it into systems I have since built.

article thumbnail

The Importance of a Great Developer Experience

Strategic Tech

In February 2012 I began working for a new company. Back in 2012, my CTO was passionate about “every developer pushing [meaningful] code to production on their first day”. Delightful Local Development Environment Naturally, developers spend a large amount of their time using their computer to build software systems.

article thumbnail

USENIX SREcon APAC 2022: Computing Performance: What's on the Horizon

Brendan Gregg

This talk originated from my updates to [Systems Performance 2nd Edition], and this was the first time I've given this talk in person! CXL in a way allows a custom memory controller to be added to a system, to increase memory capacity, bandwidth, and overall performance. Ford, et al., “TCP