
Allegro Reduces Kafka Producer Latency Outliers by 82% After Switching to XFS

InfoQ

Allegro experimented with different performance optimization options to improve Apache Kafka producer tail latency and eventually switched all its clusters to the XFS filesystem. The company used Kafka protocol sniffing, JVM profiling, and eBPF, which proved instrumental in identifying and eliminating performance bottlenecks.

Latency 110

Bending pause times to your will with Generational ZGC

The Netflix TechBlog

Reduced tail latencies: In both our gRPC and DGS Framework services, GC pauses are a significant source of tail latencies. In fact, we’ve found for our services and architecture that there is no such trade-off. No explicit tuning has been required to achieve these results.

Latency 228


Migrating Critical Traffic At Scale with No Downtime - Part 1

The Netflix TechBlog

By Shyam Gala, Javier Fernandez-Ivern, Anup Rokkam Pratap, and Devang Shah. Hundreds of millions of customers tune into Netflix every day, expecting an uninterrupted and immersive streaming experience. This approach has a handful of benefits.

Traffic 339

LinkedIn Migrates Espresso to HTTP2 and Reduces Connections by 88% and Latency by 75%

InfoQ

LinkedIn dramatically improved the scalability and performance of its Espresso database by migrating it from HTTP/1.1 to HTTP/2, which reduced the number of connections, latency, and garbage collection times. By Rafal Gancarz

Latency 99

Enhancing Kubernetes cluster management key to platform engineering success

Dynatrace

During a breakout session at Dynatrace Perform 2024, Alois Mayr, principal product manager at Dynatrace, and Stefano Doni, CTO at Akamas, broke down how Dynatrace and Akamas can help organizations enhance Kubernetes cluster management. Consider a sudden drop in performance. “Additionally, we are full-stack and goal-oriented.”


Evolving from Rule-based Classifier: Machine Learning Powered Auto Remediation in Netflix Data…

The Netflix TechBlog

Operational automation (including, but not limited to, auto diagnosis, auto remediation, auto configuration, auto tuning, auto scaling, auto debugging, and auto testing) is key to the success of modern data platforms. Auto Remediation generates recommendations by considering both performance (i.e., Multi-objective optimizations.

Tuning 210

Automated observability, security, and reliability at scale

Dynatrace

This is especially crucial in microservice architectures, where the number of components can be overwhelming. This same mechanism can also be leveraged to validate the impact of new software releases on resources, logs, performance, reliability, or business measures. You can read all about it in our Configuration as Code documentation.