Latency, Performance and Traffic - Technology Performance Pulse

Migrating Critical Traffic At Scale with No Downtime — Part 1

The Netflix TechBlog

MAY 4, 2023

By Shyam Gala, Javier Fernandez-Ivern, Anup Rokkam Pratap, and Devang Shah. Hundreds of millions of customers tune into Netflix every day, expecting an uninterrupted and immersive streaming experience. This approach has a handful of benefits.

Traffic

Traffic Latency Tuning Systems

Maximize user experience with out-of-the-box service-performance SLOs

Dynatrace

AUGUST 25, 2023

This article explores SLOs for service performance. According to the Google Site Reliability Engineering (SRE) handbook, monitoring the four golden signals is crucial in delivering high-performing software solutions. SLOs, as a measure of service quality, can track the related availability, reliability, and performance.

Performance

Performance Latency Traffic Metrics

The Power of Caching: Boosting API Performance and Scalability

DZone

AUGUST 16, 2023

Benefits of Caching Improved performance: Caching eliminates the need to retrieve data from the original source every time, resulting in faster response times and reduced latency. Reduced server load: By serving cached content, the load on the server is reduced, allowing it to handle more requests and improving overall scalability.

Cache

Cache Scalability Performance Latency

What are quality gates? How to use quality gates to deliver better software at speed and scale

Dynatrace

FEBRUARY 21, 2024

Benefits of quality gates Quality gates provide several advantages to organizations, including the following: Optimized software performance: Quality gates assess code at different SDLC stages and ensure that only high-quality code progresses. Several tools can be used to collect metrics in load/performance testing.

Speed

Speed Software Software Latency

FIFO vs. LIFO: Which Queueing Strategy Is Better for Availability and Latency?

DZone

MARCH 14, 2023

As an engineer, you probably know that server performance under heavy load is crucial for maintaining the availability and responsiveness of your services. But what happens when traffic bursts overwhelm your system? Queueing requests is a common solution, but what's the best approach: FIFO or LIFO?

Strategy

Strategy Latency Availability Traffic

Bending pause times to your will with Generational ZGC

The Netflix TechBlog

MARCH 5, 2024

Reduced tail latencies In both our GRPC and DGS Framework services, GC pauses are a significant source of tail latencies. Each of these errors is a canceled request resulting in a retry so this reduction further reduces overall service traffic by this rate: Errors rates per second. There is no best garbage collector.

Latency

Latency Java Tuning Efficiency

Event-Based Autoscaling: Ensuring Smooth Operations on Your Peak Days

DZone

JANUARY 21, 2024

These organizations face a common challenge – how much infrastructure do they need to ensure optimal performance without overprovisioning – which can become very costly, very quickly. Even retail giants like Amazon have faced customer dissatisfaction during events like Prime Day when the website couldn't handle the traffic.

Retail

Retail Games Latency Traffic

Service level objectives: 5 SLOs to get started

Dynatrace

JUNE 1, 2023

Service level objectives (SLOs) provide a powerful framework for measuring and maintaining software performance, reliability, and user satisfaction. SLOs are a valuable tool for organizations to ensure the health and performance of their applications. Note: you might hear the term latency used instead of response time.

Latency

Latency Website Traffic Virtualization

Service level objective examples: 5 SLO examples for faster, more reliable apps

Dynatrace

JUNE 1, 2023

Service level objectives (SLOs) provide a powerful framework for measuring and maintaining software performance, reliability, and user satisfaction. Teams can build on these SLO examples to improve application performance and reliability. Note: you might hear the term latency used instead of response time.

Traffic

Traffic Latency Website Virtualization

Maximizing Performance of AWS RDS for MySQL with Dedicated Log Volumes

Percona

DECEMBER 11, 2023

A quick configuration change may do the trick in improving the performance of your AWS RDS for MySQL instance. Here, we will discuss a notable new feature in Amazon RDS, the Dedicated Log Volume (DLV), that has been introduced to boost database performance. Who can benefit from DLV? 2xlarge c5.2xlarge MySQL 8.0.31

AWS

AWS Benchmarking Performance Traffic

How Dynatrace boosts production resilience with Site Reliability Guardian

Dynatrace

MAY 17, 2023

Validation tasks are then extended left to cover performance testing and release validation in a pre-production environment. While the first guardian validates the traffic, the second guardian checks the business transactions generated during the observation period. The functionality is implemented via an automated workflow.

DevOps

DevOps Traffic Latency Best Practices

Types Of Performance Testing and When to Use Them

DZone

FEBRUARY 26, 2021

Today, every business wants high-performing and high-quality software. But most applications fail to deliver the expected performance under peak load or fluctuating network conditions. What Is Performance Testing? Today, let's learn more about this testing type in depth.

Performance Testing

Performance Testing Testing Performance Latency

Site reliability done right: 5 SRE best practices that deliver on business objectives

Dynatrace

MAY 31, 2023

With so many of their transactions occurring online, customers are becoming more demanding, expecting websites and applications to always perform perfectly. There are now many more applications, tools, and infrastructure variables that impact an application’s performance and availability. Remember that less is more.

Best Practices

Best Practices DevOps Latency Metrics

Automated Change Impact Analysis with Site Reliability Guardian

Dynatrace

FEBRUARY 15, 2023

SREs use Service-Level Indicators (SLI) to see the complete picture of service availability, latency, performance, and capacity across various systems, especially revenue-critical systems. While this empowers teams to frequently deliver new features, the overall business, security, and quality objectives must be maintained.

DevOps

DevOps Latency Traffic Best Practices

Crucial Redis Monitoring Metrics You Must Watch

Scalegrid

JANUARY 25, 2024

Redis® is an in-memory database that provides blazingly fast performance. This makes it a compelling alternative to disk-based databases when performance is a concern. You might already use ScaleGrid hosting for Redis hosting to power your performance-sensitive applications.

Metrics

Metrics Monitoring Latency Cache

Implementing service-level objectives to improve software quality

Dynatrace

DECEMBER 27, 2022

When organizations implement SLOs, they can improve software development processes and application performance. SLOs can be a great way for DevOps and infrastructure teams to use data and performance expectations to make decisions, such as whether to release and where engineers should focus their time. SLOs improve software quality.

Software

Software Software Benchmarking Latency

Towards a Unified Theory of Web Performance

Alex Russell

FEBRUARY 28, 2022

It's being reposted here for completeness, but if you care about web performance, make sure to check out the whole series and get subscribed to the RSS feed to avoid missing any of next year's posts. The predominant answer: a unified theory of web performance. What, in particular, is "web performance"? How do we do it?

Performance

Performance Latency Architecture Network

MySQL Key Performance Indicators (KPI) With PMM

Percona

JUNE 22, 2023

As a MySQL database administrator, keeping a close eye on the performance of your MySQL server is crucial to ensure optimal database operations. A monitoring tool like Percona Monitoring and Management (PMM) is a popular choice among open source options for effectively monitoring MySQL performance.

Performance

Performance Monitoring Traffic Database

Optimizing CDN Architecture: Enhancing Performance and User Experience

IO River

NOVEMBER 2, 2023

CDNs cache content on edge servers distributed globally, reducing the distance between users and the content they want. CDNs use load-balancing techniques to distribute incoming traffic across multiple servers called Points of Presence (PoPs) which distribute content closer to end-users and improve overall performance.

Architecture

Architecture Cache Performance Latency

Optimizing CDN Architecture: Enhancing Performance and User Experience

IO River

NOVEMBER 2, 2023

CDNs use load-balancing techniques to distribute incoming traffic across multiple servers called Points of Presence (PoPs) which distribute content closer to end-users and improve overall performance. When an edge server goes down, end users in the affected region may experience an increase in latency for that specific location.

Architecture

Architecture Cache Performance Latency

Native App Network Performance Analysis

DZone

APRIL 7, 2021

When 54 percent of the internet traffic share is accounted for by Mobile, it's certainly nontrivial to acknowledge how your app can make a difference to that of the competitor! Introduction.

Network

Network Performance Cache Internet

SLOs done right: how DevOps teams can build better service-level objectives

Dynatrace

MARCH 16, 2023

In the 2023 Perform session “SLOs done right: A practitioners guide,” Michael Cabrera, SRE lead at Vivint, and Andreas Grabner, DevSecOps activist at Dynatrace, break down the state of SLOs and discuss how teams can adopt successful SLOs, avoid less-than-ideal objectives, and ultimately build better SLOs.

DevOps

DevOps Latency Metrics Traffic

How We Optimized Performance To Serve A Global Audience

Smashing Magazine

AUGUST 3, 2023

By Liran Cohen. I work for Bookaway, a digital travel brand. It increases our visibility and enables us to draw a steady stream of organic (or “free”) traffic to our site.

Performance

Performance Cache Traffic Metrics

Dynatrace Managed turnkey Premium High Availability for globally distributed data centers (Early Adopter)

Dynatrace

JUNE 25, 2020

The network latency between cluster nodes should be around 10 ms or less. Minimized cross-data center network traffic. – A Dynatrace customer, Head of Performance Engineering. Regular Dynatrace Managed deployments can work seamlessly when a maximum of two nodes are down at a time and the network has low latency.

Availability

Availability Hardware Latency Traffic

Build and operate multicloud FaaS with enhanced, intelligent end-to-end observability

Dynatrace

APRIL 25, 2023

For example, to handle traffic spikes and pay only for what they use. It helps developers and operators identify and troubleshoot issues, optimize performance and improve user experience. Scale automatically based on the demand and traffic patterns. The elasticity of serverless services helps organizations scale as needed.

Serverless

Serverless Lambda Azure AWS

Migrating Critical Traffic At Scale with No Downtime — Part 2

The Netflix TechBlog

JUNE 13, 2023

By Shyam Gala, Javier Fernandez-Ivern, Anup Rokkam Pratap, and Devang Shah. Picture yourself enthralled by the latest episode of your beloved Netflix series, delighting in an uninterrupted, high-definition streaming experience. This is where large-scale system migrations come into play.

Traffic

Traffic Metrics Systems Strategy

Optimizing Video Streaming CDN Architecture for Cost Reduction and Enhanced Streaming Performance

IO River

NOVEMBER 2, 2023

They need to deliver impeccable performance without breaking the bank. According to recent industry statistics, global streaming has seen an uptick of 30% in the past year, underscoring the importance of efficient CDN architecture strategies. Fundamentally, internet traffic can be broadly categorized into static and dynamic content.

Architecture

Architecture Performance Internet Internet

Data Reprocessing Pipeline in Asset Management Platform @Netflix

The Netflix TechBlog

MARCH 10, 2023

Production assets operations are performed in parallel with older data reprocessing without any service downtime. Existing data got updated to be backward compatible without impacting the existing running production traffic. Instead we use Elasticsearch to search those assets which are more performant.

Media

Media Traffic Processing Design

Understanding operational 5G: a first measurement study on its coverage, performance and energy consumption

The Morning Paper

OCTOBER 4, 2020

Xu et al. What is the end-to-end throughput and latency, and where are the bottlenecks? Throughput and latency. 5G network paths achieve an average latency of 21.8ms, a 32% reduction on the comparable 4G times.

Energy

Energy Latency Performance Network

Taiji: managing global user traffic for large-scale Internet services at the edge

The Morning Paper

NOVEMBER 14, 2019

Xu et al., SOSP’19. It’s another networking paper to close out the week (and our coverage of SOSP’19), but whereas Snap looked at traffic routing within the datacenter, Taiji is concerned with routing traffic from the edge to a datacenter.

Traffic

Traffic Internet Internet Latency

Multi-CDN Strategy: Benefits and Best Practices

IO River

NOVEMBER 2, 2023

A CDN (Content Delivery Network) is a network of geographically distributed servers that brings web content closer to where end users are located, to ensure high availability, optimized performance and low latency. The result would be a performance drop in the end users’ experience, often causing the application to become unusable.

Best Practices

Best Practices Strategy Traffic Virtualization

Seamlessly Swapping the API backend of the Netflix Android app

The Netflix TechBlog

SEPTEMBER 8, 2020

For each route we migrated, we wanted to make sure we were not introducing any regressions: either in the form of missing (or worse, wrong) data, or by increasing the latency of each endpoint. Being able to canary a new route let us verify latency and error rates were within acceptable limits. Replay Testing Enter replay testing.

Latency

Latency Cache Java Traffic

Redis vs Memcached in 2024

Scalegrid

MARCH 28, 2024

In this comparison of Redis vs Memcached, we strip away the complexity, focusing on each in-memory data store’s performance, scalability, and unique features. Redis and Memcached both provide high performance with sub-millisecond response times. Choosing between Redis and Memcached hinges on specific application requirements.

Cache

Cache Storage Scalability Architecture

Towards a Reliable Device Management Platform

The Netflix TechBlog

AUGUST 30, 2021

Canary Test Workloads In addition to serving the regular message traffic between users and DUTs, the control plane itself is stress-tested at roughly 3-hour intervals, where nearly 3000 ephemeral MQTT clients are created to connect to and generate flash traffic on the MQTT brokers. million elements.

Latency

Latency Traffic Transportation Hardware

Lessons learned from enterprise service-level objective management

Dynatrace

MAY 19, 2022

This multinational information technology service and consulting company was asked to help a global automotive manufacturer with the management goal of measuring service flow performance. In their new dashboard, they added dimensions for load, latency, and open problems for each component. Initial SLO management dashboard. Saturation.

Automotive

Automotive Latency Architecture Azure

Supporting Diverse ML Systems at Netflix

The Netflix TechBlog

MARCH 7, 2024

Occasionally, these use cases involve terabytes of data, so we have to pay attention to performance. The user can choose the most suitable tool for manipulating data, such as Pandas or Polars to use a dataframe API, or one of our internal C++ libraries for various high-performance operations.

Systems

Systems Media Cache Open Source

What is cloud migration?

Dynatrace

SEPTEMBER 30, 2021

In case of a spike in traffic, you can automatically spin up more resources, often in a matter of seconds. Likewise, you can scale down when your application experiences decreased traffic. For example, as traffic increases, costs will too. Improved performance and availability. Inconsistent performance.

Cloud

Cloud Traffic Best Practices Strategy

Real user monitoring vs. synthetic monitoring: Understanding best practices

Dynatrace

JUNE 27, 2022

These development and testing practices ensure the performance of critical applications and resources to deliver loyalty-building user experiences. Real user monitoring (RUM) is a performance monitoring process that collects detailed data about users’ interactions with an application. What is real user monitoring?

Best Practices

Best Practices Monitoring Wireless Traffic

How LinkedIn Serves Over 4.8 Million Member Profiles per Second

InfoQ

JULY 3, 2023

LinkedIn introduced Couchbase as a centralized caching tier for scaling member profile reads to handle increasing traffic that has outgrown their existing database cluster. The new solution achieved over 99% hit rate, helped reduce tail latencies by more than 60% and costs by 10% annually. By Rafal Gancarz

Cache

Cache Latency Traffic Database

A Management Maturity Model for Performance

Alex Russell

MAY 9, 2022

Despite advances in browser tooling, automated evaluation, lab tools, guidance, and runtimes, however, teams struggle to deliver even decent performance with today's popular frameworks. What is Performance? It may seem a silly question, but what is performance, exactly?

Performance

Performance Latency Metrics Engineering

The road to observability demo part 3: Collect, instrument, and analyze telemetry data automatically with Dynatrace

Dynatrace

MAY 17, 2023

Making applications observable—relying on metrics, logs, and traces to understand what software is doing and how it’s performing—has become increasingly important as workloads are shifting to multicloud environments. This will get us straight to the application page, where we get more insight on how our front end actually performs.

Metrics

Metrics Monitoring Database Network

Automated observability, security, and reliability at scale

Dynatrace

JULY 18, 2023

While infrastructure has historically been treated as a bottleneck where proper scaling and compute power are applied to improve performance, these aspects are now typically addressed by hyperscalers that offer cloud-based infrastructure and infrastructure as a service.

Best Practices

Best Practices Code Infrastructure Latency

The Best Way to Host MongoDB on DigitalOcean

Scalegrid

DECEMBER 16, 2019

What’s most impressive is that you’re not compromising performance for cost. We ran performance tests for MongoDB on DigitalOcean vs. AWS vs. Azure and found that DigitalOcean performance was in line with, if not better, on both high throughput and low latency in the deployment. Monitoring Performance.

Azure

Azure AWS Latency Database
