Cache, Design, Latency and Traffic - Technology Performance Pulse

Crucial Redis Monitoring Metrics You Must Watch

Scalegrid

JANUARY 25, 2024

Key Takeaways Critical performance indicators such as latency, CPU usage, memory utilization, hit rate, and number of connected clients/slaves/evictions must be monitored to maintain Redis’s high throughput and low latency capabilities. It can achieve impressive performance, handling up to 50 million operations per second.

Metrics

Metrics Monitoring Latency Cache

Supporting Diverse ML Systems at Netflix

The Netflix TechBlog

MARCH 7, 2024

Since its inception , Metaflow has been designed to provide a human-friendly API for building data and ML (and today AI) applications and deploying them in our production infrastructure frictionlessly. Deployment: Cache To produce business value, all our Metaflow projects are deployed to work with other production systems.

Systems

Systems Media Cache Open Source

Seamlessly Swapping the API backend of the Netflix Android app

The Netflix TechBlog

SEPTEMBER 8, 2020

This allows the app to query a list of “paths” in each HTTP request, and get specially formatted JSON (jsonGraph) that we use to cache the data and hydrate the UI. Looking at our high traffic UI screens (like the homepage) allowed us to identify any regressions caused by the endpoint before we enabled it for all our users.

Latency

Latency Cache Java Traffic

How LinkedIn Serves Over 4.8 Million Member Profiles per Second

InfoQ

JULY 3, 2023

LinkedIn introduced Couchbase as a centralized caching tier for scaling member profile reads to handle increasing traffic that has outgrown their existing database cluster. The new solution achieved over 99% hit rate, helped reduce tail latencies by more than 60% and costs by 10% annually. By Rafal Gancarz

Cache

Cache Latency Traffic Database

5.5 mm in 1.25 nanoseconds

Randon ASCII

JANUARY 12, 2022

That meant I started having regular meetings with the hardware engineers who were working with IBM on the CPU which gave me even more expertise on this CPU, which was critical in helping me discover a design flaw in one of its instructions , and in helping game developers master this finicky beast. To the left of that is one of the CPU cores.

Cache

Cache Latency Benchmarking Hardware

Optimizing CDN Architecture: Enhancing Performance and User Experience

IO River

NOVEMBER 2, 2023

CDNs cache content on edge servers distributed globally, reducing the distance between users and the content they want.‍CDNs use load-balancing techniques to distribute incoming traffic across multiple servers called Points of Presence (PoPs) which distribute content closer to end-users and improve overall performance.

Architecture

Architecture Cache Performance Latency

Predictive CPU isolation of containers at Netflix

The Netflix TechBlog

JUNE 4, 2019

Because microprocessors are so fast, computer architecture design has evolved towards adding various levels of caching between compute units and the main memory, in order to hide the latency of bringing the bits to the brains. This avoids thrashing caches too much for B and evens out the pressure on the L3 caches of the machine.

Cache

Cache Latency Airlines Logistics

Optimizing CDN Architecture: Enhancing Performance and User Experience

IO River

NOVEMBER 2, 2023

CDNs cache content on edge servers distributed globally, reducing the distance between users and the content they want.â€CDNs CDNs use load-balancing techniques to distribute incoming traffic across multiple servers called Points of Presence (PoPs) which distribute content closer to end-users and improve overall performance.Â

Architecture

Architecture Cache Performance Latency

Meet Hydrogen: A React Framework For Dynamic, Contextual And Personalized E-Commerce

Smashing Magazine

NOVEMBER 8, 2021

As developers, we rightfully obsess about the customer experience, relentlessly working to squeeze every millisecond out of the critical rendering path, optimize input latency, and eliminate jank. On top of this foundation, we add layers of caching, prerendering and edge delivery optimizations — not the other way around.

Cache

Cache Best Practices Strategy Servers

Consistent caching mechanism in Titus Gateway

The Netflix TechBlog

NOVEMBER 3, 2022

We introduce a caching mechanism in the API gateway layer, allowing us to offload processing from singleton leader elected controllers without giving up strict data consistency and guarantees clients observe. With traffic growth, a single leader node handling all request volume started becoming overloaded.

Cache

Cache Latency Traffic Systems

Seeing through hardware counters: a journey to threefold performance increase

The Netflix TechBlog

NOVEMBER 9, 2022

A quick canary test was free of errors and showed lower latency, which is expected given that our standard canary setup routes an equal amount of traffic to both the baseline running on 4xl and the canary on 12xl. What’s worse, average latency degraded by more than 50%, with both CPU and latency patterns becoming more “choppy.”

Hardware

Hardware Cache Performance Latency

MySQL Performance Tuning 101: Key Tips to Improve MySQL Database Performance

Percona

SEPTEMBER 1, 2023

This reduction in latency ensures that applications and websites provide a more rapid and responsive user experience. These issues often arise from suboptimal query design, missing or ineffective indexes, or dealing with large datasets. Another highly beneficial caching method is key-value caching.

Tuning

Tuning Database Performance Hardware

Expanding the Cloud: Enabling Globally Distributed Applications and Disaster Recovery

All Things Distributed

NOVEMBER 26, 2013

Cross Region Read Replicas also enable you to serve read traffic for your global customer base from regions that are nearest to them. In addition, Amazon S3 redundantly stores data in multiple facilities and is designed for 99.999999999% durability and 99.99% availability of objects over a given year.

Cloud

Cloud AWS Traffic Latency

Redis vs Memcached in 2024

Scalegrid

MARCH 28, 2024

Key Takeaways Redis offers complex data structures and additional features for versatile data handling, while Memcached excels in simplicity with a fast, multi-threaded architecture for basic caching needs. Redis is better suited for complex data models, and Memcached is better suited for high-throughput, string-based caching scenarios.

Cache

Cache Storage Scalability Architecture

Seer: leveraging big data to navigate the complexity of performance debugging in cloud microservices

The Morning Paper

MAY 14, 2019

Last time around we looked at the DeathStarBench suite of microservices-based benchmark applications and learned that microservices systems can be especially latency sensitive, and that hotspots can propagate through a microservices architecture in interesting ways. on end-to-end latency) and less than 0.15% on throughput.

Big Data

Big Data Cloud Performance Hardware

Handling user-initiated actions in an asynchronous, message-based architecture

O'Reilly Software

DECEMBER 11, 2017

This approach also minimizes network traffic throughout the architecture, as all external calls are handled by this service. In our implementation, as illustrated by message 9, we include a message time-out for late arrivals beyond a maximum latency time (TIME_OUT_LATENCY = 10 min). Solution design. Solution approach.

Architecture

Architecture Government Latency Efficiency

How We Optimized Performance To Serve A Global Audience

Smashing Magazine

AUGUST 3, 2023

These pages serve as a pivotal tool in our digital marketing strategy, not only providing valuable information about our services but also designed to be easily discoverable through search engines. It increases our visibility and enables us to draw a steady stream of organic (or “free”) traffic to our site. SEO is key to our success.

Performance

Performance Cache Traffic Metrics

What is a Distributed Storage System

Scalegrid

FEBRUARY 8, 2024

Their design emphasizes increasing availability by spreading out files among different nodes or servers — this approach significantly reduces risks associated with losing or corrupting data due to node failure. Variations within these storage systems are called distributed file systems.

Storage

Storage Systems Big Data Azure

MongoDB Best Practices: Security, Data Modeling, & Schema Design

Percona

APRIL 17, 2023

The CFQ works well for many general use cases but lacks latency guarantees. The deadline excels at latency-sensitive use cases ( like databases ), and noop is closer to no schedule at all. Make sure the drives are mounted with noatime and also if the drives are behind a RAID controller with appropriate battery-backed cache.

Best Practices

Best Practices Design Tuning Database

Comparisons of Proxies for MySQL

Percona

MARCH 20, 2023

When designing an architecture, many components need to be considered before deciding on the best solution. In this context, features like filtering, firewalling, or caching are redundant and may consume resources that could be allocated to scaling. MySQL Router is the one that has the higher latency no matter what.

Games

Games Latency Traffic Cache

Optimizing Video Streaming CDN Architecture for Cost Reduction and Enhanced Streaming Performance

IO River

NOVEMBER 2, 2023

What Comprises Video Streaming - Traffic CharacteristicsWith the emphasis on a high-quality streaming experience, the optimization starts from the very core. Fundamentally, internet traffic can be broadly categorized into static and dynamic content. Given its unchanging nature, static content is ideal for caching.

Architecture

Architecture Performance Internet Internet

Stuff The Internet Says On Scalability For July 20th, 2018

High Scalability

JULY 20, 2018

That means multiple data indirections mean multiple cache misses. DonHopkins : NeWS differs from the current technology stack in that it was all coherently designed at once by James Gosling and David Rosenthal, by taking several steps back and thinking deeply about all the different problems it was trying to solve together.

Internet

Internet Internet Scalability Automotive

Optimizing Video Streaming CDN Architecture for Cost Reduction and Enhanced Streaming Performance

IO River

NOVEMBER 2, 2023

â€What Comprises Video Streaming - Traffic CharacteristicsWith the emphasis on a high-quality streaming experience, the optimization starts from the very core. Fundamentally, internet traffic can be broadly categorized into static and dynamic content.Â Letâ€™s analyze how you can achieve this win-win as effectively as possible!â€What

Architecture

Architecture Performance Internet Internet

Proof of Concept: Horizontal Write Scaling for MySQL With Kubernetes Operator

Percona

MAY 15, 2023

MySQL, as well as other RDBMS, are designed to work respecting the model and cannot scale in any way by fragmenting and distributing a schema, so what can be done to scale? As illustrated above, ProxySQL allows us to set up a common entry point for the application and then redirect the traffic on the base of identified sharding keys.

Traffic

Traffic Scalability Database Servers

DBLog: A Generic Change-Data-Capture Framework

The Netflix TechBlog

DECEMBER 17, 2019

Nonetheless, we found a number of limitations that could not satisfy our requirements e.g. stalling the processing of log events until a dump is complete, missing ability to trigger dumps on demand, or implementations that block write traffic by using table locks. Designed with High Availability in mind. Writing events to any output.

Database

Database Traffic Transportation Open Source

DBLog: A Generic Change-Data-Capture Framework

The Netflix TechBlog

DECEMBER 17, 2019

Nonetheless, we found a number of limitations that could not satisfy our requirements e.g. stalling the processing of log events until a dump is complete, missing ability to trigger dumps on demand, or implementations that block write traffic by using table locks. Designed with High Availability in mind. Writing events to any output.

Database

Database Traffic Transportation Open Source

Scaling Amazon ElastiCache for Redis with Online Cluster Resizing

All Things Distributed

NOVEMBER 21, 2017

Redis's microsecond latency has made it a de facto choice for caching. Four years ago, as part of our AWS fast data journey, we introduced Amazon ElastiCache for Redis , a fully managed, in-memory data store that operates at microsecond latency. TB of in-memory capacity in a single cluster.

Games

Games Retail Latency Education

The Surprising Effectiveness of Non-Overlapping, Sensitivity-Based Performance Models

John McCalpin

APRIL 2, 2020

The presentation discusses a family of simple performance models that I developed over the last 20 years — originally in support of processor and system design at SGI (1996-1999), IBM (1999-2005), and AMD (2006-2008), but more recently in support of system procurements at The Texas Advanced Computing Center (TACC) (2009-present).

Benchmarking

Benchmarking Performance Latency Architecture

A Decade of Dynamo: Powering the next wave of high-performance, internet-scale applications

All Things Distributed

OCTOBER 2, 2017

With these requirements in mind, and a willingness to question the status quo, a small group of distributed systems experts came together and designed a horizontally scalable distributed database that would scale out for both reads and writes to meet the long-term needs of our business. This was the genesis of the Amazon Dynamo database.

Internet

Internet Internet AWS Performance

A one size fits all database doesn't fit anyone

All Things Distributed

JUNE 21, 2018

Further, with the growth and scale of Amazon.com, boundless horizontal scale needed to be a key design point--scaling up simply wasn't an option. Use cases such as gaming, ad tech, and IoT lend themselves particularly well to the key-value data model where the access patterns require low-latency Gets/Puts for known key values.

Database

Database AWS Games Latency

Content Management Systems of the Future: Headless, JAMstack, ADN and Functions at the Edge

Abhishek Tiwari

NOVEMBER 3, 2018

You should expect one-time implementation cost (depending CMS and business requirements it can cost 200,000 USD to 3M USD) and yearly hosting infrastructure cost (proportional to load and traffic but typically 30,000 USD - 300,000 USD per year). This made whole publishing process really slow and painful and CMS was part of growing pain.

Systems

Systems Cache Website Network

Amazon DynamoDB ? a Fast and Scalable NoSQL Database.

All Things Distributed

JANUARY 18, 2012

a Fast and Scalable NoSQL Database Service Designed for Internet Scale Applications. Today is a very exciting day as we release Amazon DynamoDB , a fast, highly reliable and cost-effective NoSQL database service designed for internet scale applications. Amazon DynamoDB offers low, predictable latencies at any scale. Comments ().

Scalability

Scalability Database Ecommerce Latency

The Performance Inequality Gap, 2021

Alex Russell

MARCH 6, 2021

A then-representative $200USD device had 4-8 slow (in-order, low-cache) cores, ~2GiB of RAM, and relatively slow MLC NAND flash storage. The fastest Androids predictably remain 18-24 months behind, owing to cheapskate choices about cache sizing by Qualcomm, Samsung Semi, and all the rest. The Moto G4 , for example.

Performance

Performance Network Cache Metrics

MongoDB Database Backup: Best Practices & Expert Tips

Percona

MAY 2, 2023

Cons of logical backups As it reads all data, it can be slow and will require disk reads too for databases that are larger than the RAM available for the WT cache—the WT cache pressure increases, which slows down the performance. Especially if going into or out of storage types that may throttle bandwidth/network traffic.

Best Practices

Best Practices Database Storage Servers

Lessons Learned Rebuilding A Large E-Commerce Website With Next.js (Case Study)

Smashing Magazine

SEPTEMBER 24, 2021

That was until we went to production with our highest traffic customer. To mitigate the performance issues, we had to add a lot of (unbudgeted) extra servers and had to aggressively cache pages on a reverse proxy. It can be hosted on a CDN like Vercel or Netlify, which results in lower latency. As a result, they found that a 0.1s

Website

Website Code Servers Analytics

Why I hate MPI (from a performance analysis perspective)

John McCalpin

AUGUST 1, 2018

Detailed analysis of the cache coherence transactions involved in shared-memory synchronization (especially across sockets) show that the lower bound on (uncontested) latency is at least an order of magnitude higher than what a hardware implementation should be able to support. Runtime contention.

Hardware

Hardware Transportation Performance Latency

HTTP/3: Performance Improvements (Part 2)

Smashing Magazine

AUGUST 22, 2021

Because we are dealing with network protocols here, we will mainly look at network aspects, of which two are most important: latency and bandwidth. Latency can be roughly defined as the time it takes to send a packet from point A (say, the client) to point B (the server). Two-way latency is often called round-trip time (RTT).

Performance

Performance Network Latency Servers

HTTP/3 From A To Z: Core Concepts (Part 1)

Smashing Magazine

AUGUST 9, 2021

You’ve probably heard things like: “HTTP/3 is much faster than HTTP/2 when there is packet loss”, or “HTTP/3 connections have less latency and take less time to set up”, and probably “HTTP/3 can send data more quickly and can send more resources in parallel”. TLS, TCP, and QUIC handshake durations ( Large preview ).

Transportation

Transportation Internet Internet Network

Can You Afford It?: Real-world Web Performance Budgets

Alex Russell

OCTOBER 22, 2017

Contended, over-subscribed cells can make “fast” networks brutally slow, transport variance can make TCP much less efficient , and the bursty nature of web traffic works against us. It simulates a link with a 400ms RTT and 400-600Kbps of throughput (plus latency variability and simulated packet loss).

Performance

Performance Benchmarking Network Mobile

Front-End Performance Checklist 2021

Smashing Magazine

JANUARY 11, 2021

Performance isn’t just a technical concern: it affects everything from accessibility to usability to search engine optimization, and when baking it into the workflow, design decisions have to be informed by their performance implications. Looking back now, things seem to have changed quite significantly. Large preview ). Large preview ).

Performance

Performance Cache Media Metrics

Front-End Performance Checklist 2020 [PDF, Apple Pages, MS Word]

Smashing Magazine

JANUARY 6, 2020

Performance isn’t just a technical concern: it affects everything from accessibility to usability to search engine optimization, and when baking it into the workflow, design decisions have to be informed by their performance implications. Looking back now, things seem to have changed quite significantly. thanks, Guy Podjarny! ).

Performance

Performance Cache Network Metrics

Front-End Performance Checklist 2019 [PDF, Apple Pages, MS Word]

Smashing Magazine

JANUARY 7, 2019

Performance isn’t just a technical concern: it matters, and when baking it into the workflow, design decisions have to be informed by their performance implications. Without a strong alignment between dev/design and business/marketing teams, performance isn’t going to sustain long-term. Large preview ).

Performance

Performance Cache Network Metrics

Crucial Redis Monitoring Metrics You Must Watch

Supporting Diverse ML Systems at Netflix

Trending Sources

Seamlessly Swapping the API backend of the Netflix Android app

How LinkedIn Serves Over 4.8 Million Member Profiles per Second

5.5 mm in 1.25 nanoseconds

Optimizing CDN Architecture: Enhancing Performance and User Experience

Predictive CPU isolation of containers at Netflix

Optimizing CDN Architecture: Enhancing Performance and User Experience

Meet Hydrogen: A React Framework For Dynamic, Contextual And Personalized E-Commerce

Consistent caching mechanism in Titus Gateway

Seeing through hardware counters: a journey to threefold performance increase

MySQL Performance Tuning 101: Key Tips to Improve MySQL Database Performance

Expanding the Cloud: Enabling Globally Distributed Applications and Disaster Recovery

Redis vs Memcached in 2024

Seer: leveraging big data to navigate the complexity of performance debugging in cloud microservices

Handling user-initiated actions in an asynchronous, message-based architecture

How We Optimized Performance To Serve A Global Audience

What is a Distributed Storage System

MongoDB Best Practices: Security, Data Modeling, & Schema Design

Comparisons of Proxies for MySQL

Optimizing Video Streaming CDN Architecture for Cost Reduction and Enhanced Streaming Performance

Stuff The Internet Says On Scalability For July 20th, 2018

Optimizing Video Streaming CDN Architecture for Cost Reduction and Enhanced Streaming Performance

Proof of Concept: Horizontal Write Scaling for MySQL With Kubernetes Operator

DBLog: A Generic Change-Data-Capture Framework

DBLog: A Generic Change-Data-Capture Framework

Scaling Amazon ElastiCache for Redis with Online Cluster Resizing

The Surprising Effectiveness of Non-Overlapping, Sensitivity-Based Performance Models

A Decade of Dynamo: Powering the next wave of high-performance, internet-scale applications

A one size fits all database doesn't fit anyone

Content Management Systems of the Future: Headless, JAMstack, ADN and Functions at the Edge

Amazon DynamoDB ? a Fast and Scalable NoSQL Database.

The Performance Inequality Gap, 2021

MongoDB Database Backup: Best Practices & Expert Tips

Lessons Learned Rebuilding A Large E-Commerce Website With Next.js (Case Study)

Why I hate MPI (from a performance analysis perspective)

HTTP/3: Performance Improvements (Part 2)

HTTP/3 From A To Z: Core Concepts (Part 1)

Can You Afford It?: Real-world Web Performance Budgets

Front-End Performance Checklist 2021

Front-End Performance Checklist 2020 [PDF, Apple Pages, MS Word]

Front-End Performance Checklist 2019 [PDF, Apple Pages, MS Word]

Stay Connected