Blog, Latency and Storage - Technology Performance Pulse

Optimizing data warehouse storage

The Netflix TechBlog

DECEMBER 21, 2020

At this scale, we can gain a significant amount of performance and cost benefits by optimizing the storage layout (records, objects, partitions) as the data lands into our warehouse. We built AutoOptimize to efficiently and transparently optimize the data and metadata storage layout while maximizing their cost and performance benefits.

Storage

Storage Latency Efficiency Data Engineering

Netflix Cloud Packaging in the Terabyte Era

The Netflix TechBlog

SEPTEMBER 24, 2021

Our previous tech blog Packaging award-winning shows with award-winning technology detailed our packaging technology deployed on the streaming side. From chunk encoding to assembly and packaging, the result of each previous processing step must be uploaded to cloud storage and then downloaded by the next processing step.

Cloud

Cloud Media Storage Cache

Migrating Critical Traffic At Scale with No Downtime?—?Part 1

The Netflix TechBlog

MAY 4, 2023

This blog series will examine the tools, techniques, and strategies we have utilized to achieve this goal. This blog post will provide a detailed analysis of replay traffic testing, a versatile technique we have applied in the preliminary validation phase for multiple migration initiatives.

Traffic

Traffic Latency Tuning Systems

Mayastor: Lightning Fast Storage for Kubernetes

Percona Community

OCTOBER 23, 2020

In this blog post we’re going to see those technologies at work to give us awesome block storage performance with flexibility and simple operations. It’s a new generation in storage software, designed for super high speed low latency NVMe devices. Why is SPDK exciting?

Storage

Storage Latency Speed Database

Uber’s Big Data Platform: 100+ Petabytes with Minute Latency

Uber Engineering

OCTOBER 17, 2018

To accomplish this, Uber relies heavily on making data-driven decisions at every level, from forecasting rider demand during high traffic events to identifying and addressing bottlenecks … The post Uber’s Big Data Platform: 100+ Petabytes with Minute Latency appeared first on Uber Engineering Blog.

Big Data

Big Data Latency Transportation Traffic

Improved Alerting with Atlas Streaming Eval

The Netflix TechBlog

APRIL 27, 2023

While Atlas is architected around compute & storage separation, and we could theoretically just scale the query layer to meet the increased query demand, every query, regardless of its type, has a data component that needs to be pushed down to the storage layer. The internals here are outside the scope of this blog post.

Storage

Storage Cache Metrics Database

Reducing Your Database Hosting Costs: DigitalOcean vs. AWS vs. Azure

Scalegrid

APRIL 28, 2020

Since database hosting is more dependent on memory (RAM) than storage, we are going to compare various instance sizes ranging from just 1GB of RAM up to 64GB of RAM so you can see how costs vary across different application workloads. Does it affect latency? Yes, you can see an increase in latency. EC2 instances. VM instances.

Azure

Azure AWS Database Latency

Faster time to value with enhanced handling of OneAgent runtime data

Dynatrace

SEPTEMBER 23, 2020

This blog post highlights a group of improvements that were released with OneAgent version 1.199. Storage mount points in a system might be larger or smaller, local or remote, with high or low latency, and various speeds. Storage and network transfer of files is a measurable cost. See details below. See details below.

Storage

Storage Latency Operating System Network

Crucial Redis Monitoring Metrics You Must Watch

Scalegrid

JANUARY 25, 2024

This blog post lists the important database metrics to monitor. Key Takeaways Critical performance indicators such as latency, CPU usage, memory utilization, hit rate, and number of connected clients/slaves/evictions must be monitored to maintain Redis’s high throughput and low latency capabilities.

Metrics

Metrics Monitoring Latency Cache

Consistent caching mechanism in Titus Gateway

The Netflix TechBlog

NOVEMBER 3, 2022

This blog post presents how our current iteration of Titus deals with high API call volumes by scaling out horizontally. When a new leader is elected it loads all data from external storage. We started seeing increased response latencies and leader servers running at dangerously high utilization.

Cache

Cache Latency Traffic Systems

Data ingestion pipeline with Operation Management

The Netflix TechBlog

MARCH 7, 2023

These media focused machine learning algorithms as well as other teams generate a lot of data from the media files, which we described in our previous blog , are stored as annotations in Marken. But we cannot search or present low latency retrievals from files Etc. We refer the reader to our previous blog article for details.

Media

Media Latency Architecture Database

Compression Methods in MongoDB: Snappy vs. Zstd

Percona

MARCH 29, 2023

Compression in any database is necessary as it has many advantages, like storage reduction, data transmission time, etc. Storage reduction alone results in significant cost savings, and we can save more data in the same space. In this blog, we will discuss both data and network-level compression offered in MongoDB.

Storage

Storage Network Open Source Latency

Maximizing Performance of AWS RDS for MySQL with Dedicated Log Volumes

Percona

DECEMBER 11, 2023

A Dedicated Log Volume (DLV) is a specialized storage volume designed to house database transaction logs separately from the volume containing the database tables. DLVs are particularly advantageous for databases with large allocated storage, high I/O per second (IOPS) requirements, or latency-sensitive workloads.

AWS

AWS Benchmarking Performance Traffic

Managing risk for financial services: The secret to visibility and control during times of volatility

Dynatrace

APRIL 8, 2024

This blog explores how vertically integrated risk management solutions that use AI and automation enable unparalleled visibility, control, and efficiency for risk management in banking. Maximize performance for high-frequency and low-latency trading strategies. Automated issue resolution. Break down data silos.

Analytics

Analytics Infrastructure Efficiency Technology

Migrating Critical Traffic At Scale with No Downtime?—?Part 2

The Netflix TechBlog

JUNE 13, 2023

Our previous blog post presented replay traffic testing — a crucial instrument in our toolkit that allows us to implement these transformations with precision and reliability. This blog post will delve into the techniques leveraged at Netflix to introduce these changes to production.

Traffic

Traffic Metrics Systems Strategy

Building Netflix’s Distributed Tracing Infrastructure

The Netflix TechBlog

OCTOBER 19, 2020

In our previous blog post we introduced Edgar, our troubleshooting tool for streaming sessions. If we had an ID for each streaming session then distributed tracing could easily reconstruct session failure by providing service topology, retry and error tags, and latency measurements for all service calls. Storage: don’t break the bank!

Infrastructure

Infrastructure Transportation Storage Open Source

The AWS Storage Gateway - All Things Distributed

All Things Distributed

JANUARY 23, 2012

Expanding the Cloud - The AWS Storage Gateway. Today Amazon Web Services has launched the AWS Storage Gateway, making the power of secureÂ and reliable cloud storage accessible from customersâ?? With the launch of the AWS Storage Gateway our customers can now integrate their on-premises IT environment with AWSâ??s

Storage

Storage AWS Virtualization Cloud

Using Docker To Deploy Neon Serverless PostgreSQL

Percona

MARCH 13, 2023

There is a section in our Documentation ( Introduction to Serverless PostgreSQL ) and a short overview of the primary components: Page Server The storage server with the primary goal of storing all data pages and WAL records Safe Keeper A component to store WAL records in memory (to reduce latency). 5454 --listen-http=0.0.0.0:7676

Serverless

Serverless C++ Storage Latency

Edgar: Solving Mysteries Faster with Observability

The Netflix TechBlog

SEPTEMBER 2, 2020

In an earlier blog post, we discussed Telltale , our health monitoring system. Deriving meaningful value from trace data alone can be challenging, as Cindy Sridharan articulated in this blog post. Telltale provides Edgar with latency benchmarks that indicate if the individual trace’s latency is abnormal for this given service.

Latency

Latency Transportation Engineering Traffic

How To Scale a Single-Host PostgreSQL Database With Citus

Percona

NOVEMBER 3, 2023

xlarge 4vCPU 8GB-RAM Storage: EBS volume (root) 80GB gp2 (IOPS 240/3000) As well, high availability will be integrated, guaranteeing cluster viability in the case that one worker node goes down. And now, execute the benchmark: -- execute the following on the coordinator node pgbench -c 20 -j 3 -T 60 -P 3 pgbench The results are not pretty.

Database

Database Benchmarking Latency C++

File systems unfit as distributed storage backends: lessons from ten years of Ceph evolution

The Morning Paper

NOVEMBER 5, 2019

File systems unfit as distributed storage backends: lessons from 10 years of Ceph evolution Aghayev et al., In this case, the assumption that a distributed storage backend should clearly be layered on top of a local file system. What is a distributed storage backend? SOSP’19. This is not surprising in hindsight.

Storage

Storage Systems Hardware Efficiency

USENIX LISA2021 Computing Performance: On the Horizon

Brendan Gregg

JULY 4, 2021

AWS Graviton2); for memory with the arrival of DDR5 and High Bandwidth Memory (HBM) on-processor; for storage including new uses for 3D Xpoint as a 3D NAND accelerator; for networking with the rise of QUIC and eXpress Data Path (XDP); and so on. Ford, et al., “TCP

Performance

Performance Latency Hardware Storage

Netflix Drive

The Netflix TechBlog

MAY 5, 2021

Netflix Drive relies on a data store that will be the persistent storage layer for assets, and a metadata store which will provide a relevant mapping from the file system hierarchy to the data store entities. We will cover the different namespaces of Netflix Drive in more detail in a subsequent blog post. A sample manifest file.

Media

Media Storage Architecture Cloud

Dynatrace Managed turnkey Premium High Availability for globally distributed data centers (Early Adopter)

Dynatrace

JUNE 25, 2020

The network latency between cluster nodes should be around 10 ms or less. For Premium HA, this has been extended from 10 ms latency (in the same network region) to around 100 ms network latency due to asynchronous data replication between regions. In the image below, three downed nodes make an entire cluster unavailable.

Availability

Availability Hardware Latency Traffic

Implementing AWS well-architected pillars with automated workflows

Dynatrace

SEPTEMBER 13, 2023

In this blog post, we’ll demonstrate how Dynatrace automation and the Dynatrace Site Reliability Guardian can help you implement your applications according to all six AWS Well-Architected pillars by integrating them into your software development lifecycle (SDLC).

AWS

AWS Efficiency Azure Cloud

Evolution of ML Fact Store

The Netflix TechBlog

APRIL 26, 2022

The first version of our logger library optimized for storage by deduplicating facts and optimized for network i/o using different compression methods for each fact. Since we were optimizing at the logging level for storage and performance, we had less data and metadata to play with to optimize the query performance.

Storage

Storage Design Scalability Latency

InnoDB Performance Optimization Basics

Percona

MARCH 23, 2023

This blog is in reference to our previous ones for ‘Innodb Performance Optimizations Basics’ 2007 and 2013. Although there have been many blogs about adjusting MySQL variables for better performance since then, I think this topic deserves a blog update since the last update was a decade ago, and MySQL 5.7

Performance

Performance Hardware Tuning Storage

Scale up your Dynatrace Managed software-intelligence deployment with self-healing insights

Dynatrace

JUNE 8, 2020

As Dynatrace deployments grow rapidly, we’re making it easier for Dynatrace Managed customers to proactively monitor and plan their network, storage, and compute power requirements—so that we can deliver the SaaS experience on top of it. An illustration of the cluster overview dashboard is shown below.

Software

Software Software Programming Metrics

Cache-Control for Civilians

CSS Wizardry

MARCH 3, 2019

A great candidate for must-revalidate is a blog like mine: static pages that seldom change. If, however, there wasn’t a new file on the server, we’ll bring back a 304 header, no new file, but an entire roundtrip of latency. We can completely cut out the overhead of a roundtrip of latency. stale-while-revalidate.

Cache

Cache Latency Strategy Servers

Growth Engineering at Netflix?—?Automated Imagery Generation

The Netflix TechBlog

FEBRUARY 9, 2021

Server-generated assets, since client-side generation would require the retrieval of many individual images, which would increase latency and time-to-render. To reduce latency, assets should be generated in an offline fashion and not in real time. This requires an asset storage solution.

Engineering

Engineering Storage Latency Entertainment

Observability platform vs. observability tools

Dynatrace

DECEMBER 22, 2021

Metrics are measures of critical system values, such as CPU utilization or average write latency to persistent storage. A database could start executing a storage management process that consumes database server resources. The post Observability platform vs. observability tools appeared first on Dynatrace blog.

Artificial Intelligence

Artificial Intelligence Metrics Architecture DevOps

Get up to 300 new metrics out of the box with AWS supporting services (GA)

Dynatrace

DECEMBER 22, 2019

AWS offers a broad set of global, cloud-based services including computing, storage, networking, Internet of Things (IoT), and many others. Amazon Simple Storage Service (S3). The example below visualizes average latency by API name and stage for a specific AWS API Gateway. Dynatrace news. Amazon Kinesis Video Streams.

AWS

AWS Metrics IoT Storage

Get up to 300 new metrics out of the box with AWS supporting services (GA)

Dynatrace

DECEMBER 22, 2019

AWS offers a broad set of global, cloud-based services including computing, storage, networking, Internet of Things (IoT), and many others. Amazon Simple Storage Service (S3). The example below visualizes average latency by API name and stage for a specific AWS API Gateway. Dynatrace news. Amazon Kinesis Video Streams.

AWS

AWS Metrics IoT Storage

Optimize Citrix platform performance and user experience with a new extension (Preview)

Dynatrace

SEPTEMBER 25, 2019

Therefore, it requires multidimensional and multidisciplinary monitoring: Infrastructure health —automatically monitor the compute, storage, and network resources available to the Citrix system to ensure a stable platform. Citrix platform performance—optimize your Citrix landscape with insights into user load and screen latency per server.

Latency

Latency Performance Virtualization Infrastructure

MySQL Key Performance Indicators (KPI) With PMM

Percona

JUNE 22, 2023

In this blog, we will explore various MySQL KPIs that are basic and essential to track using monitoring tools like PMM. Replication lag can occur due to various factors such as network latency, system resource limitations, complex transactions, or heavy write loads on the primary/master database.

Performance

Performance Monitoring Traffic Database

Netflix Video Quality at Scale with Cosmos Microservices

The Netflix TechBlog

NOVEMBER 2, 2021

Cosmos offers several benefits as highlighted in the linked blog, such as separation of concerns, independent deployments, observability, rapid prototyping and productization. This enables us to use our scale to increase throughput and reduce latencies. We call this system Cosmos. VQS is called using the measureQuality endpoint.

Media

Media Innovation Metrics Latency

Hudi: Uber Engineering’s Incremental Processing Framework on Apache Hadoop

Uber Engineering

MARCH 12, 2017

With the evolution of storage formats like Apache Parquet and Apache ORC and query engines like Presto and Apache Impala , the Hadoop ecosystem has the potential to become a general-purpose, unified serving layer for workloads that can tolerate latencies … The post Hudi: Uber Engineering’s Incremental Processing Framework on Apache Hadoop appeared (..)

Processing

Processing Latency Storage Engineering

How digital experience monitoring helps deliver business observability

Dynatrace

APRIL 26, 2022

STM generates traffic that replicates the typical path or behavior of a user on a network to measure performance for example, response times, availability, packet loss, latency, jitter, and other variables). The post How digital experience monitoring helps deliver business observability appeared first on Dynatrace blog.

Monitoring

Monitoring Social Media IoT Metrics

USENIX LISA2021 Computing Performance: On the Horizon

Brendan Gregg

JULY 4, 2021

AWS Graviton2); for memory with the arrival of DDR5 and High Bandwidth Memory (HBM) on-processor; for storage including new uses for 3D Xpoint as a 3D NAND accelerator; for networking with the rise of QUIC and eXpress Data Path (XDP); and so on. Ford, et al., “TCP

Performance

Performance Latency Hardware Storage

Seamless offloading of web app computations from mobile device to edge clouds via HTML5 Web Worker migration

The Morning Paper

JANUARY 30, 2020

Edge servers are the middle ground – more compute power than a mobile device, but with latency of just a few ms. The client MWW combines these estimates with an estimate of the input/output transmission time (latency) to find the worker with the minimum overall execution latency.

Mobile

Mobile Cloud Latency Games

Titan Graph Database Integration with DynamoDB: World-class Performance, Availability, and Scale for New Workloads

All Things Distributed

AUGUST 20, 2015

Today, we are releasing a plugin that allows customers to use the Titan graph engine with Amazon DynamoDB as the backend storage layer. It opens up the possibility to enjoy the value that graph databases bring to relationship-centric use cases, without worrying about managing the underlying storage. The importance of relationships.

Database

Database Logistics Availability Social Media

Aurora vs RDS: How to Choose the Right AWS Database Solution

Percona

JULY 1, 2023

In this blog, we will answer all of these important questions and provide a general overview comparing the two database services, Aurora vs RDS. It efficiently manages read and write operations, optimizes data access, and minimizes contention, resulting in high throughput and low latency to ensure that applications perform at their best.

AWS

AWS Database Serverless Storage

A one size fits all database doesn't fit anyone

All Things Distributed

JUNE 21, 2018

That learning is at the heart of this blog post—databases are built for a purpose and matching the use case with the database will help you write high-performance, scalable, and more functional applications faster. The purpose of DynamoDB is to provide consistent single-digit millisecond latency for any scale of workloads.

Database

Database AWS Games Latency

Desktop Application Testing vs Web Application Testing

Testsigma

JULY 23, 2020

This is a standalone software program which doesn’t depend on any internet connectivity for its working and its performance is not impacted because of any network related latencies. Any network-related latencies result in performance hindrances in these types of applications. Check out how Testsigma performs automated web application.

Testing

Testing Internet Internet Latency

Optimizing data warehouse storage

Netflix Cloud Packaging in the Terabyte Era

Trending Sources

Migrating Critical Traffic At Scale with No Downtime?—?Part 1

Mayastor: Lightning Fast Storage for Kubernetes

Uber’s Big Data Platform: 100+ Petabytes with Minute Latency

Improved Alerting with Atlas Streaming Eval

Reducing Your Database Hosting Costs: DigitalOcean vs. AWS vs. Azure

Faster time to value with enhanced handling of OneAgent runtime data

Crucial Redis Monitoring Metrics You Must Watch

Consistent caching mechanism in Titus Gateway

Data ingestion pipeline with Operation Management

Compression Methods in MongoDB: Snappy vs. Zstd

Maximizing Performance of AWS RDS for MySQL with Dedicated Log Volumes

Managing risk for financial services: The secret to visibility and control during times of volatility

Migrating Critical Traffic At Scale with No Downtime?—?Part 2

Building Netflix’s Distributed Tracing Infrastructure

The AWS Storage Gateway - All Things Distributed

Using Docker To Deploy Neon Serverless PostgreSQL

Edgar: Solving Mysteries Faster with Observability

How To Scale a Single-Host PostgreSQL Database With Citus

File systems unfit as distributed storage backends: lessons from ten years of Ceph evolution

USENIX LISA2021 Computing Performance: On the Horizon

Netflix Drive

Dynatrace Managed turnkey Premium High Availability for globally distributed data centers (Early Adopter)

Implementing AWS well-architected pillars with automated workflows

Evolution of ML Fact Store

InnoDB Performance Optimization Basics

Scale up your Dynatrace Managed software-intelligence deployment with self-healing insights

Cache-Control for Civilians

Growth Engineering at Netflix?—?Automated Imagery Generation

Observability platform vs. observability tools

Get up to 300 new metrics out of the box with AWS supporting services (GA)

Get up to 300 new metrics out of the box with AWS supporting services (GA)

Optimize Citrix platform performance and user experience with a new extension (Preview)

MySQL Key Performance Indicators (KPI) With PMM

Netflix Video Quality at Scale with Cosmos Microservices

Hudi: Uber Engineering’s Incremental Processing Framework on Apache Hadoop

How digital experience monitoring helps deliver business observability

USENIX LISA2021 Computing Performance: On the Horizon

Seamless offloading of web app computations from mobile device to edge clouds via HTML5 Web Worker migration

Titan Graph Database Integration with DynamoDB: World-class Performance, Availability, and Scale for New Workloads

Aurora vs RDS: How to Choose the Right AWS Database Solution

A one size fits all database doesn't fit anyone

Desktop Application Testing vs Web Application Testing

Stay Connected