Remove Latency Remove Network Remove Performance Remove Software Engineering
article thumbnail

Site reliability done right: 5 SRE best practices that deliver on business objectives

Dynatrace

With so many of their transactions occurring online, customers are becoming more demanding, expecting websites and applications to always perform perfectly. However, cloud complexity has made software delivery challenging. The volume of travel spending booked online is expected to reach nearly $1.5

article thumbnail

Snap: a microkernel approach to host networking

The Morning Paper

Snap: a microkernel approach to host networking Marty et al., This paper describes the networking stack, Snap , that has been running in production at Google for the last three years+. You need a lot of software engineers and the willingness to rewrite a lot of software to entertain that idea. SOSP’19.

Network 92
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

Designing Instagram

High Scalability

from a client it performs two parallel operations: i) persisting the action in the data store ii) publish the action in a streaming data store for a pub-sub model. User Feed Service, Media Counter Service) read the actions from the streaming data store and performs their specific tasks. After that, the various services (e.g.

Design 334
article thumbnail

Consistent caching mechanism in Titus Gateway

The Netflix TechBlog

In that scenario, the system would need to deal with the data propagation latency directly, for example, by use of timeouts or client-originated update tracking mechanisms. We started seeing increased response latencies and leader servers running at dangerously high utilization.

Cache 224
article thumbnail

Evolution of ML Fact Store

The Netflix TechBlog

The first version of our logger library optimized for storage by deduplicating facts and optimized for network i/o using different compression methods for each fact. Then we started hitting roadblocks while optimizing the query performance. Eventually, we decided to simplify the logger. However, Spark query execution remained slow.

Storage 187
article thumbnail

Netflix at AWS re:Invent 2019

The Netflix TechBlog

December 2 1pm-2pm CMP 326-R Capacity Management Made Easy with Amazon EC2 Auto Scaling Vadim Filanovsky , Senior Performance Engineer & Anoop Kapoor, AWS Abstract :Amazon EC2 Auto Scaling offers a hands-free capacity management experience to help customers maintain a healthy fleet, improve application availability, and reduce costs.

AWS 100
article thumbnail

Netflix at AWS re:Invent 2019

The Netflix TechBlog

December 2 1pm-2pm CMP 326-R Capacity Management Made Easy with Amazon EC2 Auto Scaling Vadim Filanovsky , Senior Performance Engineer & Anoop Kapoor, AWS Abstract :Amazon EC2 Auto Scaling offers a hands-free capacity management experience to help customers maintain a healthy fleet, improve application availability, and reduce costs.

AWS 100