Remove Blog Remove Efficiency Remove Presentation Remove Systems
article thumbnail

Efficient SLO event integration powers successful AIOps

Dynatrace

This blog post is for both novice and seasoned audiences alike. The first part of this blog post briefly explores the integration of SLO events with AI. This implies that when the status is unfavorable, implementing sophisticated alerting methods like error budget burn rate alerting presents challenges and is therefore not applicable.

article thumbnail

Hawkins: Diving into the Reasoning Behind our Design System

The Netflix TechBlog

Stranger Things imagery showcasing the inspiration for the Hawkins Design System by Hawkins team member Joshua Godi ; with art contributions by Wiki Chaves Hawkins may be the name of a fictional town in Indiana, most widely known as the backdrop for one of Netflix’s most popular TV series “Stranger Things,” but the name is so much more.

Design 230
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

Migrating Critical Traffic At Scale with No Downtime?—?Part 2

The Netflix TechBlog

This is where large-scale system migrations come into play. Our previous blog post presented replay traffic testing — a crucial instrument in our toolkit that allows us to implement these transformations with precision and reliability. Canaries and sticky canaries are valuable tools in the system migration process.

Traffic 279
article thumbnail

Evolving from Rule-based Classifier: Machine Learning Powered Auto Remediation in Netflix Data…

The Netflix TechBlog

In this blog post, we present our project on Auto Remediation, which integrates the currently used rule-based classifier with an ML service and aims to automatically remediate failed jobs without human intervention. the retry success probability) and compute cost efficiency (i.e., Multi-objective optimizations.

Tuning 210
article thumbnail

Monitoring Distributed Systems

Dotcom-Montior

Web developers or administrators did not have to worry or even consider the complexity of distributed systems of today. Great, your system was ready to be deployed. Once the system was deployed, to ensure everything was running smoothly, it only took a couple of simple checks to verify. What is a Distributed System?

Systems 74
article thumbnail

Rebuilding Netflix Video Processing Pipeline with Microservices

The Netflix TechBlog

This introductory blog focuses on an overview of our journey. Future blogs will provide deeper dives into each service, sharing insights and lessons learned from this process. This architecture shift greatly reduced the processing latency and increased system resiliency.

article thumbnail

RSA guide 2024: AI and security are top concerns for organizations in every industry

Dynatrace

At this year’s RSA conference, taking place in San Francisco from May 6-9, presenters will explore ideas such as redefining security in the age of AI. As organizations train generative AI systems with critical data, they must be aware of the security and compliance risks. ’ and ‘How compliant are we? What is generative AI?