article thumbnail

Why applying chaos engineering to data-intensive applications matters

Dynatrace

Stream processing One approach to such a challenging scenario is stream processing, a computing paradigm and software architectural style for data-intensive software systems that emerged to cope with requirements for near real-time processing of massive amounts of data. This significantly increases event latency.

article thumbnail

Site reliability done right: 5 SRE best practices that deliver on business objectives

Dynatrace

As a result, site reliability has emerged as a critical success metric for many organizations. Site reliability engineering (SRE) has recently become a critical discipline in recent years as the world has shifted in favor of web-based interactions. But the transition to SRE maturity is not always easy. Service-level objectives (SLOs).

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

Software engineering for machine learning: a case study

The Morning Paper

Software engineering for machine learning: a case study Amershi et al., More specifically, we’ll be looking at the results of an internal study with over 500 participants designed to figure out how product development and software engineering is changing at Microsoft with the rise of AI and ML. ICSE’19.

article thumbnail

Application observability meets developer observability: Unlock a 360º view of your environment

Dynatrace

In a recent webinar , Dynatrace DevOps activist Andi Grabner and senior software engineer Yarden Laifenfeld explored developer observability. When an incident occurs, developers need to know what data to look at, where the incident occurred, and other relevant metrics. How do you know if this problem has business impact?

article thumbnail

DevOps observability: A guide for DevOps and DevSecOps teams

Dynatrace

Site reliability engineering (SRE) is a software operations methodology that enables organizations to create highly reliable and scalable applications. SRE applies software engineering principles to operations and infrastructure processes. Site reliability engineers, or SREs, lead these efforts. Congratulations!

DevOps 203
article thumbnail

Netflix at AWS re:Invent 2019

The Netflix TechBlog

In this session, we discuss the technologies used to run a global streaming company, growing at scale, billions of metrics, benefits of chaos in production, and how culture affects your velocity and uptime. Netflix runs dozens of stateful services on AWS under strict sub-millisecond tail-latency requirements, which brings unique challenges.

AWS 100
article thumbnail

Netflix at AWS re:Invent 2019

The Netflix TechBlog

In this session, we discuss the technologies used to run a global streaming company, growing at scale, billions of metrics, benefits of chaos in production, and how culture affects your velocity and uptime. Netflix runs dozens of stateful services on AWS under strict sub-millisecond tail-latency requirements, which brings unique challenges.

AWS 100