What is OpenTelemetry? An open-source standard for logs, metrics, and traces

Dynatrace

Loosely defined, observability is the ability to understand what’s happening inside a system from the external data it produces, usually logs, metrics, and traces. OpenTelemetry reference architecture. Logs, metrics, and traces make up the bulk of all telemetry data. What is telemetry data?

Automated Deployment and Architectural Validation with Pitometer and keptn!

Dynatrace

At Neotys PAC 2019 in Chamonix, France, I presented approaches on how to solve this problem by looking at examples from companies such as Intuit, Dynatrace, Google, Netflix, T-Systems and others. Introducing Pitometer: Metrics-based Deployment Validation in your CI/CD. Beyond basic metrics: Detecting Architectural Regressions.

OpenTelemetry services analysis and endpoint detection made easier with Dynatrace unified services

Dynatrace

Great news: OpenTelemetry endpoint detection, analyzing OpenTelemetry services, and visualizing Istio service mesh metrics just got easier. As a CNCF open source incubating project, OpenTelemetry provides a standardized set of APIs, libraries, agents, instrumentation, and specifications for logging, metrics, and tracing.

Site reliability done right: 5 SRE best practices that deliver on business objectives

Dynatrace

As a result, site reliability has emerged as a critical success metric for many organizations. Uptime Institute’s 2022 Outage Analysis report found that over 60% of system outages resulted in at least $100,000 in total losses, up from 39% in 2019. More than one in seven outages cost more than $1 million.

Open Sourcing the Netflix Domain Graph Service Framework: GraphQL for Spring Boot

The Netflix TechBlog

Our colleagues wrote a Netflix Tech Blog post describing the details of this architecture. The transition to the new federated architecture meant that many of our backend teams needed to adopt GraphQL in our Java ecosystem. Backward compatibility is even more critical when working in a Federated GraphQL architecture.

Keeping Netflix Reliable Using Prioritized Load Shedding

The Netflix TechBlog

High-level playback architecture with priority throttling and chaos testing. Building a request taxonomy: We decided to focus on three dimensions in order to categorize request traffic: throughput, functionality, and criticality. Those two metrics are approximate indicators of failures and latency.

How Netflix Scales its API with GraphQL Federation (Part 2)

The Netflix TechBlog

Our Journey so Far: Over the past year, we’ve implemented the core infrastructure pieces necessary for a federated GraphQL architecture as described in our previous post: Studio Edge Architecture. The first Domain Graph Service (DGS) on the platform was the former GraphQL monolith that we discussed in our first post (Studio API).