Auto-Diagnosis and Remediation in Netflix Data Platform

The Netflix TechBlog

The data platform is built on top of several distributed systems, and due to the inherent nature of these systems, it is inevitable that these workloads run into failures periodically. This blog will explore these two systems and how they perform auto-diagnosis and remediation across our Big Data Platform and Real-time infrastructure.

Big Data 238

How Netflix uses eBPF flow logs at scale for network insight

The Netflix TechBlog

Cloud Network Insight is a suite of solutions that provides both operational and analytical insight into the cloud network infrastructure to address the identified problems. The Flow Exporter also publishes various operational metrics to Atlas. These metrics are visualized using Lumen, a self-service dashboarding infrastructure.

Network 325

Trending Sources

Data lakehouse innovations advance the three pillars of observability for more collaborative analytics

Dynatrace

How do you get more value from petabytes of exponentially exploding, increasingly heterogeneous data? The short answer: The three pillars of observability (logs, metrics, and traces) converging on a data lakehouse. To solve this problem, Dynatrace launched Grail, its causational data lakehouse, in 2022.

Analytics 185

Web Performance Bookshelf

Rigor

Take, for example, The Web Almanac, the golden collection of Big Data combined with the collective intelligence from most of the authors listed below, brilliantly spearheaded by Google’s @rick_viscomi. How to pioneer new metrics and create a culture of performance. Web Performance Tuning. Time is Money. Still good.

Conducting log analysis with an observability platform and full data context

Dynatrace

Causal AI, which brings AI-enabled actionable insights to IT operations, and a data lakehouse, such as Dynatrace Grail, can help break down silos among ITOps, DevSecOps, site reliability engineering, and business analytics teams. Business leaders can decide which logs they want to use and tune storage to their data needs.

Analytics 186

Python at Netflix

The Netflix TechBlog

We are heavy users of Jupyter Notebooks and nteract to analyze operational data and prototype visualization tools that help us detect capacity regressions. CORE: The CORE team uses Python in our alerting and statistical analytical work. Our Infrastructure Security team leverages Python to help with IAM permission tuning using Repokid.

Data Movement in Netflix Studio via Data Mesh

The Netflix TechBlog

The paradigm spans methods, tools, and technologies and is usually defined in contrast to analytical reporting and predictive modeling, which are more strategic (vs. tactical) in nature. At Netflix Studio, teams build various views of business data to provide visibility for day-to-day decision making. Please stay tuned!

Big Data 253