Supporting Diverse ML Systems at Netflix

The Netflix TechBlog

The Machine Learning Platform (MLP) team at Netflix provides an entire ecosystem of tools around Metaflow, an open source machine learning infrastructure framework we started, to empower data scientists and machine learning practitioners to build and manage a variety of ML systems.

Google Cloud Next 2024: AI innovation for Google Cloud

Dynatrace

Elevate AI observability with Google Cloud and Dynatrace (on-demand session). Explore how Dynatrace seamlessly observes Google Cloud Platform (GCP) AI tooling such as Duet AI and Vertex AI using the Dynatrace GCP integration and automated discovery. Learn to boost system reliability through proactive issue detection.

Trending Sources

OpenShift vs. Kubernetes: Understanding the differences

Dynatrace

A guide to container orchestration software. Container orchestration software automates the administration of containerized workloads and services, greatly reducing the time IT staff spend keeping an application environment running smoothly. Like Kubernetes, OpenShift is open source; it is a container platform built on Kubernetes.

The value of open platforms: How an open software intelligence platform accelerates innovation

Dynatrace

What is an open platform, and why do organizations need one? An open platform can address some of the challenges organizations have experienced as they move to the cloud to become more resilient and agile in the face of disruption. Conversely, an open platform can promote interoperability and innovation.

Ensuring the Successful Launch of Ads on Netflix

The Netflix TechBlog

Achieving this goal required extensive planning and implementation of measures to isolate the replay traffic environment from the production environment. We used this information to simulate a subscriber population through our AB testing platform. Finally, we conducted chaos experiments using the ChAP experimentation platform.

Evolving from Rule-based Classifier: Machine Learning Powered Auto Remediation in Netflix Data…

The Netflix TechBlog

Operational automation (including, but not limited to, auto diagnosis, auto remediation, auto configuration, auto tuning, auto scaling, auto debugging, and auto testing) is key to the success of modern data platforms. Therefore, the operational cost increases linearly with the number of failed jobs.

New Prometheus-based extensions enable intelligent observability for more than 200 additional technologies

Dynatrace

Building on its advanced analytics capabilities for Prometheus data, Dynatrace now enables you to create extensions based on Prometheus metrics. Without any coding, these extensions make it easy to ingest data from these technologies and provide tailor-made analysis views and zero-config alerting. Prometheus in Kubernetes ?and