Remove AWS Remove Exercise Remove Infrastructure Remove Open Source
article thumbnail

Open-Sourcing Metaflow, a Human-Centric Framework for Data Science

The Netflix TechBlog

by David Berg , Ravi Kiran Chirravuri , Romain Cledat , Savin Goyal , Ferras Hamad , Ville Tuulos tl;dr Metaflow is now open-source! About two years ago, we, at our newly formed Machine Learning Infrastructure team started asking our data scientists a question: “What is the hardest thing for you as a data scientist at Netflix?”

article thumbnail

Open-Sourcing Metaflow, a Human-Centric Framework for Data Science

The Netflix TechBlog

by David Berg , Ravi Kiran Chirravuri , Romain Cledat , Savin Goyal , Ferras Hamad , Ville Tuulos tl;dr Metaflow is now open-source! About two years ago, we, at our newly formed Machine Learning Infrastructure team started asking our data scientists a question: “What is the hardest thing for you as a data scientist at Netflix?”

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

MLOps and DevOps: Why Data Makes It Different

O'Reilly

As with many burgeoning fields and disciplines, we don’t yet have a shared canonical infrastructure stack or best practices for developing and deploying data-intensive applications. What: The Modern Stack of ML Infrastructure. Adapted from the book Effective Data Science Infrastructure. Foundational Infrastructure Layers.

DevOps 137
article thumbnail

What is APM?

Dynatrace

Millions of lines of code comprise these apps, and they include hundreds of interconnected digital services and open-source solutions , and run in containerized environments hosted across multiple cloud services. Virtual desktop infrastructure (VDI) monitoring to maximize the productivity of employees using VDI.

article thumbnail

Scaling Amazon ElastiCache for Redis with Online Cluster Resizing

All Things Distributed

Four years ago, as part of our AWS fast data journey, we introduced Amazon ElastiCache for Redis , a fully managed, in-memory data store that operates at microsecond latency. While open source Redis comes with primitives to help reshard a cluster, they are inadequate. Atomic slot migration.

Games 112
article thumbnail

Failure Modes and Continuous Resilience

Adrian Cockcroft

There are many possible failure modes, and each exercises a different aspect of resilience. This is why most AWS regions have three availability zones. The third team is the infrastructure platform team, who deal with datacenter and cloud based resources. If something fails, there should be another way for the system to succeed.

Latency 52
article thumbnail

Failure Modes and Continuous Resilience

Adrian Cockcroft

There are many possible failure modes, and each exercises a different aspect of resilience. This is why most AWS regions have three availability zones. The third team is the infrastructure platform team, who deal with datacenter and cloud based resources. If something fails, there should be another way for the system to succeed.

Latency 53