article thumbnail

Supporting Diverse ML Systems at Netflix

The Netflix TechBlog

The Machine Learning Platform (MLP) team at Netflix provides an entire ecosystem of tools around Metaflow , an open source machine learning infrastructure framework we started, to empower data scientists and machine learning practitioners to build and manage a variety of ML systems. ETL workflows), as well as downstream (e.g.

Systems 226
article thumbnail

Helping VFX studios pave a path to the cloud

The Netflix TechBlog

But it’s not easy: to pull this off, VFX studios need to build and operate serious technical infrastructure (compute, storage, networking, and software licensing), otherwise known as a “ render farm.” This ultimately results in more compelling entertainment for Netflix members. including AWS Thinkbox Deadline and Pixar’s Tractor.

Cloud 282
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

What is chaos engineering?

Dynatrace

Chaos engineering is a method of testing distributed software that deliberately introduces failure and faulty scenarios to verify its resilience in the face of random disruptions. Practitioners subject software to a controlled, simulated crisis to test for unstable behavior. Chaos engineers ask why. The history of chaos engineering.

article thumbnail

Remote Workstations for the Discerning Artists

The Netflix TechBlog

Artists like to work at places where they can create groundbreaking entertainment instead of worrying about getting access to the software or source files they need. Some artists needed Centos 7 to support their compositing software, while others required Windows to use their pre-visualization software.

article thumbnail

Demystifying Interviewing for Backend Engineers @ Netflix

The Netflix TechBlog

If you want to practice, focus on medium-difficulty real-world problems you might encounter in a software engineering role. Streaming & Gaming Technologies ( [link] ) You are a distributed systems engineer working on product backend systems that support streaming video and/or mobile & cloud games.

article thumbnail

Migrating Critical Traffic At Scale with No Downtime?—?Part 2

The Netflix TechBlog

Behind these perfect moments of entertainment is a complex mechanism, with numerous gears and cogs working in harmony. This is where large-scale system migrations come into play. By tracking metrics only at the level of service being updated, we might miss capturing deviations in broader end-to-end system functionality.

Traffic 279
article thumbnail

Highlights from the O'Reilly Software Architecture Conference in San Jose 2019

O'Reilly Software

Experts explore software architecture security, design heuristics, Next Architecture, and more. Experts from across the software architecture world are coming together in San Jose for the O'Reilly Software Architecture Conference. Adam Tornhill offers a new perspective on software development that will change how you view code.