Data Engineering, Processing and Strategy - Technology Performance Pulse

Our First Netflix Data Engineering Summit

The Netflix TechBlog

DECEMBER 14, 2023

Engineers from across the company came together to share best practices on everything from Data Processing Patterns to Building Reliable Data Pipelines. The result was a series of talks which we are now sharing with the rest of the Data Engineering community! In this video, Sr.

Data Engineering

Data Engineering Engineering Software Engineering Best Practices

1. Streamlining Membership Data Engineering at Netflix with Psyberg

The Netflix TechBlog

NOVEMBER 14, 2023

By Abhinaya Shetty and Bharath Mummadisetty. At Netflix, our Membership and Finance Data Engineering team harnesses diverse data related to plans, pricing, membership life cycle, and revenue to fuel analytics, power various dashboards, and make data-informed decisions. What is late-arriving data? Let’s dive in!

Data Engineering

Data Engineering Engineering Processing Games

Offline Data Pipeline Best Practices Part 1: Optimizing Airflow Job Parameters for Apache Hive

DZone

DECEMBER 27, 2023

Welcome to the first post in our series on offline data pipeline best practices, focusing on the combination of Apache Airflow and data processing engines like Hive and Spark. Working together, they form the backbone of many modern data engineering solutions.

Best Practices

Best Practices Data Engineering Big Data Games

What is IT automation?

Dynatrace

JULY 6, 2022

And what are the best strategies to reduce manual labor so your team can focus on more mission-critical issues? At its most basic, automating IT processes works by executing scripts or procedures either on a schedule or in response to particular events, such as checking a file into a code repository. So, what is IT automation?

Artificial Intelligence

Artificial Intelligence Tuning Strategy Big Data
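
The excerpt above describes automation as running scripts in response to particular events, such as a file being checked into a code repository. As a rough illustration only, the sketch below shows the event-driven half of that idea in Python. The `Automation` class, the `file_checked_in` event name, and the `run_tests` handler are all hypothetical names invented for this example; they do not come from the Dynatrace article or any real tool, and scheduled execution is not shown.

```python
from typing import Callable, Dict, List

# A handler takes an event payload and returns a short status message.
Handler = Callable[[dict], str]


class Automation:
    """Hypothetical registry that maps event names to the scripts they trigger."""

    def __init__(self) -> None:
        self._handlers: Dict[str, List[Handler]] = {}

    def on(self, event: str, handler: Handler) -> None:
        # Register a handler to run whenever `event` is emitted.
        self._handlers.setdefault(event, []).append(handler)

    def emit(self, event: str, payload: dict) -> List[str]:
        # Run every handler registered for this event, in registration order.
        # Events with no handlers simply produce an empty list.
        return [handler(payload) for handler in self._handlers.get(event, [])]


def run_tests(payload: dict) -> str:
    # Stand-in for a real script; it only reports what it would do.
    return f"running tests for {payload['file']}"


auto = Automation()
auto.on("file_checked_in", run_tests)
results = auto.emit("file_checked_in", {"file": "pipeline.py"})
print(results)
```

A real system would typically receive such events from a repository webhook or a scheduler rather than from a direct `emit` call.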

Incremental Processing using Netflix Maestro and Apache Iceberg

The Netflix TechBlog

NOVEMBER 20, 2023

By Jun He, Yingyi Zhang, and Pawan Dixit. Incremental processing is an approach to process new or changed data in workflows. The key advantage is that it only incrementally processes data that are newly added or updated to a dataset, instead of re-processing the complete dataset.

Processing

Processing Big Data Efficiency Engineering
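
The incremental-processing idea in the excerpt above (handle only rows that are new or updated, not the whole dataset) can be sketched with a simple watermark. This is a minimal illustration under stated assumptions, not how Maestro or Iceberg implement it: the `Row` type, its `updated_at` field, and the `incremental_run` function are hypothetical names made up for this example, and the toy version still iterates over the source to find changed rows.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Tuple


@dataclass(frozen=True)
class Row:
    key: str
    value: int
    updated_at: int  # hypothetical change timestamp (e.g. epoch seconds)


def incremental_run(
    source: Iterable[Row], target: Dict[str, Row], watermark: int
) -> Tuple[int, int]:
    """Upsert only rows changed after `watermark`.

    Returns (new_watermark, number_of_rows_processed).
    """
    new_watermark = watermark
    processed = 0
    for row in source:
        if row.updated_at > watermark:  # skip rows handled by earlier runs
            target[row.key] = row       # insert or overwrite by key
            new_watermark = max(new_watermark, row.updated_at)
            processed += 1
    return new_watermark, processed


source = [Row("a", 1, 100), Row("b", 2, 200)]
target: Dict[str, Row] = {}

# First run: everything is new, so both rows are processed.
watermark, count_first = incremental_run(source, target, watermark=0)

# Later, "a" is updated; the second run touches only that row.
source.append(Row("a", 5, 300))
watermark, count_second = incremental_run(source, target, watermark)
```

Persisting the watermark between runs is what lets each run skip data it has already handled; a full reprocess would correspond to resetting the watermark to zero.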

A Day in the Life of an Experimentation and Causal Inference Scientist @ Netflix

The Netflix TechBlog

MARCH 2, 2021

At Netflix, our data scientists span many areas of technical specialization, including experimentation, causal inference, machine learning, NLP, modeling, and optimization. Together with data analytics and data engineering, we comprise the larger, centralized Data Science and Engineering group.

Analytics

Analytics C++ Innovation Engineering

Friends don't let friends build data pipelines

Abhishek Tiwari

JULY 12, 2018

In recent times, in order to gain valuable insights or to develop data-driven products, companies such as Netflix, Spotify, Uber, and Airbnb have built internal data pipelines. If built correctly, data pipelines can offer strategic advantages to the business. Depending on frameworks, data processing units (a.k.a

Latency

Latency Analytics Scalability Engineering

Optimizing data warehouse storage

The Netflix TechBlog

DECEMBER 21, 2020

There are several benefits of such optimizations like saving on storage, faster query time, cheaper downstream processing, and an increase in developer productivity by removing additional ETLs written only for query performance improvement. Some of the optimizations are prerequisites for a high-performance data warehouse.

Storage

Storage Latency Efficiency Data Engineering

Analytics at Netflix: Who we are and what we do

The Netflix TechBlog

SEPTEMBER 18, 2020

But there is far less agreement on what that term “data analytics” actually means, or what to call the people responsible for the work. Even within Netflix, we have many groups that do some form of data analysis, including business strategy and consumer insights.

Analytics

Analytics Engineering Data Engineering Tuning

Data pipeline asset management with Dataflow

The Netflix TechBlog

FEBRUARY 9, 2022

Let’s define some requirements that we are interested in delivering to the Netflix data engineers or anyone who would like to schedule a workflow with some external assets in it. By the end of the migration process our Jenkins configuration went from: Figure 4. The slightly improved approach is shown on the diagram below.

Storage

Storage Data Engineering Testing Code

Formulating ‘Out of Memory Kill’ Prediction on the Netflix App as a Machine Learning Problem

The Netflix TechBlog

JULY 21, 2022

Since memory management is not something one usually associates with classification problems, this blog focuses on formulating the problem as an ML problem and the data engineering that goes along with it. We now explore each of these components individually, while highlighting the nuances of the data pipeline and pre-processing.

Big Data

Big Data Cache Engineering Data Engineering

Expanding the Cloud: Introducing Amazon QuickSight

All Things Distributed

OCTOBER 7, 2015

In such a data-intensive environment, making key business decisions such as running marketing and sales campaigns, logistic planning, financial analysis and ad targeting requires deriving insights from these data. However, the data infrastructure to collect, store and process data is geared toward developers (e.g.,

Cloud

Cloud Big Data AWS Analytics

A Day in the Life of a Content Analytics Engineer

The Netflix TechBlog

OCTOBER 30, 2020

We partner closely with the business strategy team to provide as much information as we can to our content executives, so that, combined with their industry experience, they can make the best decisions for Netflix. Being an Analytics Engineer is like being a hybrid of a librarian

Analytics

Analytics Engineering Innovation Metrics
