article thumbnail

Our First Netflix Data Engineering Summit

The Netflix TechBlog

Engineers from across the company came together to share best practices on everything from Data Processing Patterns to Building Reliable Data Pipelines. The result was a series of talks which we are now sharing with the rest of the Data Engineering community! In this video, Sr. In this video, Sr.

article thumbnail

Offline Data Pipeline Best Practices Part 1:Optimizing Airflow Job Parameters for Apache Hive

DZone

Welcome to the first post in our exciting series on mastering offline data pipeline's best practices, focusing on the potent combination of Apache Airflow and data processing engines like Hive and Spark. Working together, they form the backbone of many modern data engineering solutions.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

Friends don't let friends build data pipelines

Abhishek Tiwari

Unfortunately, building data pipelines remains a daunting, time-consuming, and costly activity. Not everyone is operating at Netflix or Spotify scale data engineering function. Often companies underestimate the necessary effort and cost involved to build and maintain data pipelines.

Latency 63
article thumbnail

Scaling Appsec at Netflix (Part 2)

The Netflix TechBlog

Security Strategy and Roadmap, Threat Model, Deliverable Tracking, Residual Risk Criteria, etc). Our hiring has reflected that shift, and we’ve added more dedicated software engineers (SWEs) to the team to help us build out software. Our focus has been on improving overall security assurance as opposed to just vulnerability prevention.