Hyper Scale VPC Flow Logs enrichment to provide Network Insight

The Netflix TechBlog

To gain visibility into these logs, we need to ingest and enrich this data. It is easier to tune a large Spark job for a consistent volume of data. In other words, we are able to ensure that our Spark app does not “eat” more data than it was tuned to handle. We named this library Sqooby.

Incremental Processing using Netflix Maestro and Apache Iceberg

The Netflix TechBlog

These challenges are currently addressed in suboptimal and less cost-efficient ways by individual local teams, using approaches such as Lookback: a generic and simple approach that data engineers use to solve the data accuracy problem. Users configure the workflow to read the data in a window (e.g.

Organise your engineering teams around the work by reteaming

Abhishek Tiwari

Because you are changing team composition, you need robust norms of conduct and engineering practices in place. Secondly, fine-tune team composition based on the work. Thirdly, let engineers themselves choose the delivery teams and organise them around the initiative. Velocity is directional (a vector, in mathematical terms).