Data and Processing - Technology Performance Pulse

Cutting Big Data Costs: Effective Data Processing With Apache Spark

DZone

SEPTEMBER 14, 2023

In today's data-driven world, efficient data processing plays a pivotal role in the success of any project. Apache Spark , a robust open-source data processing framework, has emerged as a game-changer in this domain.

Big Data

Big Data Processing Games Open Source

Batch Processing for Data Integration

DZone

NOVEMBER 7, 2023

In the labyrinth of data-driven architectures, the challenge of data integration—fusing data from disparate sources into a coherent, usable form — stands as one of the cornerstones. As businesses amass data at an unprecedented pace, the question of how to integrate this data effectively comes to the fore.

Processing

Processing Architecture Technology Technology

2. Diving Deeper into Psyberg: Stateless vs Stateful Data Processing

The Netflix TechBlog

NOVEMBER 14, 2023

By Abhinaya Shetty , Bharath Mummadisetty In the inaugural blog post of this series, we introduced you to the state of our pipelines before Psyberg and the challenges with incremental processing that led us to create the Psyberg framework within Netflix’s Membership and Finance data engineering team.

Processing

Processing Data Engineering Efficiency Analytics

Processing Time Series Data With QuestDB and Apache Kafka

DZone

APRIL 4, 2023

Apache Kafka is a battle-tested distributed stream-processing platform popular in the financial industry to handle mission-critical transactional workloads. Kafka’s ability to handle large volumes of real-time market data makes it a core infrastructure component for trading, risk management, and fraud detection.

Processing

Processing Database Infrastructure Testing

Exploring Parallel Processing: SIMD vs. MIMD Architectures

DZone

FEBRUARY 20, 2024

In the landscape of computer architecture, two prominent paradigms shape the realm of parallel processing: SIMD (Single Instruction, Multiple Data) and MIMD (Multiple Instruction, Multiple Data) architectures.

Architecture

Architecture Processing Efficiency Design

Dynatrace OpenPipeline: Stream processing data ingestion converges observability, security, and business data at massive scale for analytics and automation in context

Dynatrace

JANUARY 31, 2024

Organizations choose data-driven approaches to maximize the value of their data, achieve better business outcomes, and realize cost savings by improving their products, services, and processes. However, there are many obstacles and limitations along the way to becoming a data-driven organization.

Analytics

Analytics Processing Transportation Storage

Business Flow: Why IT operations teams should monitor business processes

Dynatrace

MARCH 12, 2024

The business process observability challenge Increasingly dynamic business conditions demand business agility; reacting to a supply chain disruption and optimizing order fulfillment are simple but illustrative examples. Most business processes are not monitored. First and foremost, it’s a data problem.

Processing

Processing Monitoring Analytics C++

Improving customer experience with business process monitoring

Dynatrace

DECEMBER 21, 2023

A business process is a collection of related, usually structured tasks or steps, performed in sequence, that achieve a defined business goal. Tasks may be manual or automatic, and many business processes will include a combination of both. Make better decisions by providing managers with real-time data about the business.

Processing

Processing Monitoring Retail Analytics

Medallion Architecture: Efficient Batch and Stream Processing Data Pipelines With Azure Databricks and Delta Lake

DZone

JULY 13, 2023

In today's data-driven world, organizations need efficient and scalable data pipelines to process and analyze large volumes of data. Medallion Architecture provides a framework for organizing data processing workflows into different zones, enabling optimized batch and stream processing.

Azure

Azure Architecture Efficiency Processing

Stream Processing vs. Batch Processing: What to Know

DZone

JANUARY 31, 2023

Big data is at the center of all business decisions these days. It refers to large volumes of data generated through different sources, and this data then provides the foundation for business decisions. There are different ways through which we can process data. What Is Batch Processing?

Processing

Processing Big Data Systems

Dynatrace completed Data Privacy Framework self-certification

Dynatrace

APRIL 22, 2024

To enable participating organizations to meet the EU requirements for transferring personal data to the U.S., the Data Privacy Framework (DPF) is designed to serve as an adequate data transfer mechanism under the GDPR. Data Privacy Framework Program (The EU-U.S. Benefits of Data Privacy Framework for Dynatrace customers.

Government

Government Programming Analytics Efficiency

Practical business process monitoring for real-time business observability

Dynatrace

FEBRUARY 9, 2024

Recent platform enhancements in the latest Dynatrace, including business events powered by Grail™, make accessing the goldmine of business data flowing through your IT systems easier than ever. The Business Flow app Business Flow, built with AppEngine, simplifies the configuration, monitoring, and analysis of business processes.

Processing

Processing Monitoring Analytics Games

Unlock the observability value of log data with processing at scale

Dynatrace

AUGUST 16, 2022

The data locked in your log files can be a goldmine for your application developers, operations teams, and your enterprise as a whole. However, it can be complicated , expensive , or even impossible to set up robust observability that makes use of this data. Log format inconsistency makes it a challenge to access critical data.

Processing

Processing Metrics Monitoring Java

Enhance data collection with Dynatrace OpenTelemetry Collector distribution

Dynatrace

MARCH 15, 2024

As organizations strive for observability and data democratization, OpenTelemetry emerges as a key technology to create and transfer observability data. Understanding OpenTelemetry OpenTelemetry is an open, vendor-neutral standard for creating, collecting, and transferring telemetry data, like traces, metrics, and logs.

Open Source

Open Source Best Practices Infrastructure Tuning

Batch Processing vs. Stream Processing: Why Batch Is Dying and Streaming Takes Over

DZone

MARCH 31, 2023

In the digital age, data is the new currency and is being used everywhere. From social media to IoT devices, businesses are generating more data than ever before. With this data comes the challenge of processing it in a timely and efficient way. Let’s recap some of the basics first.

Processing

Processing Social Media IoT Media

Our First Netflix Data Engineering Summit

The Netflix TechBlog

DECEMBER 14, 2023

Engineers from across the company came together to share best practices on everything from Data Processing Patterns to Building Reliable Data Pipelines. The result was a series of talks which we are now sharing with the rest of the Data Engineering community!

Data Engineering

Data Engineering Engineering Software Engineering Best Practices

Financial Data Engineering in SAS

DZone

JANUARY 8, 2024

Financial data engineering in SAS involves the management, processing, and analysis of financial data using the various tools and techniques provided by the SAS software suite. Here are some key aspects of financial data engineering in SAS: 1.

Data Engineering

Data Engineering Engineering Database Software

Data Integration in Real-Time Systems

DZone

NOVEMBER 7, 2023

In the rapidly evolving digital landscape, the role of data has shifted from being merely a byproduct of business to becoming its lifeblood. With businesses constantly in the race to stay ahead, the process of integrating this data becomes crucial.

Systems

Systems Analytics Architecture Engineering

Exploring Apache Airflow for Batch Processing Scenario

DZone

NOVEMBER 30, 2023

It uses Python as its programming language and offers a flexible architecture suited for both small-scale and large-scale data processing. The platform supports the concept of Directed Acyclic Graphs to define workflows, making it easy to visualize complex data pipelines.

Processing

Processing Open Source Programming Architecture

1. Streamlining Membership Data Engineering at Netflix with Psyberg

The Netflix TechBlog

NOVEMBER 14, 2023

By Abhinaya Shetty , Bharath Mummadisetty At Netflix, our Membership and Finance Data Engineering team harnesses diverse data related to plans, pricing, membership life cycle, and revenue to fuel analytics, power various dashboards, and make data-informed decisions. We expect complete and accurate data at the end of each run.

Data Engineering

Data Engineering Engineering Processing Games

AI techniques enhance and accelerate exploratory data analytics

Dynatrace

FEBRUARY 28, 2024

In a digital-first world, site reliability engineers and IT data analysts face numerous challenges with data quality and reliability in their quest for cloud control. Increasingly, organizations seek to address these problems using AI techniques as part of their exploratory data analytics practices.

Analytics

Analytics Metrics Media Monitoring

Performance Optimization in ETL Processes

DZone

NOVEMBER 20, 2023

ETL—Extract, Transform, Load—is far more than a set of operations; it's a complex dance that transforms raw data into valuable insights, serving as the critical backbone for a range of applications, from data analytics and business intelligence to real-time decision-making platforms. What makes ETL performance such an imperative?

Processing

Processing Performance Analytics Speed

Data Reprocessing Pipeline in Asset Management Platform @Netflix

The Netflix TechBlog

MARCH 10, 2023

This platform has evolved from supporting studio applications to data science applications, machine-learning applications to discover the assets metadata, and build various data facts. Hence we built the data pipeline that can be used to extract the existing assets metadata and process it specifically to each new use case.

Media

Media Traffic Processing Design

How TripleLift Built an Adtech Data Pipeline Processing Billions of Events Per Day

High Scalability

JUNE 15, 2020

This is a guest post by Eunice Do , Data Engineer at TripleLift , a technology company leading the next generation of programmatic advertising. The system is the data pipeline at TripleLift. TripleLift is an adtech company, and like most companies in this industry, we deal with high volumes of data on a daily basis.

Processing

Processing Data Engineering Efficiency Engineering

How a data lakehouse brings data insights to life

Dynatrace

OCTOBER 4, 2022

For IT infrastructure managers and site reliability engineers, or SREs , logs provide a treasure trove of data. But on their own, logs present just another data silo as IT professionals attempt to troubleshoot and remediate problems. Data volume explosion in multicloud environments poses log issues.

Analytics

Analytics Storage Infrastructure Metrics

Building an Optimized Data Pipeline on Azure Using Spark, Data Factory, Databricks, and Synapse Analytics

DZone

APRIL 11, 2023

Data processing in the cloud has become increasingly popular due to its scalability, flexibility, and cost-effectiveness. This article will explore how these technologies can be used together to create an optimized data pipeline for data processing in the cloud.

Azure

Azure Analytics Storage Cloud

Introducing Dynatrace built-in data observability on Davis AI and Grail

Dynatrace

JANUARY 31, 2024

I have ingested important custom data into Dynatrace, critical to running my applications and making accurate business decisions… but can I trust the accuracy and reliability?” ” Welcome to the world of data observability. At its core, data observability is about ensuring the availability, reliability, and quality of data.

DevOps

DevOps Analytics Airlines Metrics

Measuring the importance of data quality to causal AI success

Dynatrace

JANUARY 4, 2024

While this approach can be effective if the model is trained with a large amount of data, even in the best-case scenarios, it amounts to an informed guess, rather than a certainty. But to be successful, data quality is critical. Teams need to ensure the data is accurate and correctly represents real-world scenarios. Consistency.

Government

Government Analytics Benchmarking Storage

Managing Data Residency: Concepts and Theory

DZone

MAY 12, 2023

I believe that chief among them is d ata residency or data location: Data localization or data residency law requires data about a nation's citizens or residents to be collected, processed, and/or stored inside the country, often before being transferred internationally.

Cloud

Cloud Systems Processing

Visualizing IoT Data With MQTT, QuestDB, and Grafana

DZone

AUGUST 23, 2023

Monitoring Time-Series IoT Device Data Time-series data is crucial for IoT device monitoring and data visualization in industries such as agriculture, renewable energy, and meteorology. In this tutorial, we will guide you through the process of setting up a monitoring system for IoT device data.

IoT

IoT Energy Open Source Analytics

Enhance data collection with Dynatrace OTel Collector distribution

Dynatrace

MARCH 15, 2024

As organizations strive for observability and data democratization, OpenTelemetry emerges as a key technology to create and transfer observability data. Understanding OpenTelemetry OpenTelemetry is an open, vendor-neutral standard for creating, collecting, and transferring telemetry data, like traces, metrics, and logs.

Open Source

Open Source Best Practices Infrastructure Tuning

Edge Data Platforms, Real-Time Services, and Modern Data Trends

DZone

AUGUST 18, 2023

We all know that data is being generated at an unprecedented rate. You may also know that this has led to an increase in the demand for efficient and secure data storage solutions that won’t break the bank. This article will explore what edge data platforms and real-time services are, why they are important, and how they can be used.

IoT

IoT Media Latency Storage

What is a data lakehouse? Combining data lakes and warehouses for the best of both worlds

Dynatrace

OCTOBER 4, 2022

While data lakes and data warehousing architectures are commonly used modes for storing and analyzing data, a data lakehouse is an efficient third way to store and analyze data that unifies the two architectures while preserving the benefits of both. What is a data lakehouse? How does a data lakehouse work?

Artificial Intelligence

Artificial Intelligence Analytics Storage Government

Key Advantages of DBMS for Efficient Data Management

Scalegrid

JANUARY 5, 2024

Enhanced data security, better data integrity, and efficient access to information. This article cuts through the complexity to showcase the tangible benefits of DBMS, equipping you with the knowledge to make informed decisions about your data management strategies. What are the key advantages of DBMS?

Efficiency

Efficiency Storage Database Scalability

MongoDB Rollback: How to Minimize Data Loss

Scalegrid

JANUARY 19, 2024

When a MongoDB rollback happens, it can cause trouble to your data integrity and system consistency. Understanding how to address a rollback is critical for minimizing potential data loss and maintaining seamless operations. With direct, actionable insights, prepare to navigate the complexities of rollbacks with confidence.

Database

Database Network Servers Monitoring

Dynatrace Perform 2024 Guide: Deriving business value from AI data analysis

Dynatrace

JANUARY 23, 2024

AI data analysis can help development teams release software faster and at higher quality. So how can organizations ensure data quality, reliability, and freshness for AI-driven answers and insights? And how can they take advantage of AI without incurring skyrocketing costs to store, manage, and query data?

Performance

Performance DevOps Innovation Artificial Intelligence

Data privacy by design: How an observability platform protects data security

Dynatrace

APRIL 19, 2023

Creating an ecosystem that facilitates data security and data privacy by design can be difficult, but it’s critical to securing information. When organizations focus on data privacy by design, they build security considerations into cloud systems upfront rather than as a bolt-on consideration.

Design

Design Storage Programming Analytics

Boost DevOps maturity with observability and a data lakehouse

Dynatrace

JUNE 9, 2023

ln a world driven by macroeconomic uncertainty, businesses increasingly turn to data-driven decision-making to stay agile. They’re unleashing the power of cloud-based analytics on large data sets to unlock the insights they and the business need to make smarter decisions. All of these factors challenge DevOps maturity.

DevOps

DevOps Analytics Storage Metrics

Overcoming Challenges and Best Practices for Data Migration From On-Premise to Cloud

DZone

MARCH 29, 2023

Data migration is the process of moving data from one location to another, which is an essential aspect of cloud migration. Data migration involves transferring data from on-premise storage to the cloud. With the rapid adoption of cloud computing , businesses are moving their IT infrastructure to the cloud.

Best Practices

Best Practices Cloud Storage Data Engineering

Batch Processing: 4 Tactics to Make It Cost-Efficient and Reliable

DZone

AUGUST 10, 2023

Many modern applications have a batch processing aspect to them and regularly run high-volume, repetitive data jobs. Here are the four tactics for cost-efficient and resilient batch processing: Here are the four tactics for cost-efficient and resilient batch processing:

Efficiency

Efficiency Processing Cloud Technology

Auto-Diagnosis and Remediation in Netflix Data Platform

The Netflix TechBlog

JANUARY 13, 2022

By Vikram Srivastava and Marcelo Mayworm Netflix has one of the most complex data platforms in the cloud on which our data scientists and engineers run batch and streaming workloads. As our subscribers grow worldwide and Netflix enters the world of gaming , the number of batch workflows and real-time data pipelines increases rapidly.

Big Data

Big Data Infrastructure Metrics Hardware

The history of Grail: Why you need a data lakehouse

Dynatrace

OCTOBER 4, 2022

Some time ago, at a restaurant near Boston, three Dynatrace colleagues dined and discussed the growing data challenge for enterprises. At its core, this challenge involves a rapid increase in the amount—and complexity—of data collected within a company. Work with different and independent data types. Thus, Grail was born.

Artificial Intelligence

Artificial Intelligence Analytics Storage Architecture

What is Greenplum Database? Intro to the Big Data Database

Scalegrid

MAY 13, 2020

Greenplum Database is a massively parallel processing (MPP) SQL database that is built and based on PostgreSQL. It can scale towards a multi-petabyte level data workload without a single issue, and it allows access to a cluster of powerful servers that will work together within a single SQL interface where you can view all of the data.

Big Data

Big Data Database Artificial Intelligence Open Source

Upgrade to the Data explorer to level up your data visualizations and analysis

Dynatrace

SEPTEMBER 14, 2022

As an industry leader, Dynatrace promotes primarily using software and AI to deal with this complexity at scale instead of just putting data on dashboards. Does that mean that reactive and exploratory data analysis, often done manually and with the help of dashboards, are dead? Why today’s data analytics solutions still fail us.

Metrics

Metrics Analytics Monitoring Efficiency

Cutting Big Data Costs: Effective Data Processing With Apache Spark

Batch Processing for Data Integration

Trending Sources

2. Diving Deeper into Psyberg: Stateless vs Stateful Data Processing

Processing Time Series Data With QuestDB and Apache Kafka

Exploring Parallel Processing: SIMD vs. MIMD Architectures

Dynatrace OpenPipeline: Stream processing data ingestion converges observability, security, and business data at massive scale for analytics and automation in context

Business Flow: Why IT operations teams should monitor business processes

Improving customer experience with business process monitoring

Medallion Architecture: Efficient Batch and Stream Processing Data Pipelines With Azure Databricks and Delta Lake

Stream Processing vs. Batch Processing: What to Know

Dynatrace completed Data Privacy Framework self-certification

Practical business process monitoring for real-time business observability

Unlock the observability value of log data with processing at scale

Enhance data collection with Dynatrace OpenTelemetry Collector distribution

Batch Processing vs. Stream Processing: Why Batch Is Dying and Streaming Takes Over

Our First Netflix Data Engineering Summit

Financial Data Engineering in SAS

Data Integration in Real-Time Systems

Exploring Apache Airflow for Batch Processing Scenario

1. Streamlining Membership Data Engineering at Netflix with Psyberg

AI techniques enhance and accelerate exploratory data analytics

Performance Optimization in ETL Processes

Data Reprocessing Pipeline in Asset Management Platform @Netflix

How TripleLift Built an Adtech Data Pipeline Processing Billions of Events Per Day

How a data lakehouse brings data insights to life

Building an Optimized Data Pipeline on Azure Using Spark, Data Factory, Databricks, and Synapse Analytics

Introducing Dynatrace built-in data observability on Davis AI and Grail

Measuring the importance of data quality to causal AI success

Managing Data Residency: Concepts and Theory

Visualizing IoT Data With MQTT, QuestDB, and Grafana

Enhance data collection with Dynatrace OTel Collector distribution

Edge Data Platforms, Real-Time Services, and Modern Data Trends

What is a data lakehouse? Combining data lakes and warehouses for the best of both worlds

Key Advantages of DBMS for Efficient Data Management

MongoDB Rollback: How to Minimize Data Loss

Dynatrace Perform 2024 Guide: Deriving business value from AI data analysis

Data privacy by design: How an observability platform protects data security

Boost DevOps maturity with observability and a data lakehouse

Overcoming Challenges and Best Practices for Data Migration From On-Premise to Cloud

Batch Processing: 4 Tactics to Make It Cost-Efficient and Reliable

Auto-Diagnosis and Remediation in Netflix Data Platform

The history of Grail: Why you need a data lakehouse

What is Greenplum Database? Intro to the Big Data Database

Upgrade to the Data explorer to level up your data visualizations and analysis

Stay Connected