article thumbnail

Data Storage Formats for Big Data Analytics: Performance and Cost Implications of Parquet, Avro, and ORC

DZone

One key factor that significantly affects the performance of data processing is the storage format of the data. This article explores the impact of different storage formats, specifically Parquet, Avro, and ORC on query performance and costs in big data environments on Google Cloud Platform (GCP).

Big Data 274
article thumbnail

Storage Types Used on Cloud Computing Platforms

DZone

Because of the emergence of cloud services, a broad range of storage choices are now easily available to fulfill the different demands of both organizations and people. These storage alternatives have been designed to meet a range of requirements, including performance, scalability, durability, and price.

Storage 272
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Introduction to Azure Data Lake Storage Gen2

DZone

Built on Azure Blob Storage, Azure Data Lake Storage Gen2 is a suite of features for big data analytics. Azure Data Lake Storage Gen1 and Azure Blob Storage's capabilities are combined in Data Lake Storage Gen2. For instance, Data Lake Storage Gen2 offers scale, file-level security, and file system semantics.

Azure 247
article thumbnail

A Comprehensive Guide: Installing Docker, Running Containers, Managing Storage, and Setting up Networking

DZone

This comprehensive guide will walk you through the crucial steps of setting up networking, managing storage, running containers, and installing Docker. Thanks to Docker, a leading containerization platform, applications can be packaged and distributed more easily in portable, isolated environments.

Storage 279
article thumbnail

How We Built a High-Performance Storage Layer for Our Ultra-Heterogeneous Computing Cluster

DZone

Finding a storage solution for our ultra-heterogeneous computing cluster was challenging. We tried two solutions: object storage with s3fs + network-attached storage (NAS) and Alluxio + Fluid + object storage , but they had limitations and performance issues.

Storage 227
article thumbnail

How To Debug Mobile App Database Problems and Optimize Data Storage Performance

DZone

However, lurking beneath the surface lies a complex web of data storage and retrieval. That's why knowing how to debug mobile app database problems and optimize data storage performance is essential for developers seeking excellence. In the dynamic realm of mobile app development , a flawless user experience is the ultimate goal.

Storage 201
article thumbnail

Introduction to Kafka Tiered Storage at Uber

Uber Engineering

Kafka Tiered Storage, developed in collaboration with the Apache Kafka community, introduces the separation of storage and processing in brokers, significantly improving the scalability, reliability, and efficiency of Kafka clusters.

Storage 75