article thumbnail

Kubernetes for Big Data Workloads

Abhishek Tiwari

Kubernetes has emerged as go to container orchestration platform for data engineering teams. In 2018, a widespread adaptation of Kubernetes for big data processing is anitcipated. Organisations are already using Kubernetes for a variety of workloads [1] [2] and data workloads are up next. Performance.

article thumbnail

Advancing Application Performance with NVMe Storage, Part 3

DZone

NVMe Storage Use Cases. NVMe storage's strong performance, combined with the capacity and data availability benefits of shared NVMe storage over local SSD, makes it a strong solution for AI/ML infrastructures of any size. There are several AI/ML focused use cases to highlight.

Storage 100
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

In-Stream Big Data Processing

Highly Scalable

The shortcomings and drawbacks of batch-oriented data processing were widely recognized by the Big Data community quite a long time ago. The pipelines can be stateful and the engine’s middleware should provide a persistent storage to enable state checkpointing. High performance and mobility.

Big Data 154
article thumbnail

Advancing Application Performance With NVMe Storage, Part 2

DZone

For example, one well-respected vendor's standard solution is limited to 7.5TB of internal storage, and it can only scale to 30TB.

Storage 100
article thumbnail

Advancing Application Performance with NVMe Storage, Part 1

DZone

With big data on the rise and data algorithms advancing, the ways in which technology has been applied to real-world challenges have grown more automated and autonomous. Financial analysis with real-time analytics is used for predicting investments and drives the FinTech industry's needs for high-performance computing.

article thumbnail

How to Optimize Elasticsearch for Better Search Performance

DZone

These processes are only possible with a distributed architecture and parallel processing mechanisms that Big Data tools are based on. One of the top trending open-source data storage that responds to most of the use cases is Elasticsearch.

Big Data 157
article thumbnail

What Should You Know About Graph Database’s Scalability?

DZone

It has been a norm to perceive that distributed databases use the method of adding cheap PC(s) to achieve scalability (storage and computing) and attempt to store data once and for all on demand. However, doing the same cannot achieve equivalent scalability without massively sacrificing query performance on graph systems.