article thumbnail

What is Greenplum Database? Intro to the Big Data Database

Scalegrid

Greenplum Database is a massively parallel processing (MPP) SQL database that is built and based on PostgreSQL. It can scale towards a multi-petabyte level data workload without a single issue, and it allows access to a cluster of powerful servers that will work together within a single SQL interface where you can view all of the data.

Big Data 321
article thumbnail

In-Stream Big Data Processing

Highly Scalable

The shortcomings and drawbacks of batch-oriented data processing were widely recognized by the Big Data community quite a long time ago. It is clear that distributed in-stream data processing has something to do with query processing in distributed relational databases. Basics of Distributed Query Processing.

Big Data 154
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

What Should You Know About Graph Database’s Scalability?

DZone

Having a distributed and scalable graph database system is highly sought after in many enterprise scenarios. Do Not Be Misled Designing and implementing a scalable graph database system has never been a trivial task.

article thumbnail

Data Engineers of Netflix?—?Interview with Kevin Wylie

The Netflix TechBlog

I was later hired into my first purely data gig where I was able to deepen my knowledge of big data. After that, I joined MySpace back at its peak as a data engineer and got my first taste of data warehousing at internet-scale. Both were appliances located in our own data center.

article thumbnail

Driving down the cost of Big-Data analytics - All Things Distributed

All Things Distributed

Driving down the cost of Big-Data analytics. The Amazon Elastic MapReduce (EMR) team announced today the ability to seamlessly use Amazon EC2 Spot Instances with their service, significantly driving down the cost of data analytics in the cloud. a Fast and Scalable NoSQL Database Service Designed for Internet Scale Applications.

Big Data 112
article thumbnail

Hybrid cloud infrastructure explained: Weighing the pros, cons, and complexities

Dynatrace

Hybrid cloud architecture is a computing environment that shares data and applications on a combination of public clouds and on-premises private clouds. Public cloud refers to on-demand infrastructure and services provided by a third party over the public internet. With public clouds, multiple organizations share resources.

article thumbnail

Job Openings in AWS - Senior Leader in Database Services - All.

All Things Distributed

Job Openings in AWS - Senior Leader in Database Services. This week it is an opening for senior leaders with AWS Database Services. AWS Database Services is responsible for setting the database strategy and delivering distributed structured storage services to our AWS customers. Comments (). Contact Info. Werner Vogels.

AWS 77