Remove 2008 Remove Big Data Remove Design Remove Systems
article thumbnail

Hacking with AWS at The Next Web Hackaton - All Things Distributed

All Things Distributed

Werner Vogels weblog on building scalable and robust distributed systems. The images from the 2008 TNW Conference have travelled around the world in my Animoto demo: This year TNW is showing that it is not just a conference for talkers but also for builders by organizing a massive Hackaton in the two days running up to the conference.

AWS 93
article thumbnail

Structural Evolutions in Data

O'Reilly

Each time, the underlying implementation changed a bit while still staying true to the larger phenomenon of “Analyzing Data for Fun and Profit.” ” They weren’t quite sure what this “data” substance was, but they’d convinced themselves that they had tons of it that they could monetize.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

Data Mining Problems in Retail

Highly Scalable

Most of this article represents an overview of the results published by retailers and researchers who built practical decision making and optimization systems combining abstract economic models with data mining methods. The design of the model heavily depends on the problem. Propensity to category expansion. Propensity to churn.

Retail 152
article thumbnail

Amazon Cloudfront is Streaming Media 2010 Editor's pick - All.

All Things Distributed

Werner Vogels weblog on building scalable and robust distributed systems. Here are the laurels given by the editors: Debuting in November 2008, Amazons entry into the CDN market quickly became a major player. a Fast and Scalable NoSQL Database Service Designed for Internet Scale Applications. All Things Distributed.

Media 60
article thumbnail

Should You Use ClickHouse as a Main Operational Database?

Percona

In a partitioned massively parallel database system, the storage format and sorting algorithm may not be optimized for that operation as we are reading multiple partitions in parallel. ClickHouse is a great massively parallel analytical system. At the same time, it was not originally designed that way. Conclusion.