Data Mining Problems in Retail

Highly Scalable

Retail is one of the most important business domains for data science and data mining applications because of its prolific data and numerous optimization problems such as optimal prices, discounts, recommendations, and stock levels that can be solved using data analysis methods. However, many of these models are highly parametric (i.e.


A Decade of Dynamo: Powering the next wave of high-performance, internet-scale applications

All Things Distributed

It all started in 2004 when Amazon was running Oracle's enterprise edition with clustering and replication. We were pushing the limits of what was a leading commercial database at the time and were unable to sustain the availability, scalability and performance needs that our growing Amazon business demanded.
