Remove c
article thumbnail

MapReduce Patterns, Algorithms, and Use Cases

Highly Scalable

Several practical case studies are also provided. Applications: Log Analysis, Data Querying. Solution: Problem description is split in a set of specifications and specifications are stored as input data for Mappers. Case Study: Simulation of a Digital Communication System. Applications: ETL, Data Analysis.

C++ 144
article thumbnail

Probabilistic Data Structures for Web Analytics and Data Mining

Highly Scalable

This approach often leads to heavyweight high-latency analytical processes and poor applicability to realtime use cases. The picture above depicts the fact that this data set basically occupies 40MB of memory (10 million of 4-byte elements). what is the cardinality of the data set)? Case Study. Case Study.

Analytics 191
article thumbnail

Data Mining Problems in Retail

Highly Scalable

Data mining offers a variety of techniques for nonparametric modeling that helps to create flexible and practical models. Many articles and case studies published during the last decade successfully achieve the balance between abstract models and machine learning. However, many of these models are highly parametric (i.e.

Retail 152