Probabilistic Data Structures for Web Analytics and Data Mining
Highly Scalable
MAY 1, 2012
Let us start with a simple example that illustrates capabilities of probabilistic data structures: Let us have a data set that is simply a heap of ten million random integer values and we know that it contains not more than one million distinct values (there are many duplicates). what is the cardinality of the data set)?
Let's personalize your content