[Hashing Home][Home]
HyperLogLog [1] provides a way to estimating the number of distinct values (cardinality) in a data set. In this case we will use a HyperLogLog method, and compare with the creation of a dataset in Python. For example, with a string of "One one two one two" we have a cardinality of 3 ("One", "one" and "two").