Estimate distinct count of values on the command line

The edcount program implements HyperLogLog, with some minor modifications, as
detailed by Flajolet et al. in the paper "HyperLogLog: the analysis of a
near-optimal cardinality estimation algorithm".

The memory footprint of the program is constant at a few megabytes,
regardless of the number of records counted, and accuracy does not degrade
as the record count grows.

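For readers unfamiliar with the algorithm, the following is a minimal Python
sketch of textbook HyperLogLog. It is not edcount's source and does not
reproduce its modifications. The class name, the register count (2**14), and
the 64-bit BLAKE2b hash are all assumptions chosen for illustration. It shows
why memory stays constant: every input only updates one slot in a fixed-size
register array.

```python
import hashlib
import math


class HyperLogLog:
    """Illustrative textbook HyperLogLog (hypothetical, not edcount's code)."""

    def __init__(self, p=14):
        self.p = p                          # index bits (assumed value)
        self.m = 1 << p                     # number of registers
        self.registers = bytearray(self.m)  # fixed size, never grows
        # Bias-correction constant from the paper, valid for m >= 128.
        self.alpha = 0.7213 / (1 + 1.079 / self.m)

    def add(self, value):
        # Assumption: a 64-bit hash; the paper itself describes 32 bits.
        digest = hashlib.blake2b(value.encode("utf-8"), digest_size=8).digest()
        x = int.from_bytes(digest, "big")
        width = 64 - self.p
        idx = x >> width                    # first p bits pick a register
        rest = x & ((1 << width) - 1)       # remaining bits
        # Rank: position of the leftmost 1-bit in the remaining bits.
        rank = width - rest.bit_length() + 1
        if rank > self.registers[idx]:
            self.registers[idx] = rank

    def count(self):
        # Normalized harmonic mean of 2**register over all registers.
        z = sum(2.0 ** -r for r in self.registers)
        estimate = self.alpha * self.m * self.m / z
        zeros = self.registers.count(0)
        if estimate <= 2.5 * self.m and zeros:
            # Small-range correction: fall back to linear counting.
            estimate = self.m * math.log(self.m / zeros)
        return estimate
```

With 2**14 registers the expected relative error is about 1.04/sqrt(2**14),
or roughly 0.8%, and the register array occupies 16 KiB no matter how many
values are added. Adding the same value twice leaves the registers unchanged.
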
WWW: https://github.com/haroldfreeman/edcount |