Abstract: This work introduces ExaLogLog, a new data structure for approximate distinct counting, which has the same practical properties as the popular HyperLogLog algorithm. It is commutative, idempotent, mergeable, reducible, has a constant-time insert operation, and supports distinct counts up to the exa-scale. At the same time, as theoretically derived and experimentally verified, it requires 43% less space to achieve the same estimation error.
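The commutative, idempotent, and mergeable properties named in the abstract can be illustrated with a minimal HyperLogLog-style register array. This is a sketch under stated assumptions, not ExaLogLog itself: the precision `P`, the SHA-256-based `hash64` helper, and the function names `insert`, `sketch`, and `merge` are hypothetical choices made for illustration, and no cardinality estimator or reduction step is included.

```python
import hashlib

P = 4            # hypothetical precision: the sketch uses 2**P registers
M = 1 << P       # number of registers
REST_BITS = 64 - P


def hash64(item: str) -> int:
    """Map an item to a 64-bit integer (assumed hash; any good 64-bit hash would do)."""
    digest = hashlib.sha256(item.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def insert(registers: list[int], item: str) -> None:
    """Constant-time insert: update exactly one register with a max operation."""
    h = hash64(item)
    idx = h >> REST_BITS                    # top P bits select the register
    rest = h & ((1 << REST_BITS) - 1)       # remaining bits determine the rank
    # 1-based position of the first 1-bit in the remaining bits;
    # REST_BITS + 1 if all remaining bits are zero.
    rank = REST_BITS - rest.bit_length() + 1
    registers[idx] = max(registers[idx], rank)


def sketch(items) -> list[int]:
    """Build a register array from an iterable of items."""
    registers = [0] * M
    for item in items:
        insert(registers, item)
    return registers


def merge(a: list[int], b: list[int]) -> list[int]:
    """Merge two sketches of equal precision by element-wise maximum."""
    return [max(x, y) for x, y in zip(a, b)]


if __name__ == "__main__":
    items = ["x", "y", "z"]
    print(sketch(items) == sketch(reversed(items)))   # insertion order is irrelevant
    print(sketch(items + items) == sketch(items))     # duplicates change nothing
    print(merge(sketch(["x", "y"]), sketch(["y", "z"])) == sketch(items))
```

Because every update is a maximum over a single register, the state depends only on the set of inserted items, which is why insertion order and duplicates do not matter and why merging two sketches gives the sketch of the union.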
Submission history
From: Otmar Ertl
[v1] Wed, 21 Feb 2024 11:39:33 UTC (7,540 KB)
[v2] Thu, 27 Feb 2025 14:08:15 UTC (12,643 KB)