aboutsummaryrefslogblamecommitdiff
path: root/devel/py-ddsketch/pkg-descr
blob: a3621b68d57c2aa0e872b3274e8e4a58a7393bc3 (plain) (tree)


















                                                                                
This is the Python implementation of the distributed quantile sketch algorithm
DDSketch. DDSketch has relative-error guarantees for any quantile q in [0, 1].
That is if the true value of the qth-quantile is x then DDSketch returns a value
y such that |x-y| / x < e where e is the relative error parameter. (The default
here is set to 0.01.) DDSketch is also fully mergeable, meaning that multiple
sketches from distributed systems can be combined in a central node.

Our default implementation, DDSketch, is guaranteed to not grow too large in
size for any data that can be described by a distribution whose tails are
sub-exponential.

We also provide implementations (LogCollapsingLowestDenseDDSketch and
LogCollapsingHighestDenseDDSketch) where the q-quantile will be accurate up to
the specified relative error for q that is not too small (or large). Concretely,
the q-quantile will be accurate up to the specified relative error as long as it
belongs to one of the m bins kept by the sketch. If the data is time in seconds,
the default of m = 2048 covers 80 microseconds to 1 year.

WWW: https://github.com/DataDog/sketches-py