I am looking for an algorithm to generate a histogram over a large amount of streaming data. The max and min are not known in advance, but the standard deviation and mean are within a particular range.
I appreciate your ideas.
Cheers,
Derive the frequency density for each interval by dividing its frequency by the corresponding class width. The total area of the histogram is then found by summing, over all intervals, the product of the frequency density and its class width.
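For concreteness, here is a small worked example in Python; the bin edges and frequencies are made up purely for illustration:

```python
# Hypothetical unequal-width bins: (lower edge, upper edge, frequency).
bins = [(0, 5, 10), (5, 10, 20), (10, 20, 15)]

densities = []
total_area = 0.0
for lower, upper, freq in bins:
    width = upper - lower
    density = freq / width          # frequency density = frequency / class width
    densities.append(density)
    total_area += density * width   # area of this bar = density * width = frequency

print(densities)   # [2.0, 4.0, 1.5]
print(total_area)  # 45.0, which equals the total frequency 10 + 20 + 15
```

As the last line shows, summing density times width just recovers the total frequency, so the histogram's total area equals the number of observations.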
A histogram displays numerical data by grouping data into "bins" of equal width. Each bin is plotted as a bar whose height corresponds to how many data points are in that bin. Bins are also sometimes called "intervals", "classes", or "buckets".
A histogram is a graph that shows the frequency of numerical data using rectangles. The height of a rectangle (read off the vertical axis) represents the frequency of a variable, i.e. how often values in that interval appear.
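As a quick batch illustration (not a streaming solution), equal-width binning can be done with NumPy; the sample values below are invented:

```python
import numpy as np

# Hypothetical sample data; in practice these would be your measurements.
data = [1.2, 3.4, 2.2, 5.1, 4.8, 2.9, 3.3, 0.7, 4.1, 3.8]

# Five equal-width bins spanning min(data)..max(data).
counts, edges = np.histogram(data, bins=5)

for count, lo, hi in zip(counts, edges[:-1], edges[1:]):
    # Bar height = number of data points falling in [lo, hi).
    print(f"[{lo:.2f}, {hi:.2f}): {count}")
```

Note that this batch approach needs the data's min and max up front, which is exactly what the streaming setting in the question rules out.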
I just found one solution: Sec. 2.2 ("On-line Histogram Building") of the paper "A Streaming Parallel Decision Tree Algorithm". The algorithm is implemented by the NumericHistogram class in the Hive project:
A generic, re-usable histogram class that supports partial aggregations. The algorithm is a heuristic adapted from the following paper: Yael Ben-Haim and Elad Tom-Tov, "A streaming parallel decision tree algorithm", J. Machine Learning Research 11 (2010), pp. 849--872. Although there are no approximation guarantees, it appears to work well with adequate data and a large (e.g., 20-80) number of histogram bins.
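For anyone who wants the gist without reading the paper or the Hive source, below is a minimal Python sketch of the bin-merging heuristic from Sec. 2.2. The class and method names are my own and are not Hive's API; this is an illustration of the idea, not the actual NumericHistogram implementation.

```python
import bisect

class StreamingHistogram:
    """Sketch of the bin-merging heuristic from Ben-Haim & Tom-Tov, Sec. 2.2."""

    def __init__(self, max_bins=64):
        self.max_bins = max_bins
        self.bins = []  # list of [centroid, count], kept sorted by centroid

    def add(self, value):
        # Insert the new point as its own bin (count 1), keeping bins sorted.
        idx = bisect.bisect_left(self.bins, [value, 0])
        if idx < len(self.bins) and self.bins[idx][0] == value:
            self.bins[idx][1] += 1
        else:
            self.bins.insert(idx, [value, 1])

        # If the cap is exceeded, merge the two bins with the closest centroids
        # into one bin whose centroid is their count-weighted average.
        if len(self.bins) > self.max_bins:
            gaps = [self.bins[i + 1][0] - self.bins[i][0]
                    for i in range(len(self.bins) - 1)]
            i = gaps.index(min(gaps))
            (c1, n1), (c2, n2) = self.bins[i], self.bins[i + 1]
            merged = [(c1 * n1 + c2 * n2) / (n1 + n2), n1 + n2]
            self.bins[i:i + 2] = [merged]

# Usage: feed values one at a time; memory stays bounded by max_bins,
# and no prior knowledge of the data's min or max is required.
h = StreamingHistogram(max_bins=20)
for x in (0.5, 1.7, 2.2, 9.9, 3.1):
    h.add(x)
print(h.bins)
```

Because only a fixed number of (centroid, count) pairs is kept, the structure works on unbounded streams, and two such histograms can be merged by concatenating their bins and repeating the same closest-pair merging, which is what makes partial aggregation possible.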