Cluster one-dimensional data optimally? [closed]

1 Answer

Univariate k-means clustering can be solved exactly in O(kn) time on already sorted input, based on theoretical results on Monge matrices. That approach never became popular, most likely because of numerical instability and perhaps also because it is hard to code.

A better option is an O(kn lg n) method that is now implemented in the R package Ckmeans.1d.dp, version 3.4.6. This implementation is as fast as heuristic k-means but guarantees optimality, and its results can be orders of magnitude better than those of heuristic k-means, especially for large k.

The generic dynamic programming solution by Richard Bellman (1973) does not exploit the specifics of the k-means problem, and its implied runtime is O(kn^3).
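To make the dynamic programming idea concrete, here is a minimal Python sketch. It is not the Ckmeans.1d.dp implementation and not the O(kn) or O(kn lg n) method mentioned above; it is a plain O(kn^2) dynamic program that relies on two facts: in one dimension the optimal clusters are contiguous runs of the sorted data, and prefix sums give the within-cluster sum of squares of any run in O(1). The function name `optimal_kmeans_1d` and its return shape are assumptions made for illustration.

```python
from typing import List, Tuple


def optimal_kmeans_1d(values: List[float], k: int) -> Tuple[float, List[List[float]]]:
    """Exact 1-D k-means via a simple O(k * n^2) dynamic program (illustrative sketch)."""
    xs = sorted(values)
    n = len(xs)
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= len(values)")

    # Prefix sums of x and x^2 give the cost of any contiguous run in O(1).
    s1 = [0.0] * (n + 1)
    s2 = [0.0] * (n + 1)
    for idx, x in enumerate(xs):
        s1[idx + 1] = s1[idx] + x
        s2[idx + 1] = s2[idx] + x * x

    def cost(i: int, j: int) -> float:
        """Sum of squared deviations from the mean of xs[i:j] (j exclusive)."""
        total = s1[j] - s1[i]
        # Clamp at zero: the subtraction can go slightly negative in floating point.
        return max(0.0, (s2[j] - s2[i]) - total * total / (j - i))

    inf = float("inf")
    # best[c][j]: minimal cost of splitting the first j sorted points into c clusters.
    best = [[inf] * (n + 1) for _ in range(k + 1)]
    # split[c][j]: start index of the last cluster in that optimal split.
    split = [[0] * (n + 1) for _ in range(k + 1)]
    best[0][0] = 0.0

    for c in range(1, k + 1):
        for j in range(c, n + 1):
            for i in range(c - 1, j):  # the last cluster is xs[i:j]
                cand = best[c - 1][i] + cost(i, j)
                if cand < best[c][j]:
                    best[c][j] = cand
                    split[c][j] = i

    # Walk the stored split points backwards to recover the clusters.
    clusters: List[List[float]] = []
    j = n
    for c in range(k, 0, -1):
        i = split[c][j]
        clusters.append(xs[i:j])
        j = i
    clusters.reverse()
    return best[k][n], clusters


if __name__ == "__main__":
    total_cost, groups = optimal_kmeans_1d([1, 2, 3, 10, 11, 12, 50], 3)
    print(total_cost, groups)
```

The prefix-sum cost is convenient but can lose precision when the values are large relative to their spread, so for real work the package mentioned above is likely the safer choice.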

170

answered Oct 04 '22 12:10

user6417312


Tags:

r

cluster-analysis

k-means

Laciel
