I have a collection of sentences, and I need to analyse them to see how similar they are.
Are there any established algorithms to do this?
I care about word-level similarity rather than spelling: naively, I don't care about spelling differences, and typos can be treated as different words, although it would be nice to account for them eventually.

I've used Levenshtein distance and n-grams for spelling correction before, although I'm not confident those translate directly to my purposes. Perhaps some hybrid of splitting the sentence at spaces and one of the above (or other) algorithms would be a starting point.
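The hybrid just described — splitting sentences at spaces and running Levenshtein over the resulting word tokens — can be sketched roughly like this (a toy implementation for illustration; function names are my own, not from any established library):

```python
def levenshtein(a, b):
    # Classic dynamic-programming edit distance. It works on any
    # sequences, so passing lists of words gives a word-level distance
    # instead of the usual character-level one.
    m, n = len(a), len(b)
    prev = list(range(n + 1))
    for i in range(1, m + 1):
        curr = [i] + [0] * n
        for j in range(1, n + 1):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            curr[j] = min(prev[j] + 1,         # deletion
                          curr[j - 1] + 1,     # insertion
                          prev[j - 1] + cost)  # substitution
        prev = curr
    return prev[n]

def word_similarity(s1, s2):
    # Split at spaces, compute word-level edit distance, and
    # normalise into a 0..1 similarity score.
    w1, w2 = s1.split(), s2.split()
    d = levenshtein(w1, w2)
    return 1.0 - d / max(len(w1), len(w2), 1)
```

With this scheme a typo produces a whole-word mismatch (cost 1), matching the "typos can be treated as different words" assumption; a fancier variant could substitute character-level Levenshtein for the 0/1 substitution cost to account for near-misses.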
What options are available? Any advice?
Thanks!
The simplest way to compute the similarity between two documents using word embeddings is to compute the document centroid vector — the average of all the word vectors in the document — and then compare centroids with a measure such as cosine similarity.
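A minimal sketch of the centroid approach, using made-up three-dimensional toy vectors in place of real pretrained embeddings (in practice you would load something like GloVe or word2vec):

```python
import math

# Toy word vectors for illustration only; real embeddings would have
# hundreds of dimensions and come from a pretrained model.
VECS = {
    "dog": [0.9, 0.1, 0.0],
    "cat": [0.8, 0.2, 0.0],
    "car": [0.0, 0.1, 0.9],
}

def centroid(sentence):
    # Average the vectors of the words we have embeddings for;
    # out-of-vocabulary words are simply skipped.
    vecs = [VECS[w] for w in sentence.split() if w in VECS]
    if not vecs:
        return None
    return [sum(dim) / len(vecs) for dim in zip(*vecs)]

def cosine(u, v):
    # Cosine similarity: dot product over the product of norms.
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv)
```

With these toy vectors, `cosine(centroid("the dog"), centroid("the cat"))` comes out higher than `cosine(centroid("the dog"), centroid("the car"))`, which is the behaviour the centroid trick is meant to capture. Note that averaging discards word order, so it is a bag-of-words measure.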
The books share a similarity of ideas. I see a lot of similarities in them. Looking at these fossils, I see some similarity to modern-day birds. I see very little similarity between your situation and his.
This paper compares several sentence similarity measures. Perhaps you can use one of them as is, or modify it for your needs.
Otherwise, "sentence similarity measure" is a good key term to google for.