Text difference algorithm

Tags: python, c#, diff

I need an algorithm that can compare two text files and highlight their differences and, even better, compute a meaningful similarity score: two similar files should score higher than two dissimilar files, with "similar" meant in the ordinary sense. It sounds easy to implement, but it's not.

The implementation can be in C# or Python.

Thanks.

369

asked Sep 28 '08 10:09

Graviton

1 Answer

I recommend taking a look at Neil Fraser's code and articles:

google-diff-match-patch

Currently available in Java, JavaScript, C++ and Python. Regardless of language, each library features the same API and the same functionality. All versions also have comprehensive test harnesses.

Neil Fraser: Diff Strategies - for theory and implementation notes
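
For a quick feel of the two things the question asks for (a highlighted diff and a similarity score), here is a minimal Python sketch. Note that it does not use google-diff-match-patch; it swaps in `difflib` from the Python standard library as a stand-in, and the helper names `similarity` and `line_diff` are made up for illustration.

```python
import difflib


def similarity(a: str, b: str) -> float:
    """Return a score in [0, 1]; 1.0 means the two texts are identical.

    Hypothetical helper: wraps difflib.SequenceMatcher, which compares
    the texts character by character.
    """
    return difflib.SequenceMatcher(None, a, b).ratio()


def line_diff(a: str, b: str) -> list:
    """Return unified-diff lines describing how to turn text a into text b.

    Hypothetical helper: lines starting with '-' exist only in a,
    lines starting with '+' exist only in b.
    """
    return list(
        difflib.unified_diff(
            a.splitlines(),
            b.splitlines(),
            fromfile="old",
            tofile="new",
            lineterm="",
        )
    )


if __name__ == "__main__":
    old = "the quick brown fox\njumps over the lazy dog\n"
    new = "the quick brown fox\nleaps over the lazy dog\n"
    print("\n".join(line_diff(old, new)))
    print(f"similarity: {similarity(old, new):.2f}")
```

The score here is character-based, so it may be slow on large files and is only one possible reading of "similar". Comparing lists of words or lines instead of raw strings is a common variation.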

131

answered Sep 23 '22 01:09

aku

