Word-level edit distance of a sentence

1 Answers

In general, this is called the sequence alignment problem. Actually it does not matter what entities you align - bits, characters, words, or DNA bases - as long as the algorithm works for one type of items it will work for everything else. What matters is whether you want global or local alignment.

Global alignment, which attempt to align every residue in every sequence, is most useful when the sequences are similar and of roughly equal size. A general global alignment technique is the Needleman-Wunsch algorithm algorithm, which is based on dynamic programming. When people talk about Levinstain distance they usually mean global alignment. The algorithm is so straightforward, that several people discovered it independently, and sometimes you may come across Wagner-Fischer algorithm which is essentially the same thing, but is mentioned more often in the context of edit distance between two strings of characters.

Local alignment is more useful for dissimilar sequences that are suspected to contain regions of similarity or similar sequence motifs within their larger sequence context. The Smith-Waterman algorithm is a general local alignment method also based on dynamic programming. It is quite rarely used in natural language processing, and more often - in bioinformatics.

answered Sep 21 '22 06:09

Alexander Solovets

Related questions
                            
                                Meaning of "Runtime Environment" and of "Software framework"?
                            
                                Django AdminSite/ModelAdmin for end users?
                            
                                Need to avoid subprocess deadlock without communicate
                            
                                Mutability and Spring
                            
                                Many-to-many relationships in DDD
                            
                                What is the usefulness of NMTOKEN and NMTOKENS types?
                            
                                Explicit loading of grandchild collections in EF 4.1
                            
                                How to recalculate axis-aligned bounding box after translate/rotate?
                            
                                Eclipse autocomplete irritation
                            
                                Search and Replace Words in HTML
                            
                                The roadmap to an Android development expert [closed]
                            
                                Memory-usage of dictionary in Python?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Word-level edit distance of a sentence

Tags:

AutoC

People also ask

1 Answers

Alexander Solovets

Recent Activity

Donate For Us