Measuring semantic similarity between two phrases [closed]

Tags:

I want to measure semantic similarity between two phrases/sentences. Is there any framework that I can use directly and reliably?

I have already checked out this question, but its pretty old and I couldn't find real helpful answer there. There was one link, but I found this unreliable.

e.g.:
I have a phrase: felt crushed
I have several choices: force inwards,pulverized, destroyed emotionally, reshaping etc.
I want to find the term/phrase with highest similarity to the first one.
The answer here is: destroyed emotionally.

The bigger picture is: I want to identify which frame from FrameNet matches to the given verb as per its usage in a sentence.

Update : I found this library very useful for measuring similarity between two words. Also the ConceptNet similarity mechanism is very good.

and this library for measuring semantic similarity between sentences

If anyone has any insights please share.

925

asked Apr 25 '13 01:04

voidMainReturn

1 Answers

This is a very complicated problem.

The main technique that I can think of (before going into more complicated NLP processes) would be to apply cosine (or any other metric) similarity to each pair of phrases. Obviously this solution would be very inefficient at the moment due to the non-matching problem: The sentences might refer to the same concept with different words.

To solve this issue, you should transform the initial representation of each phrase with a more "conceptual" meaning. One option would be to extend each word with its synonyms (i.e. using WordNet, another option is to apply metrics such as distributional semantics DS (http://liawww.epfl.ch/Publications/Archive/Besanconetal2001.pdf) that extend the representation of each term with the more likely words to appear with it.

Example: A representation of a document: {"car","race"} would be transform to {"car","automobile","race"} with synonyms. While, with DS it would be something like: {"car","wheel","road","pilot", ...}

Obviously this transformation won't be binary. Each term will have some associated weights.

I hope this helps.

190

answered Oct 19 '22 05:10

miguelmalvarez

Related questions
                            
                                How to limit download rate of HTTP requests in requests python library?
                            
                                Why does the type System.__ComObject claim (sometimes) to be public when it is not?
                            
                                How to manage the error queue in openssl (SSL_get_error and ERR_get_error)
                            
                                Rails 4: How to upload files with AJAX
                            
                                Find nearest edge in graph
                            
                                Git file write error (No space left in device)
                            
                                Why does tree vectorization make this sorting algorithm 2x slower?
                            
                                UDP Packet drop - INErrors Vs .RcvbufErrors
                            
                                Python: understanding (None for g in g if (yield from g) and False)
                            
                                Wrap text from bottom to top
                            
                                How to check security acess (@Secured or @PreAuthorize) before validation (@Valid) in my Controller?
                            
                                Custom extension file not opening in iMessage

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With