I want to implement a Python-based semantic search over a set of keywords (mainly hobbies, latest news, etc., which people might be interested in talking about). I want to know whether there are any ontology databases for this, and any open-source search algorithms or implementations.
E.g. my set = {talking, drinking, tweeting, Katrina Kaif, Katrina cyclone, rock collecting, coin collecting}
So, on searching for "accumulate", I might get rock collecting and coin collecting as the output.
Edit: The terms can contain multiple words. That is, "President. Barack Obama of United States" is a valid query.
You might want to use "random indexing". It can do exactly what you need: it calculates a feature vector for each word and defines a metric of semantic similarity between two words.
All you need is to grab a copy of "An Introduction to Random Indexing" and the semanticvectors package to get you started ...
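To make the idea concrete, here is a minimal pure-Python sketch of random indexing. It is not the semanticvectors API; the function names (`index_vector`, `train`, `term_vector`, `search`), the dimensions, the window size and the toy corpus are all made up for illustration. Each word gets a sparse random "index vector", a word's context vector is the sum of the index vectors of its neighbours, and similarity is the cosine between context vectors. A multi-word term is assumed to be the sum of its words' vectors.

```python
import math
import random
from collections import defaultdict

DIM = 300      # size of the reduced space (assumed value)
NONZERO = 10   # number of +1/-1 entries per index vector (assumed value)

def index_vector(word, dim=DIM, nonzero=NONZERO):
    """Sparse ternary random vector, stored as {position: sign}."""
    rng = random.Random(word)  # seeded by the word, so it is reproducible
    positions = rng.sample(range(dim), nonzero)
    return {pos: rng.choice((-1, 1)) for pos in positions}

def train(sentences, window=2, dim=DIM):
    """Add the index vectors of each word's neighbours to its context vector."""
    context = defaultdict(lambda: [0.0] * dim)
    for sentence in sentences:
        tokens = sentence.lower().split()
        for i, word in enumerate(tokens):
            lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    for pos, sign in index_vector(tokens[j], dim).items():
                        context[word][pos] += sign
    return dict(context)

def cosine(u, v):
    """Cosine similarity; 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def term_vector(term, context, dim=DIM):
    """Vector for a (possibly multi-word) term: sum of its known words."""
    total = [0.0] * dim
    for word in term.lower().split():
        for k, value in enumerate(context.get(word, ())):
            total[k] += value
    return total

def search(query, terms, context):
    """Return (score, term) pairs, most similar term first."""
    q = term_vector(query, context)
    scored = [(cosine(q, term_vector(t, context)), t) for t in terms]
    return sorted(scored, reverse=True)

# Toy corpus, invented for this example; a real one must be far larger.
corpus = [
    "people accumulate rare coins and rocks",
    "people enjoy collecting rare coins and rocks",
    "she likes drinking coffee every morning",
    "he keeps tweeting news every morning",
]
terms = ["drinking", "tweeting", "rock collecting", "coin collecting"]

for score, term in search("accumulate", terms, train(corpus)):
    print(f"{score:.2f}  {term}")
```

Here "accumulate" and "collecting" end up close only because they appear next to the same words in the toy corpus. The quality of real results depends entirely on the text you train on.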
I hope this helps. If you need further advice, please comment ...
I am not sure whether this will be helpful to you, but have a look at:
Gnowsys