I'm designing an application that need to extract people's names from short texts.
What is the best way to do that? is there a database of names where I can test to know where is the name? the fact that the text is short it might not be as intensive in terms of processing needs.
Any ideas?
Thanks,
Tam
You can use a statistical Named Entity Recognizer (NER), such as Stanford's NER, or LingPipe's. These are machine learning-based recognizers, that do not require huge dictionaries of names as input.
Alternatively, you can get a list of person names from the Web (there are plenty), and use the Aho-Corasick string searching algorithm to efficiently extract names from the list from text.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With