Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

how to get database of all people's names (or at least English common ones)?

Tags:

string

I'm designing an application that need to extract people's names from short texts.

What is the best way to do that? is there a database of names where I can test to know where is the name? the fact that the text is short it might not be as intensive in terms of processing needs.

Any ideas?

Thanks,

Tam

like image 902
Tam Avatar asked Nov 14 '09 22:11

Tam


1 Answers

You can use a statistical Named Entity Recognizer (NER), such as Stanford's NER, or LingPipe's. These are machine learning-based recognizers, that do not require huge dictionaries of names as input.

Alternatively, you can get a list of person names from the Web (there are plenty), and use the Aho-Corasick string searching algorithm to efficiently extract names from the list from text.

like image 188
João Silva Avatar answered Oct 01 '22 06:10

João Silva