
Mining Groups of people from Wikipedia

I am trying to get the list of people from http://en.wikipedia.org/wiki/Category:People_by_occupation. I have to go through all the sections and collect the people from each one.

How should I go about it? Should I use a crawler to fetch the pages and search through them with BeautifulSoup?
Or is there another way to get the same data from Wikipedia?

AlgoMan asked Dec 21 '25 08:12



1 Answer

I would go with the Pywikipediabot Python project.

Have a look at category.py. You could use:

* tree        - show a tree of subcategories of a given category
* listify     - make a list of all of the articles that are in a category
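
If you would rather not install the bot framework, here is a minimal sketch of the same idea (a subcategory tree plus a list of articles) written directly against the MediaWiki web API using `list=categorymembers`. This is not how category.py is implemented; the function names `fetch_json`, `category_members` and `walk_category`, the `max_depth` parameter and the User-Agent string are all made up for illustration.

```python
import json
import urllib.parse
import urllib.request

API_URL = "https://en.wikipedia.org/w/api.php"


def fetch_json(params):
    """Send one GET request to the MediaWiki API and decode the JSON reply."""
    url = API_URL + "?" + urllib.parse.urlencode(params)
    # Placeholder User-Agent (an assumption); replace it with your own contact info.
    request = urllib.request.Request(
        url, headers={"User-Agent": "category-walk-sketch/0.1"}
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def category_members(category, fetch=fetch_json):
    """Yield (namespace, title) for every direct member of `category`."""
    params = {
        "action": "query",
        "list": "categorymembers",
        "cmtitle": category,
        "cmtype": "page|subcat",
        "cmlimit": "500",
        "format": "json",
    }
    while True:
        data = fetch(params)
        for member in data["query"]["categorymembers"]:
            yield member["ns"], member["title"]
        if "continue" not in data:
            break
        # Merge the continuation tokens to request the next batch.
        params = {**params, **data["continue"]}


def walk_category(category, max_depth=1, fetch=fetch_json):
    """Return (tree, articles): visited categories with depth, and article titles."""
    tree, articles, seen = [], set(), set()
    stack = [(category, 0)]
    while stack:
        name, depth = stack.pop()
        if name in seen:
            continue  # categories can contain each other, so guard against cycles
        seen.add(name)
        tree.append((depth, name))
        for ns, title in category_members(name, fetch):
            if ns == 14:  # namespace 14 holds Category: pages
                if depth < max_depth:
                    stack.append((title, depth + 1))
            elif ns == 0:  # namespace 0 holds ordinary articles
                articles.add(title)
    return tree, sorted(articles)
```

Calling `walk_category("Category:People_by_occupation", max_depth=1)` would hit the live API. That category is nested many levels deep, so it is probably wise to start with a small `max_depth` and raise it gradually. The `fetch` argument exists only so the walk can be tried against canned responses before touching the network.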
systempuntoout answered Dec 24 '25 08:12



