Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Is there any way to get wikipedia pageview statistics per page at the *country* grain (instead of simply language)?

I see dumps.wikimedia.org/other/pagecounts-raw/, for example, but no country-specific data there...

like image 948
Kevin Wylie Avatar asked Feb 11 '15 21:02

Kevin Wylie


People also ask

How can I see how many visits a Wikipedia page has?

The pageview stats tool is available from any page, in two ways: 1) Click "Page information" under "Tools" in the sidebar and then "Page view statistics" at the bottom. 2) Click the history tab and then "Pageviews" near the top.

Is Wikipedia API free?

Wikipedia and other Wikimedia projects are free, collaborative repositories of knowledge, written and maintained by volunteers from around the world. The Wikimedia API gives you open access to add this free knowledge to your projects and apps.

How many hits does Wikipedia get a day?

According to publicly disclosed figures, the English version of the website averages about 255 million pageviews a day. According to web analytics firm SimilarWeb, Wikipedia overall is the eighth most visited website.

How do I get a content page on Wikipedia?

There are three main methods for retrieving page content via the API: Get the contents of a page using the Revisions API (as wikitext). Get the contents of a page using the Parse API (as HTML or wikitext). Get plain text or limited HTML extracts of a page using the API of the TextExtracts extension.


1 Answers

As far as I know, there isn't. For obvious privacy reasons, the published page view statistics don't include any IP address information, so there's no way to tell where the requests are coming from.

That said, I suppose it might be possible for the WMF Analytics Team to provide such statistics, or at least some rough approximation of them, if enough people were to request them. Apparently, Wikimedia does already have an IP geolocation database that they use for other purposes (e.g. local event notifications and fundraising campaigns), so that part might not be as hard as it could otherwise seem. There might still be issues with the sheer request volume and potential privacy leaks (e.g. leaking an editor's country of residence, if they edit a rarely-viewed page that no-one else views withing a reporting interval), but those could presumably be handled, e.g. by only geolocating a statistical sample of all requests, and by lumping together statistics for rarely-viewed pages into a single "other" group.

I suspect the main question is simply whether Wikimedia, as a mostly volunteer-run non-profit supported by donations, would really consider all the work needed to provide such statistics worthwhile. Still, I don't suppose it could hurt to ask.

like image 125
Ilmari Karonen Avatar answered Dec 06 '22 07:12

Ilmari Karonen