
Chapter 4, Mapping (on) the Web
HACK #45: Extract a Spatial Model from Wikipedia
the page name in square brackets [[Like This]]. In Wikipedia, there’s a
special URL syntax to get a list of the Wikipedia pages that link to a given
page. For example, http://en.wikipedia.org/wiki/Argentina is the country page
for Argentina, and
http://en.wikipedia.org/w/wiki.phtml?title=Special:Whatlinkshere&target=Argentina
is the page listing its backlinks, that is, every page that refers to
Argentina, as shown in Figure 4-17.
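As a minimal sketch of that URL syntax, the following Python function builds
the backlinks URL for any page name. The function name backlinks_url and the
constant WIKI_SCRIPT are illustrative, not part of Wikipedia; the script path
is copied from the example URL above and is assumed, not guaranteed, to be
the same for other pages.

```python
from urllib.parse import urlencode

# Script path taken from the example URL in the text; it may differ on
# other wikis or on later versions of the site (assumption).
WIKI_SCRIPT = "http://en.wikipedia.org/w/wiki.phtml"


def backlinks_url(page_name: str) -> str:
    """Return the "What links here" URL for a Wikipedia page name."""
    # Wikipedia page names use underscores in place of spaces.
    target = page_name.replace(" ", "_")
    # safe=":" keeps the colon in "Special:Whatlinkshere" unescaped,
    # matching the form of the URL shown in the text.
    query = urlencode(
        {"title": "Special:Whatlinkshere", "target": target}, safe=":"
    )
    return f"{WIKI_SCRIPT}?{query}"


print(backlinks_url("Argentina"))
# http://en.wikipedia.org/w/wiki.phtml?title=Special:Whatlinkshere&target=Argentina
```

The function only constructs the URL; fetching and parsing the resulting
page is a separate step.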
Modeling Wikipedia
Implicit in the structure of Wikipedia is a kind of spatial index to events,
people, and ideas! Some of these links will be to places we can identify and
geocode in their own right: countries, cities, towns, and regions.
For each country page in Wikipedia, we can build a set of related pages
through backlinks. Some of them are lists in which every country appears;
others are “History of…” and “Politics of…” pages, pages about towns and
cities, and pages about important dates and people. Each of these pages
links to one or many countries and cities in turn.
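One way to picture this model is as two lookups: from each place to the set
of pages that link to it, and from each page back to the places it mentions.
The sketch below assumes the backlink lists have already been collected; the
sample entries are made up for illustration and are not real Wikipedia data,
and the function name places_by_page is hypothetical.

```python
from collections import defaultdict

# Hypothetical backlink sets, keyed by place page (illustrative data only).
backlinks = {
    "Argentina": {"List of countries", "History of Argentina", "Buenos Aires"},
    "Chile": {"List of countries", "History of Chile", "Andes"},
}


def places_by_page(backlinks):
    """Invert place -> linking pages into page -> places it links to."""
    index = defaultdict(set)
    for place, pages in backlinks.items():
        for page in pages:
            index[page].add(place)
    return dict(index)


index = places_by_page(backlinks)

# Pages that link to many places (such as lists of every country) say
# little about any one place; pages that link to few are more specific.
for page, places in sorted(index.items()):
    print(page, "->", sorted(places))
```

Counting how many places each page links to is a simple, assumed heuristic
for separating generic list pages from pages that are genuinely about one
country or city.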
Wikipedia has a lot of spatial data in it, including the available reference
data from the CIA World Factbook for each country. Many country pages
have beautifully drawn flat maps. Wikipedia is rich in information about
the governmental and administrative structures of many countries, but it
doesn’t have structured metadata ...