# Wikipedia's fixed point

(by Bob West, May 26, 2011)

The other day my friend Peter pointed me towards this xkcd comic: http://xkcd.com/903/

The interesting part is actually hidden in the rollover (the text that appears when you point your mouse cursor over the image). It says,

"Wikipedia trivia: if you take any article, click on the first link in the article text not in parentheses or italics, and then repeat, you will eventually end up at 'Philosophy'."

Peter mused, "Now is that intentionally engineered into the system, or does it have to do with philosophy being a conceptual attractor to everything?"

This is where Wikipedia-based procrastination got me again. I spent a day (a day I should normally have spent working) thinking about and experimenting with this. So I thought I might as well share the results.

Here is a picture that captures the scenario:

(I made a little mistake drawing the picture: I included the article "Vertebrate" twice; "Boa" has to link directly to "Mammal" in the inner cycle.)

In this picture, F0 is the set of all Wikipedia articles. Each article has exactly one "first-link successor", marked by an arrow from the article to its successor. F1 is the subset of F0 consisting of all articles that are the first-link successor of at least one Wikipedia article. F2 is the set of all first-link successors of the articles in F1, F3 is the set of all first-link successors of the articles in F2, and so forth. Each layer is contained in the previous one, and since there are only finitely many articles, we eventually reach a point FN where applying the successor function does not shrink the set any more (in other words, FN+1 = FN): every article in FN is then the successor of some article in FN. This irreducible core FN necessarily consists of cycles only.
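To make the layer construction concrete, here is a minimal sketch in Python. It is not the script used for the experiment; the function name `layers` and the five-article toy graph are made up for illustration, and the sketch assumes that every article's successor is itself an article in the dictionary.

```python
def layers(successor):
    """Return [F0, F1, ..., FN], where each layer is the set of
    first-link successors of the previous one and FN is the core."""
    current = set(successor)          # F0: all articles
    result = [current]
    while True:
        following = {successor[article] for article in current}
        if following == current:      # F(N+1) == FN: nothing shrinks any more
            return result
        result.append(following)
        current = following


# Hypothetical toy "Wikipedia": a -> b -> c <-> d, and e -> c.
toy = {"a": "b", "b": "c", "c": "d", "d": "c", "e": "c"}

for k, layer in enumerate(layers(toy)):
    print(f"F{k}: {sorted(layer)}")
```

On this toy graph the layers shrink from five articles to three to two, and the last layer is exactly the cycle formed by c and d.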

The article about philosophy is in one of these cycles. Whatever article you start from, you will traverse the layers shown in the picture in a few hops. You probably won't hit every single one of them, but you'll always gravitate from the outside towards the center of the picture, finally ending up in one of the cycles of the irreducible core. From then on, when you continue clicking, you'll hit the same articles over and over again, in the same order. That means that, if the cycle you hit is the one of which "Philosophy" is a member, you'll eventually end up at "Philosophy".
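The clicking procedure itself can be sketched in a few lines. Again, this is only an illustration under the same assumptions as the picture (every article has exactly one successor); `walk_to_cycle` and the toy graph are hypothetical names, not part of any real Wikipedia tooling.

```python
def walk_to_cycle(successor, start):
    """Follow first links from `start` until an article repeats.
    Returns (tail, cycle): the articles visited before entering
    the cycle, and the cycle itself in visiting order."""
    position = {}                     # article -> index in path
    path = []
    article = start
    while article not in position:
        position[article] = len(path)
        path.append(article)
        article = successor[article]
    first_repeat = position[article]  # where the cycle begins
    return path[:first_repeat], path[first_repeat:]


# Hypothetical toy graph: a -> b -> c <-> d, and e -> c.
toy = {"a": "b", "b": "c", "c": "d", "d": "c", "e": "c"}

tail, cycle = walk_to_cycle(toy, "a")
print("tail:", tail, "cycle:", cycle)
```

Starting from a, the walk passes through b and then loops between c and d forever, so a "magic" article only works as a destination if it sits on the cycle you fall into.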

As a concrete example of manageable size, let's look at a small version of Wikipedia (the one also used in Wikispeedia), with which you can play here. These are the numbers: There are 4,604 articles in total, and there are 19 distinct layers F0 to F18 (cf. the picture), i.e., after at most 18 clicks, you'll hit a cycle. There are 38 cycles containing a total of only 88 articles (out of all the 4,604 articles!). The biggest cycle contains 11 articles:

[Culture, Anthropology, Human, Mammal, Vertebrate, Cambrian_explosion, Fossil, Mineral, Geology, Science, Society],

the next biggest one has 5 members:

[Economics, Education, Family, Sociology, Globalization],

and all remaining 36 cycles have only two members, each member having the other as its first-link successor, e.g. [Cotton, Textile]. (The numbers add up: 11 + 5 + 36 × 2 = 88.) So after at most 18 clicks, you are in one of the 88 articles of the irreducible core, and there's a good chance the cycle you're in is the large one with 11 elements. If that's the case, by clicking at most 10 more times you'll hit a fixed "magic" article, such as "Anthropology". Note that in this small version of Wikipedia (as opposed to "real" Wikipedia), you probably shouldn't pick "Philosophy" as your magic article, since it's not in the big cycle but in a tiny two-element cycle [Philosophy, Logic].
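How good the chance of landing in a particular cycle is depends on how many starting articles drain into it. The sketch below counts that for every cycle; the helper name `basin_sizes` and the seven-article toy graph are invented for illustration and do not reproduce the real data.

```python
from collections import Counter


def basin_sizes(successor):
    """Map each cycle (as a frozenset of its members) to the number
    of starting articles whose first-link walk ends in that cycle."""
    cycle_of = {}                     # article -> the cycle it drains into
    for start in successor:
        path, on_path = [], set()
        article = start
        # Walk until we meet an already classified article or close a new cycle.
        while article not in cycle_of and article not in on_path:
            on_path.add(article)
            path.append(article)
            article = successor[article]
        if article in cycle_of:
            cycle = cycle_of[article]
        else:
            cycle = frozenset(path[path.index(article):])
        for visited in path:
            cycle_of[visited] = cycle
    return Counter(cycle_of.values())


# Hypothetical toy graph with two cycles: {c, d} and {x, y}.
toy = {"a": "b", "b": "c", "c": "d", "d": "c", "e": "c", "x": "y", "y": "x"}

for cycle, size in basin_sizes(toy).items():
    print(sorted(cycle), "attracts", size, "of", len(toy), "articles")
```

In the toy graph the cycle of c and d attracts five of the seven articles, so a random start ends there far more often than in the other two-element cycle, even though both cycles have the same length.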

I pasted the data resulting from my experiment at the bottom of this page. (With these data you could in principle complete the above picture for the concrete case of the small Wikipedia version.)

By the way, it turns out there is a card trick that works exactly the same way. It's called the "Kruskal Count", and you can find a nice demo of it here.
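For readers who don't know the trick, here is a rough sketch of the idea, with a short list of numbers standing in for the cards. The function `final_position` and the ten card values are made up for illustration: you pick a secret starting card near the beginning, jump forward by the value of the card you are on, and repeat until the next jump would run off the end.

```python
def final_position(values, start):
    """Jump forward by the value at the current position until the
    next jump would leave the row; return the last position reached."""
    position = start
    while position + values[position] < len(values):
        position += values[position]
    return position


# Hypothetical row of ten "cards" (only their counting values matter).
cards = [3, 1, 2, 1, 1, 2, 1, 3, 1, 2]

for start in range(3):
    print("start", start, "-> ends at", final_position(cards, start))
```

In this hand-picked row, all three starting positions merge into the same chain and stop on the same card. The analogy to the first-link game is that once two chains land on the same card (or article), they stay together from then on.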

So let's wrap up by returning to Peter's question (you realize how this post is itself a cycle from Peter back to Peter...) whether the phenomenon from the xkcd comic is (1) "intentionally engineered into the system" or (2) caused by "philosophy being a conceptual attractor to everything". The answer is that both play a role: (1) It only works because "Philosophy" is part of one of the core cycles, so to an extent the effect is engineered into the structure of Wikipedia's hyperlink network (mostly not intentionally, although some people have tinkered with articles in order to make or break the xkcd effect). (2) The first paragraph of a Wikipedia article typically contains a definition, and as my friend Ashton put it, by following the first link over and over again, you act like a little child that keeps asking "But why?". So you're gravitating towards the "Philosophy" cycle most of the time (and not towards one of the other existing cycles), since the ultimate explanation of things is what philosophers deal with.

"But why?"

Layers and core cycles of the small Wikipedia version:

