


Manymonkeys.co.nz is used for special development purposes:
Enjoy a little humour with us!
The Mathematics of Monkeys and Shakespeare:
Information is the product of intelligence, not chance.
I don't know who it was that first talked about the possibility of monkeys typing randomly on typewriters producing Hamlet entirely by chance, "but given enough time and enough monkeys, it would happen."
This argument is actually quite sound, given enough time and enough monkeys, because it is one of those rare areas of speculation where the exact figures can be calculated.
The monkeys and typewriters scenario sounds possible until you examine the math.
Monkeys Produce Hamlet: Feasibility Study
Let's imagine a very simple typewriter that has only the 26 uppercase letters, a space bar and five punctuation characters (a total of 32 buttons) and an infinite roll of paper being fed through it. We have a monkey that knows how to press the keys and will do so in a totally random manner indefinitely. We want our monkey to type the following snippet:
"TO BE OR NOT TO BE, THAT IS THE QUESTION."
The probability of this happening is quite simple to calculate. Our monkeys are fast typists and can type the required number of characters in a single second (there are 41 keystrokes)! On average, how long will it be before one of our monkeys produces Hamlet's famous line? < br>
Well, there are 32 keys, so starting at any moment, the chances of our monkey getting the first keypress right are one in 32. Not good, but we have fast monkeys and lots of time. However, once it has got the first keystroke right, we also need the second keystroke to be right, otherwise we are back to square one. The chances of it getting the first and second keystrokes right are only one in (32*32 = 1024). Only one chance in 1024, but still lots of time to get it right. To get the first three characters right will be a one in (32*32*32 = 32768) chance. Each time it presses a key, there is a one in 32 chance that it will be correct. To get our little snippet of Hamlet, it will need a total of 41 consecutive "correct" keystrokes. This means that the chances are one in 32 to the power of 41.
32^41 = 5.142201741629e+061
Okay, so these figures are pretty vast, but we have a lot of monkeys and they can type fast. So how long will it take, on average, for one of my monkeys to type a line matching that sentence? Hard question. Let's get an idea of how long we are talking here. How many lines can my monkey type in a year, given that it types at a rate of one line per second?
1 line per second
* 60 seconds per minute = 60 lines per minute
* 60 minutes per hour = 3600 lines per hour
* 24 hours per day = 86400 lines per day
* 365.24 days per year = 31556736 lines per year
Okay, now for the moment of truth. We know how many possible different lines can be produced, hence how likely it is for us to get the right one at random (because only one is right). We can calculate the chances of getting the quote in a year most easily by calculating the chances of missing on every attempt: the chances of getting the quote will be 100% minus the chances of missing on every attempt. The calculation is as follows.
probability of missing on one attempt = 1  1/(32^41)
...of missing for a minute straight = (1  1/(32^41)) ^ 60
...of missing for an hour straight = ((1  1/(32^41)) ^ 60) ^ 60
...of missing for a day straight = (((1  1/(32^41)) ^ 60) ^ 60) ^ 24
...for a year straight = ((((1  1/(32^41)) ^ 60) ^ 60) ^ 24) ^ 365
The answer looks like this:
0.999999999999999999999999999999999999999999999999999999386721844366784484760952487499968756116464000
Okay, so realistically, there is no way that our monkey can do its job in a year. Maybe we should start talking centuries? Millenia? As I understand it, common scientific wisdom suggests that the universe is about 15 billion years old. We can easily extend our current figure of one year to count many years. Our calculator will be much faster if we break the calculation down to powers of two and just use the "square" operation, so let's choose a nice even power of two like 2^34, which is about 17 billion (17,179,869,184 to be precise). The new figure is:
0.999999999999999999999999999999999999999999989463961512816564762914005246488858434168051444149065728
The chances of failure are still essentially 100%, even after 2^34 years. Hmmm. It doesn't look like were are going to get very far with this, but just for the heck of it, let's see if we are any better off with a lot of monkeys. Let's not hold back here  let's hypothesize 17 billion galaxies, each containing 17 billion habitable planets, each planet with 17 billion monkeys each typing away and producing one line per second for 17 billion years. What are the chances of the phrase "TO BE OR NOT TO BE, THAT IS THE QUESTION." not being included in the output?
0.999999999999946575937950778196079485682838665648264132188104299326596142975867879656916416973433628
It's about 99.999999999995% sure that they would fail to produce the sentence. Are you astounded? It's such a trivial requirement, right? Just one puny sentence. And yet the figures keep coming up "impossible". Where have we made a mistake? We have failed to appreciate the sheer magnitude of the problem. Let's look at it one more time.
The number of 41character strings that are possible with a 32character alphabet is 32^41.
51422017416287688817342786954917203280710495801049370729644032
In case you don't feel like counting, this value is 62 digits long. In our hypothesising above, we imagined 17 billion galaxies, each with 17 billion planets, each with 17 billion monkeys, each of which was producing a line of text per second for 17 billion years. How many lines of text did we wind up producing in this experiment? The math is as follows:
2^34 * 2^34 * 2^34 * 2^34 * 365 * 24 * 60 * 60
And the answer is as follows:
2747173049143991138247931294711870033017962496000
Once again, in case you don't feel like counting, the answer is 49 digits long. Now, there is no guarantee that our monkeys are going to type something different every time, but even if we managed to rig up the experiment so that they never tried the same thing twice, they have still only produced 1/18,718,157,355,362 of the possible alternatives. The denominator in that fraction is 14 digits long, by the way. It's a figure that's vastly bigger than anything you would come across in the real world. Is it any wonder, in light of that, that it is so damn hard to get the right answer by accident?
Conclusion: Information is the product of intelligence, not chance.
Edited from "A Constrained Rant" by "The Famous Brett Watson", 1995




