ORINGINAL TEXT
(This is the original text before it was edited by Scientific American)
The logic of mathematics sometimes leads to apparently bizarre conclusions. The rule here is that provided the logicdoesn't have holes in it, then the conclusions are sound. If they conflict with your intuition, then either your intuition is okay in a different context --- but not the one underconsideration --- or you need to refine your intuition.In September 1998 Stephen M. Omohundro sent me a puzzle that falls into exactly this category. The puzzle was invented by Steven Landsberg (U Illinois, Urbana-Champaign) ,and Omohundro came up with a variant in which the logic becomes surprisingly convoluted and the conclusions are remarkable.
First, the original version of the puzzle.
Ten pirates have got their hands on a hoard of 100 goldpieces, and wish to divide the loot between them. They are democratic pirates, in their own way, and it is their custom to make such divisions in the following manner. The fiercest pirate makes a proposal about the division, and everybody votes --- one vote each including the proposer. If 50% or more are in favour, the proposal passes and is implemented forthwith. Otherwise the proposer is thrown overboard and the procedure is repeated with the next fiercest pirate.
All the pirates enjoy throwing people overboard, but given the choice they prefer hard cash. They dislike being thrown overboard themselves. All pirates are rational, know that the other pirates are rational, know that they know that... and so on. Unlike other puzzles with an apparently similar form (see Monks, blobs, and common knowledge, Scientific American August 1998, 96-97), this one does not hinge upon the revelation of some piece of 'common knowledge'. Any common knowledge is already commonly known. Moreover, no two pirates are equally fierce, so there is a precise 'pecking order' --- and it is known to them all. Finally: gold pieces are indivisible and arrangements to share pieces are not permitted (since no pirate trusts his fellows to stick to such an arrangement). It's every man for himself.
Which proposal will maximise the fiercest pirate's gain?
Omohundro's contribution is to ask the same question, but with 500 pirates instead of 10. (Still 100 gold pieces.) For convenience, number the pirates from 1 upwards in order of meekness, so that the least fierce is number 1, the next least fierce number 2, and so on. The fiercest pirate thus gets the biggest number, and proposals proceed in reverse numerical order from the top down.
Following Omohundro, I'm going to try to convince you that the answer to the 10-pirate version of the problem is this: Pirate 10 proposes to keep 96 gold pieces for himself, to give one gold piece to each of pirates 8, 6, 4, and 2, and none to the odd-numbered pirates.
In contrast, the answer to the 500-pirate version involves the first 44 pirates being thrown overboard, after which pirate number 456... but I'm in danger of giving too much away, so you'll have to wait for the rest.
The secret to analysing all such games of strategy is to work backwards from the end. At the end, you know which decisions are good and which bad. Having established that, you can transfer that knowledge to the next-to-last decision, then the next-to-next-to-last, and so on. Starting from the front, in the order in which the decisions are actually taken, doesn't get you very far. The reason is that strategic decisions are all about 'What will the next person do if I do this...?' so the decisions that follow yours are important. The ones that come before yours aren't, because you can't do anything about them anyway.
Bearing this in mind, the place to start is when (if!) the game gets down to just two pirates, P1 and P2. The fiercest pirate is P2, and (if the game ever gets this far) his optimal decision is obvious: propose 100 pieces for himself and none for P1. His own vote is 50% of the total, so it wins.
Now add in pirate P3. Pirate P1 knows --- and P3 knows that he knows --- that if P3's proposal is voted down, P1 will get nothing. So P1 will therefore vote in favour of anything that P3 proposes, provided it yields him more than nothing. P3 therefore uses as little as possible of the gold to bribe P1,leading to the following proposal: 99 to P3, 0 to P2, and 1 to P1. Let's write it out like this:
P1 P2 P3
1 0 99
The thought-processes of P4 are similar. He needs 50% of the vote, so again he needs to bring exactly one other pirate on board. The minimum bribe he can use is one gold piece, and he can offer this to P2 since P2 will get nothing if P4's proposal fails and P3's is voted on. So the proposed allocation here becomes
P1 P2 P3 P4
0 1 0 99
The thought-processes of P5 are slightly more subtle. He needs to bribe two pirates. The minimum bribe he can use is two gold pieces, and the unique way he can succeed with this number is to propose the allocation
P1 P2 P3 P4 P5
1 0 1 0 98
The analysis proceeds in the same manner, with each proposal being uniquely prescribed by giving the proposer the maximum possible subject to ensuring a favourable vote, until we get to the tenth pirate and find the allocation
P1 P2 P3 P4 P5 P6 P7 P8 P9 P10
0 1 0 1 0 1 0 1 0 96
This is the proposal I indicated earlier, and it solves the 10-pirate puzzle.
Now comes Omohundro's twist: what if there are (a lot) more pirates? Clearly the same pattern persists --- for a while. In fact it persists up to the 200th pirate. P200 will offer nothing to the odd numbered pirates P1-P199, and one gold piece to each of the even-numbered pirates P2-P198, leaving one for himself.
However, we're trying to find the strategy for P500. At first sight, the argument breaks down at P200, because P201 has run out of bribes. However, he still has a vested interest in not being thrown overboard, so he can propose to take nothing himself:
P1 P2 P3 P4 ... P197 P198 P199 P200 P201
1 0 1 0 ... 1 0 1 0 0
This opens up a new phase of the strategy, because pirate P202 knows that P201 has to accept nothing, or be thrown overboard. So he can count on P201's vote. However, P202 also is forced to accept nothing. He must use the entire 100 gold pieces to bribe 100 pirates --- and these must be among those who would get zero according to P201's proposal. Since there are 101 such pirates, P202's proposal is no longer unique. Let's use a star * to mark pirates who might get something from P202's proposal:
P1 P2 P3 P4 ... P197 P198 P199 P200 P201 P202
0 * 0 * ... 0 * 0 * * 0
At this point a further consideration comes into force. Pirates have to think about how a pirate who has some chance of receiving a gold piece will react if he is definitely offered a gold piece. This depends on how much money, on average, he is willing to sacrifice for the fun of throwing somebody overboard. The puzzle doesn't specify this, so it is reasonable to assume that the pirates don't know it. This means that the bribe might succeed, so it is rational to offer a definite piece to a pirate who only has a chance of one later. As it turns out, things are more satisfactory: there are enough definite 0's in each round for pirates from now on to offer the bribes only to them. Indeed, as Peter Norvig pointed out, there will always be 100 allocations of 0 among pirates P1-P200, so we may describe a solution in which bribes are offered only to (some of) them.
You may wonder why I'm bothering about all this, since clearly pirate 203 doesn't have enough cash available to bribe enough pirates to vote for his proposal, whatever it is. This is true, and P203 goes overboard whatever he proposes --- a result that we symbolize by a cross X. We write ? to show that his choices are irrelevant, giving the situation:
P1 P2 P3 P4 ... P197 P198 P199 P200 P201 P202 P203
? ? ? ? ... ? ? ? ? ? ? X
Even though P203 is destined to walk the plank, this doesn't mean that he plays no part in the proceedings. On the contrary, P204 now knows that P203's sole aim in life is to avoid having to propose a division of the spoils. So P204 can count on P203's vote, whatever P204 proposes. Now P204 just squeaks home --- he can count on his own vote, P203's vote, and 100 others from bribes of one gold coin: 102 votes in all, the necessary 50%. So P204 will propose
P1 P2 P3 P4 ... P197 P198 P199 P200 P201 P202 P203 P204
* 0 * 0 ... * 0 * 0 * * 0 0
What of P205? He is not so fortunate! He cannot count on the votes of P203 or P204: if they vote against him they have the fun of throwing him overboard and can still save themselves. So P205 gets a X. So does P206 --- he can be sure of P205's vote, but that's not enough. In the same way, P207 needs 104 votes --- 3 plus his own plus 100 from bribes. He can get the votes of P205 and P206, but he needs one more and it's not available. So P207 gets a X as well.
P208 is more fortunate. He also needs 104 votes, but P205, P206, and P207 will vote for him! Add in his own vote and 100 bribes and he's in business. He must offer bribes to those pirates that would get 0 according to P204's proposal:
P1 P2 P3 P4 ... P198 P199 P200 P201 P202 P203 P204 P205 P206 P207 P208
0 * 0 * ... * 0 * 0 0 * * 0 0 0 0
Now a new pattern has set in, and it continues indefinitely. Pirates who make proposals (always to give themselves nothing and to bribe 100 of the first 200 pirates) are separated from each other by ever-longer sequences of pirates who will be thrown overboard no matter what proposal they make --- and whose vote is therefore ensured for any fiercer pirate's proposal. The pirates who avoid this fate are numbers P201, P202, P204, P208, P216, P232, P264, P328, P456,... and so on --- 200 plus a power of 2.
It remains to work out who are the lucky recipients of the bribes, just to make sure they will accept them. As I said, the solution is not unique, but one way to do this is for P201 to offer bribes to the odd-numbered pirates P1-P199, for P2 to offer bribes to the even-numbered pirates P2-P200, then P204 to the odd numbers, P208 to the evens, and so on, alternating from odd to even and back.
At any rate, we conclude that with 500 pirates and optimal strategy, the first 44 pirates are thrown overboard, and then P456 offers one gold piece to each of P1, P3, P5 ..., P199. Thanks to their democratic system, the pirates have arranged their affairs so that the very fierce ones mostly get thrown overboard, and at best can consider themselves lucky to escape with none of the loot. Only the 200 meekest pirates can expect anything, and only half of them.
Truly, the meek shall inherit the worth.