A week ago I sent the following to Allstat following the exchange re the airplane riddle. It never came back into my inbox, and it isn't in the Allstat archive, so I suspect it went into the wonted black hole. Possibly Allstat's spam filter rejects any message with RE: in the subject line - if so, we'd all better take note ... - - - - A McDonald's voucher seems scanty reward for Tim Earl's elegant proof! Others have rightly pointed out the trivial fact that this result depends critically on the presupposition that the numbers of seats and passengers are identical, call this number n. Equally, of course, it doesn't matter what n is. The result is trivially true when n = 2, and easily demonstrated for n = 3 and 4, which suggests both that the answer is 1/2 for all n, and also a way to develop a general proof - but it's the less neat one, and Tim has shown how all the detail can be bypassed. This seems to point to a general issue - there can often be two proofs of the same result, one neat, one more turgid and elaborate. In 1927 Edward Wilson formulated the score confidence interval for the single proportion - not that the concept of a score interval had been developed then. This interval resembles the default Wald standard error based interval, but with inversion so that the SE imputed to the proportion is based on the hypothesised value, not the empirical estimate. The Wald interval is of course symmetrical on an additive scale (unless truncated to ensure it lies within [0,1] - something that is often done with Wald intervals, in contrast to the Wilson interval which is boundary-respecting so that this is never needed). In 1994 I noticed, from empirical figures calculated to high precision, that the Wilson interval is symmetrical on a logit scale. I was stunned that apparently it was 67 years before anyone noticed this - seeing that the logit scale is such a natural one for proportions. But how to prove it? An hour later I had hacked out an inelegant proof involving surds. OK, but I was sure there must be a simpler proof. Two days later, I found it. The Wilson lower and upper limits, L and U, are roots of a certain quadratic. So we can express their product as a simple function of the coefficients in the quadratic. We get a similar expression for (1-L)(1-U), divide, and it all falls out. Or indeed, there may be more than two proofs. There are numerous proofs of Pythagoras' theorem, and also striking ways to demonstrate it that fall short of being actual general proofs. My favourite of these relates to shapes called P-pentominoes, which consist of 5 adjacent unit squares, 4 of them forming a 2 by 2 square, the other one stuck on to the side of one of the others. Difficult to draw in email, but it looks something like the following, hence shaped like the letter P. ** ** * An infinite 2-dimensional plane can be tesselated into an infinite array of these shapes, all in the same orientation. Each has area 5 units, and corresponding points on adjacent pentominoes are sqrt(2**2 + 1**2) = sqrt(5) units apart. Which all makes me wonder - did Fermat have a neat proof after all?? Slightly less plausible, given that mathematicians had sought a simple proof of his theorem for centuries, whereas no-one had noticed the logit scale symmetry result. How reasonable is it to expect that a simple result should have a simple proof? Robert G. Newcombe PhD CStat FFPH Professor of Medical Statistics Department of Primary Care and Public Health Centre for Health Sciences Research Cardiff University 4th floor, Neuadd Meirionnydd Heath Park, Cardiff CF14 4YS Tel: 029 2068 7260 Fax: 029 2068 7236 Home page http://www.cardiff.ac.uk/medicine/epidemiology_statistics/research/statistics/newcombe For location see http://www.cardiff.ac.uk/locations/maps/heathpark/index.html >>> "Upton, Graham J" <[log in to unmask]> 31/01/08 10:30 >>> Slick solution from Tim Earl, a PhD student. Let A be the first person and Z be the last. Although there may be many misplaced people, the outcome is certain as soon as some passenger chooses to sit in the seat of either A or Z. At all times these seats are equally likely to be chosen by the currently displaced person. Thus the probability that Z sits in the correct seat is 1/2. Very surprising! Graham Upton Prof. G. J. G. Upton, Professor of Statistics/Environmetrics Dept of Mathematical Sciences/Centre for Environmental Science University of Essex, Colchester, Essex. CO4 3SQ http://www.essex.ac.uk/maths/staff/upton/ Tel: 01206 873027; Sec:01206 872704; Fax: 01206 873043 >>> "Leonid V. Bogachev" <[log in to unmask]> 31/01/08 11:51 >>> Dear Suhal & allstat, The answer is 0.5. The puzzle can be modelled using a Markov chain on the integers 1,...,100, with transition probabilities as follows: from state 1 it can jump to any state (including 1) with equal probability: p(1,i)=1/100 (i=1,...,100); from state 2 it can jump, with equal probability 1/99, either to state 1 or to any of the subsequent states, 3,...,100: p(2,i)=1/99 (i=1 or i=3,...,100); ............. in general, from state k it can jump, with equal probability 1/(100-k+1), either to state 1 or to any of the subsequent states, k+1,...,100; ............ finally, from state 100, it jumps to state 1 with pobability 1. The idea behind this Markov chain is to follow the sitting allocations of "displaced" passengers: the 1st passenger can choose any seat; if he takes say seat 10 then passengers 2,...,9 will take their own seats, but passenger 10 will have to go to seat 1 or to any of the remaining seats 11,...,100, etc. Now, in order that the 100th passenger gets his own seat, all we need is that our Markov chain hits state 1 prior to state 100. In other words, we need to find the return probability f_1 (i.e., starting from state 1 to get back to 1 before getting to state 100). Introducing the similar probabilities f_k (k=1,...,100), we have a set of difference equations: f_1=1/100 +(1/100)*(f_2+...+f_100), f_2=1/99 + (1/99)*(f_3+...+f_100), f_3=1/98 + (1/98)*(f_4+...+f_100), ........ f_k=1/(100-k+1) + (1/(100-k+1))*(f_{k+1}+...+f_100), ........ f_98=1/3 + (1/3)*(f_99+f_100), f_99=1/2 + (1/2)*f_100, f_100=0 (boundary condition). Solving this system "in the reverse order" we obtain (by induction) f_99=1/2, f_98=1/3+(1/3)*(1/2)=1/2, ... f_k=1/(100-k+1) + (1/(100-k+1))*(100-k-1)*(1/2)=1/2, ... hence f_1=1/2, QED. Given such a simple answer, I don't quite like the solution above, as it is too technical. There must be a "smart" solution, but knowing the answer may help find it! Many thanks for a beautiful puzzle. Dr Leonid V. Bogachev Reader in Probability Department of Statistics Tel. +44 (0)113 3434972 University of Leeds Fax +44 (0)113 3435090 Leeds LS2 9JT bogachev'at'maths.leeds.ac.uk United Kingdom www.maths.leeds.ac.uk/~bogachev -----Original Message----- From: A UK-based worldwide e-mail broadcast system mailing list on behalf of Suhal Bux Sent: Wed 1/30/2008 16:23 To: [log in to unmask] I know this isn't strictly the right sort of post but it's interesting and is doing my head in... 100 passengers are queuing to get on a plane. Each has a ticket with a seat number and they are standing in order. The first man in the queue is crazy and will sit anywhere at random. The rest of the passengers will sit in their own seat, unless it is not available, in which case they will sit in another seat at random. What is the probability that the 100th passenger sits in his own seat? I'd appreciate proofs or good reasoning.