Join Date: Feb 2005
Posts: 4,165
It can be done with inclusion-exclusion, but the 2nd and 3rd terms are uglier than heehaww's.
His first term is correct: the expected number of "unmarked" villagers is 15(4/5)^n after the n-th round, and after 13 rounds this is 0.8246. This means that there is less than a 82.46% chance of it happening, because there may be more than one villager.
At each round, if there are k unmarked villagers, and we randomly draw 3 people to mark, there will be k-i unmarked villagers in the next round with probability (k C i ) (15-k C 3-i) / (15 C 3).
After 1 round, there will always be 12 unmarked villagers.
After 2 rounds, there will be 9, 10, 11, or 12 unmarked villagers, with probability 44/91, 198/455, 36/455, and 1/455 respectively.
We can find the exact distribution of outcomes after n rounds by building ourselves a Markov chain. If we do so, we'll find the situation after 13 rounds is:
No unmarked villagers - 39.30% (176798797208157578719462291776 / 449882097396212029388505859375, if you want to be precise);
1 - 42.05%
2 - 15.78%
3 - 2.65%
4 - 0.21%
5 - 0.008%
6 - 0.00013%, etc.
A person who didn't want to do all that multiplying might also have guessed that a Poisson distribution with mean 15(4/5)^n would get us close. It gets us at least sort of close: Poisson would have said P(0 unmarked villages)=.4384 instead of .4205.
Incidentally, you were somewhat unlucky to be one of only two left after 8 rounds: there's about a 30% chance of 2 or 3 left, and 15% chance of 1 or 4 left, at that point. Only a 2.7% chance of hitting everyone in the first 8 rounds.
Edited to add the correct inclusion-exclusion terms:
The first term is the number of people, times the chance of one person being missed every time, to the 13th power: 15(4/5)^13 = .8246.
The second term is the number of pairs of people, times the chance of both people being missed every time, to the 13th power: 15C2 * (13C3/15C3)^13 = 105 * (286/455)^13 ~ 0.2511
The third term: 15C3 * (12C3/15C3)^13 = 455 (220/455)^13 ~ .0359.
The true probability of at least one person being missed, .6070, is indeed between .8246-.2511=.4735 and .8245-.2511+.0359=.6094, and closer to the latter.
Last edited by Siegmund; 04-14-2019 at 02:07 AM.