I have only skimmed the above posts so that I can post my own derivations without being accused of copying.
Without loss of generality, I assume T>N1>=N2>=N3>0.
Quote:
(1) Given two vectors V1 and V2 of length N1 and N2, respectively, each consisting of unique integers from 1 to T (note that uniqueness only applies to each individual vector, there may be overlap between the two vectors) chosen at random.
What is the probability that the union of V1 and V2 contain all of the integers from 1 to T?
Choose any of the V1 vectors. How many of the possible V2 vectors will fill in all the missing holes to achieve total coverage of 1 to T?
Clearly there are T-N1 holes to be filled. So T-N1 slots of V2 are spoken for.
This means that there are N2-(T-N1) slots available for duplication. And these available slots can take on T-(T-N1) values.
So, doing a little bit of algebra, this implies that this Prob:
= C(N1,N1+N2-T) / C(T,N2)
Quote:
(2) Given three vectors V1, V2, and V3 of length N1, N2, and N3, respectively, each consisting of unique integers from 1 to T (note that uniqueness only applies to each individual vector, there may be overlap between the three vectors) chosen at random.
What is the probability that the union of V1, V2, and V3 contain all of the integers from 1 to T?
This can be derived by considering choosing [V1,V2] at random, and then add [V3] using the formula derived above.
The number of duplicated elements in U[V1,V2] = D can take values 0,1,...,N2.
It is clear that the probability of two randomly chosen V1 and V2 vectors having D duplicated elements is:
P(D) = C(N1,D)*C(T-N1,N2-D) / C(T,N2)
Now let U[V1,V2] be denoted as U. And further let UU be the unique elements of U. Clearly, the number of elements of UU is N1+N2-D.
Using the same formula derived above, we see that the prob of having UU and V3 cover T to be:
= C(N1+N2-D,N1+N2-D+N3-T) / C(T,N3)
Combining these two formulas, that is, weighting these probs by the prob of each D, yields the overall prob:
= Sum[D=0,...,N2] C(N1,D)*C(T-N1,N2-D)/C(T,N2) * C(N1+N2-D,N1+N2+N3-T-D)/C(T,N3)
Quote:
(3) Same problem as (2) except that we now further require that no union of any pair of vectors contain all of the integers from 1 to T. That is, V1 & V2 do not cover T; V1 & V3 do not cover T; and V2 & V3 do not cover T.
I believe that I have correctly derived this formula, but I am not 100% sure. My method is to start with the prob derived in (2) and multiply it by the prob of having no pairwise coverage.
Of course, the pairwise coverage cases are as follows:
1. [V1,V2] only cover
2. [V1,V3] only cover
3. [V2,V3] only cover
4. [V1,V2] and [V1,V3] only cover
5. [V1,V2] and [V2,V3] only cover
6. [V1,V3] and [V2,V3] only cover
7. [V1,V2] and [V1,V3] and [V2,V3] cover
We want to find the probability of the converse case, of course.
The prob for each of the individual pairwise cases was derived in (1) above. I believe that independence can be used to show the prob of the "twin" pairwise cases is simply the product of the two individual pairwise probs. However, I believe that the "triple" pairwise case is not independent of the other cases.
I believe that the prob of no pairwise cases is equal to the following:
= 1 - [(1-P[(V1,V2) do not cover])*(1-P[(V1,V3) do not cover])*(1-P(V2,V3) do not cover]) + P[(V1,V2) cover]*P[(V1,V3) cover]*P[(V2,V3) cover]]
Call the above PNPC (for Prob of No Pairs Cover).
Then, the prob for (3) is:
= PNPC * Prob for (2) derived above.