Sampling Probability#
A sampling probability is often one at which follows these sampling rules. For \(N\) individuals we wish gather the individuals into \(n\) subsets:
Each sample cannot be picked more than once (no replacement)
Order does not matter
The probability of a given subset following these two rules being chosen can be solved two ways:
Brute Force - Number of Configurations#
The brute force methods requires the knowledge of all possible configurations. This is the method that is often first taught in statistics and following our rules it sounds like we have to use combination (order does not matter). Recall the combination is defined as the number of configuration of group one can make with \(n\) things and each group has \(k\) things. Combination also simply quoted as the “\(n\) choose \(k\)” function which is,
The population of the data is \(N=10\), if you were to group the population into pairs (\(k=2\)) how many groups can you make?
Back to the sampling problem. Here the number of things are subsets and the the groups are the sample. We wish to ask what is the probability of the sample contain a certain subset. Let’s call this particular subset \(A\),
The sample are once again in pairs, but now my subsets are marked from \(A\) to \(Z\) (\(n=26\)), what is the probability of choosing \(A\) as part of my sample?
Probability by Recursion#
You may also find the probability by multiplying