Approximate Discrete Distribution With Continuous X4
Approximation means that, under certain conditions, another distribution provides a description of the data which is similar to the distribution the data were sample from. The limit theorems (e.g. Central Limit Theorem) provide theoretical tool for deriving such approximations. These limit theorems can be used to approximate a number of common distributions. Since these are approximations to the true distribution there may be some errors in the approximation. However, there are methods for evaluating the quality of the approximation. In the what follows we present approximations for a number of distributions as well as some of the criteria that can be used to evaluate the quality of these approximations. Normal distribution as limit of other distributions:
-
Approximation of Binomial distribution by Normal distribution:
This approximation is based on Laplace and DeMoivre's limit theorem.Let be independent, Bernoulli distributed random variables with and for all i. Then is random variable with Binomial distribution B(n,p), expected value E(X) = np and variance Var(X) = np(1-p).For , the distribution of the standardized random variable
converges to a standard Normal distribution N(0;1). For large n we have:
with the expected value and variance .
Since the Binomial distribution is discrete and the Normal distribution is continuous we improve the quality of approximation by using a continuity adjustment:
A rough rule of thumb for a good approximation for the Binomial distribution requires:np 5 or n(1-p) 5 .
-
Approximation of the Poisson distribution by the Normal distribution
The Poisson distribution with = np can be derived from a Binomial distribution. Since the Binomial distribution can be approximated by the Normal distribution this suggests that the Normal distribution can also approximate the Poisson distribution.
Let X be a random variable with the distribution PO( ). Then for large we approximate the Poisson distribution using a Normal distribution with expected value and variance (with the continuity correction):
The rule of thumb for a "reasonable " approximation requires: 10
-
Approximation of Hypergeometric distribution by Normal distribution
Let and . Then a random variable with Hypergeometric distribution can be approximated using a Normal distribution with the parameters:
We can also use the continuity correction to improve the approximation.
-
Approximation of Hypergeometric distribution by Binomial distribution
The Binomial and Hypergeometric distributions use different sampling methods: the Binomial distribution uses draws with replacement and the Hypergeometric distribution uses draws without replacement. As M and N increase, M/N converges to a constant p, the difference between these two distributions becomes much smaller. As N (and M ) the Hypergeometric distribution converges to a Binomial distribution. This implies: for large N and M as well as small n/N , the Hypergeometric distribution can approximated by the Binomial distribution with parameters p = M/N. The rule of thumb requires: n/M 0.05 .
-
Approximation of Binomial distribution by Poisson distribution
The Poisson distribution can also derived from a Binomial distribution. Consequently, the Binomial distribution can also be approximated by a Poisson distribution PO( =np), if n is large and the probability p is small.Rule of thumb: and p 0.05.
In a certain town, one house in each 100 is damaged every year because of storms. What is the probability that storms damage four houses in a year if the town contains 100 houses?For each house, there are only two possible outcomes – "damage" and "no damage". The probabilities of these outcomes are constant: and . The random variable = {number of damaged houses} has the binomial distribution . We compute the probability : We could also use the Poisson distribution (with parameter ) to approximate this probability since the conditions for a good approximation are satisfied: We see that the probabilities and are fairly close. More generally, the approximation is also good at other points in the distribution.
| 0 | 0.36603 | 0.36788 |
| 1 | 0.36973 | 0.36788 |
| 2 | 0.18486 | 0.18394 |
| 3 | 0.06100 | 0.06131 |
| 4 | 0.01494 | 0.01533 |
| 5 | 0.00290 | 0.00307 |
| 6 | 0.00046 | 0.00051 |
| 7 | 0.00006 | 0.00007 |
| 8 | 0.00000 | 0.00000 |
After a storm, there are damaged 300 houses out of total 2000 in a region. What is the probability that there are exactly 2 damaged houses in 10 randomly chosen houses? Again, there are only two possible outcomes for each house – "damage" and "no damage". Furthermore, , , and . The probability is equal to This calculation is fairly demanding. Fortunately, we can use Binomial distribution (with parameter to approximate this probability:
Based on experience, we know that 10% of tax returns from a certain town have errors. Using a sample of 100 tax returns from this town – what is the probability that 12 of them contain errors? There are only two possible outcomes for the experiment – "wrong" or "correct", with corresponding probabilities and . The random variables – "number of wrong tax returns from 100 randomly chosen ones" has the Binomial distribution . We need to compute the probability : If the value is not contained in the tables, we would have to compute, which might be fairly cumbersome. However, since the conditions for the validity of an approximation using the Normal distribution are satisfied ( and ), we could approximate the probability with a Normal distribution . The expected value and the variance of the Binomial distribution are: so we could use a distribution (see the diagram). Recall: for a continuous random variable, the probabilities are given by the area under the density and thus the probability of one specific value is always equal to zero, e.g., . Therefore, we subtract and add 0.5 to 12; this is a sort of continuity correction. Instead of (for the discrete variable) we use an interval for the continuous and is then approximated by , i.e., the area under the density between the points 11.5 and 12.5.
The tables only contain the distribution function of a random variable so we have to standardize the random variable : Using the Normal tables, we obtain and .Hence, The approximation works reasonably well, the error of the approximation is only 0.1052 - 0.0988 = 0.0064. We can also see that:
Source: https://wikis.hu-berlin.de/mmint/Basics:_Approximation_of_Distributions/en
0 Response to "Approximate Discrete Distribution With Continuous X4"
Post a Comment