Calculate variance, standard deviation ...
Variance and Standard Deviation for Conditional Discrete Distributions In the previous readings, we... Read More
Let \(X_1, X_2,\ldots,X_n\) be random variables and let \(c_1, c_2,\ldots, c_n\) be constants.
Then,
$$ Y=c_1X_1+c_2X_2+\ldots+c_nX_n $$
is a linear combination of \(X_1, X_2,\ldots, X_n\).
In this reading, however, we will only base our discussion on the linear combinations of independent normal random variables.
In statistics, it is usually assumed that a sample is drawn from a population that is normally distributed with mean \(\mu\) and variance \(\sigma^2\).
Suppose we have two random variables \(X_1\sim N(\mu_1,\sigma_1^2)\) and \(X_2 \sim N(\mu_2,\sigma_2^2)\), then \(S=X_1+X_2 \sim N\left(\mu_1+\mu_2,\sigma_1^2+\sigma_2^2\right)\).
We can extend the this to more than two random variables:
Now, let \(X_1, X_2,\ldots,X_n\) be independent normal random variables with means \(\mu_1, \mu_2,\ldots,\mu_n\) and variances \(\sigma_1^2, \sigma_2^2,{\ldots,\sigma}_n^2\), respectively. Then, it can be proven that \(X_1+ X_2+\ldots+X_n\) is normally distributed with mean \(\mu_1+ \mu_2+\ldots+\mu_n\) and variance \(\sigma_1^2+ \sigma_2^2+{\ldots+\sigma}_n^2\). This implies that the sum of independent Normal distributions is Normal.
Similarly, we can prove that the difference of independent Normal distributions is Normal.
If \(X_1\sim N(\mu_1,\sigma_1^2)\) and \(X_2\sim N(\mu_2,\sigma_2^2)\), then \(D=X_1-X_2\sim N\left(\mu_1-\mu_2,\sigma_1^2+\sigma_2^2\right)\).
This is because:
$$ E\left[D\right]=E\left[X_1-X_2\right]=E\left[X_1\right]-E\left[X_2\right]=\mu_1-\mu_2 $$
and,
$$ \begin{align*} Var\left(D\right) & =Cov\left(D,D\right)=Var\left(X_1\right)-2Cov\left(X_1,X_2\right)+Var\left(X_2\right) \\
& =Var\left(X_1\right)-0+Var\left(X_2\right) \\
& =Var\left(X_1\right)+Var\left(X_2\right) \\ & =\sigma_1^2+\sigma_2^2 \end{align*} $$
We can calculate the probability for linear combinations of independent normal random variables. This is illustrated in the following examples:
A certain brand of refrigerator has a useful life that is normally distributed with mean 10 years and standard deviation 3 years. The useful lives of these refrigerators are independent. Calculate the probability that the total useful life of two randomly selected refrigerators will exceed 1.9 times the useful life of a third randomly selected refrigerator.
Solution
Let \(X_i\) be the random variable representing the useful life of refrigerator \(i\), \(i=1, 2, 3\)
\(\left\{X_i\right\}\) is an iid collection of r.v.’s with common distribution \(X\sim N(\mu=10,\sigma^2=9)\)
We seek \(Pr\left(X_1+X_2 \gt 1.9X_3\right)=Pr\left(X_1+X_2-1.9X_3 \gt 0\right)\)
Let \(S=X_1+X_2-1.9X_3\)
Now,
$$ \begin{align*} E\left[S\right] & =E\left[X_1\right]+E\left[X_2\right]-E\left[1.9X_3\right] \\
& =E\left[X_1\right]+E\left[X_2\right]-1.9E\left[X_3\right] \\
& =E\left[X\right]+E\left[X\right]-1.9E\left[X\right] \ \left(\text{Identically distributed}\right) \\
& =10+10-1.9\left(10\right)=1
\end{align*} $$
And,
$$ \begin{align*}
Var\left(S\right) & =Var\left(X_1\right)+Var\left(X_2\right)+Var\left(-1.9X_3\right)\ \ \ \ (\text{Independence}) \\
& =Var\left(X_1\right)+Var\left(X_2\right)+\left(-1.9\right)^2Var\left(X_3\right) \\
& =Var\left(X\right)+Var\left(X\right)+\left(-1.9\right)^2Var\left(X\right)\ \ \ \text{Identically Distributed} \\
& =9+9+\left(-1.9\right)^2\left(9\right)=50.49 \end{align*} $$
$$ \therefore S\sim N(\mu_S=1, \sigma_S^2=50.49) $$
$$ \begin{align*} \Pr{\left(S \gt 0\right)} &=\Pr{\left(\frac{S-\mu_S }{\sigma_S} \gt \frac{0-1}{\sqrt{50.49}}\right)}\ \ \ \ \ (\text{Standardize}) \\
&=\Pr{\left(Z \gt -0.1407\ldots\right)} \\
& =\Pr{\left(Z \lt 0.1407\ldots\right)}\approx0.5557 \\
\end{align*} $$
The annual incomes of three businessmen are normal with means 1000, 2000 and 3000, respectively. The variances of income are 750, 950, and 800, respectively. Suppose that the annual incomes are independent.
Find the probability that the total annual income is exactly 6,100.
Let \(X_1, X_2,X_3\) be the random variables for the income of the three businessmen, respectively.
Since, \(X_1, X_2,X_3\) are normal with means 1000, 2000 and 3000 and variances 750, 950 and 800, respectively, then \(X_1+X_2+X_3\) is normal with mean \(1,000+2,000+3,000=6,000\) and variance \(750+950+800=2,500\).
We wish to find,
$$ P\left(X_1+X_2+X_3=3,000\right)=P\left(Z=\frac{6,100-6,000}{\sqrt{2,500}}\right)=P\left(Z=2\right) $$
and from the standard normal table,
$$ P\left(Z=2\right)=0.9772 $$
Let \(X_1\) and \(X_2\) be the random variables for the amount of sickness benefits given to two individuals, A and B, respectively. \(X_1\) and \(X_2\) are independent with the following probability density functions:
$$ f\left(x_1\right)= \left\{ \begin{matrix} \frac{1}{32\pi}{e^{-\frac{1}{2}\left(\frac{x_1-20}{4}\right)^2}}, & x_1 \geq 0 \\ 0,& \text{elsewhere} \end{matrix} \right. $$
and,
$$ f\left(x_2\right)= \left\{ \begin{matrix} \frac{1}{18\pi}{e^{-\frac{1}{2}\left(\frac{x_2-20}{3}\right)^2}}, & x_2 \geq 0 \\ 0,& \text{elsewhere} \end{matrix} \right. $$
Find \(P(X_1+X_2\le 45)\)
Solution
From the given pdfs, we can see that \(X_1\) and \(X_2\) are normal random variables with means 20 and 30, respectively, and variances 16 and 9, respectively.
Now, since \(X_1\) and \(X_2\), are independent, then \(X_1+X_2\) is normal with mean 20+30=50 and variance \(16+9=25\).
We wish to find,
$$ \begin{align*}
P\left(X_1+X_2\le 45\right) & =P\left(Z<\frac{45.5-50}{\sqrt{25}}\right) \\
& =P\left(Z\le-0.9\right) \\
& =1-0.8159 \\
& =0.1841
\end{align*} $$
Alternatively, we can find this probability by integration,
$$ \begin{align*} P\left(X_1+X_2\le 45\right) & =\int_{0}^{45.5}{\frac{1}{\sqrt{25\times2\times\pi}}e^{-\frac{1}{2}\left(\frac{x_2-50}{5}\right)^2}} \\
&=0.1841
\end{align*} $$
An actuary is analyzing the annual number of accidents in two neighboring cities, \(P\) and \(Q\), for its insured clients. Let \(X\) be the total annual number of accident claims from town \(P\) and let \(Y\) be the total annual number of accident claims from town \(Q\). \(X\) is normal with mean 10 and standard deviation 2. \(Y\) is also normal with mean 9 and standard deviation 2.5. Assume that X and Y are independent.
Calculate \(P(X+Y\geq 5)\).
Solution
Since \(X\) and \(Y\) are independent normal random variables, then \(X+Y\) is also normal with mean \(10+9=19\) and variance \(4+6.25=10.25\).
Now,
$$ \begin{align*} P\left(X+Y\geq5\right) & =P\left(Z\gt \frac{4.5-19}{\sqrt{10.25}}\right) \\
& =P\left(Z \gt -1.874\right)=0.9695
\end{align*} $$
Let \(X\) and \(Y\) be the number of days of hospitalization for two individuals, \(M\) and \(N\), respectively. \(X\) and \(Y\) are normally distributed as \(X\sim N(20, 9)\) and \(Y\sim N(11, 7)\). Assume that \(X\) and \(Y\) are independent.
Find the probability that the difference between the number of days of hospitalization for \(M\) and \(N\) does not exceed 3.
Solution
Since \(X\sim N(20, 9)\) and \(Y\sim N(11, 7)\) and that \(X\) and \(Y\) are independent, then the random variable,
$$ X-Y\sim N\left(20-11, 9+7\right)\Rightarrow X-Y \sim (9, 16). $$
We wish to find,
$$ \begin{align*} P\left(X-Y\le3\right) & =P\left(Z<\frac{3.5-9}{\sqrt{16}}\right) \\ & =P(Z \lt -1.375) \\ & =1-0.9155=0.0845 \end{align*} $$
Below is the pdf for the standard normal table required for this reading:
Please click the above icon to view the table.
Question
Consider two independent machines \(A\) and \(B\) producing screws. The lengths of the screws from machine A are normally distributed with a mean of 5 cm and a standard deviation of 0.1 cm. The lengths of the screws from machine B are normally distributed with a mean of 5.1 cm and a standard deviation of 0.15 cm.
A quality control engineer selects one screw at random from each machine and combines them into a pair. What is the probability that the total length of the pair of screws exceeds 10.3 cm?
- 0.1587
- 0.1335
- 0.8413
- 0.9772
- 0.9956
Solution
The correct answer is B.
To find the probability that the total length of the pair of screws exceeds 10.3 cm, we first need to determine the distribution of the sum of their lengths. Since the lengths are independent normal random variables, the sum will also be normally distributed. The mean of the sum will be the sum of the means, and the variance of the sum will be the sum of the variances (since standard deviation is the square root of variance and variance is additive for independent random variables).
The mean of the total length is:
$$ \mu_{\text{total}}=\mu_A+\mu_B=5+5.1=10.1\ cm $$
The variance of the total length is:
$$ \sigma_{\text{total}}^2=\sigma_A^2+\sigma_B^2=0.01+0.0225=0.0325\ cm^2 $$
The standard deviation of the total length is the square root of the variance:
$$ \sigma_{\text{total}}=\sqrt{0.0325}=0.1803\ cm $$
Now, we convert the question into a standard normal distribution problem by finding the Z-score for 10.3 cm:
$$ Z=\frac{X-\mu_{\text{total}}}{\sigma_{\text{total}}}=\frac{10.3-10.1}{0.1803}=1.11 $$
Using the standard normal distribution table, a Z-score of 1.11 corresponds to a probability of approximately 0.8665. This is the probability that the total length is less than 10.3 cm. We want the probability that the total length exceeds 10.3 cm, so we subtract this value from 1:
$$ P\left(X \gt 10.3\right)=1-P\left(X \lt 10.3\right)=1-0.8665=0.1335 $$
The probability that the total length of the pair of screws exceeds 10.3 cm is therefore approximately 0.1335.
Learning Outcome
Topic 3. g: Multivariate Random Variables – Calculate probabilities for linear combinations of independent normal random variables.