next up previous
Next: About this document ...

Statistics 101: Homework 2

Exercise 1

a.
If A and B are independent,

\begin{displaymath}{\bf P}(A\vert B)={\bf P}(B\mbox{ and }A)/{\bf P}(A)={\bf P}(B){\bf P}(A)/{\bf P}(A)={\bf P}(B).\end{displaymath}

If A and B are independent, then not A and B are independent. So,

\begin{displaymath}{\bf P}(B\vert\mbox{not }A)=P(B\mbox{ and not }A)/{\bf P}(\mb...
...bf P}(B){\bf P}(\mbox{not }A)/{\bf P}(\mbox{not }A)={\bf P}(B).\end{displaymath}

Since ${\bf P}(A\vert B)={\bf P}(B)$ and ${\bf P}(B\vert\mbox{not }A)={\bf P}(B)$, then ${\bf P}(B\vert A)={\bf P}(B\vert\mbox{not }A)$.
b.
First note that

\begin{displaymath}{\bf P}(\mbox{not }A\mbox{ and }\mbox{not }B)=1-{\bf P}(A\mbox{ or }B)=1-{\bf P}(A)-{\bf P}(B)+{\bf P}(A\mbox{ and }B).\end{displaymath}

Since A and B are independent we get that

\begin{displaymath}{\bf P}(\mbox{not }A\mbox{ and }\mbox{not }B)= 1-{\bf P}(A)-{\bf P}(B)+{\bf P}(A){\bf P}(B)\end{displaymath}


\begin{displaymath}=(1-{\bf P}(A))(1-P(B))={\bf P}(\mbox{not }A){\bf P}(\mbox{not }B). \end{displaymath}

c.
Let

\begin{displaymath}A_1=\{A\mbox{ wins the first match and }A\mbox{ wins the second match}\},\end{displaymath}


\begin{displaymath}A_2=\{A\mbox{ wins the first match and }B\mbox{ wins the second match}\},\end{displaymath}


\begin{displaymath}A_3=\{B\mbox{ wins the first match and }A\mbox{ wins the second match}\},\end{displaymath}


\begin{displaymath}A_4=\{B\mbox{ wins the first match and }B\mbox{ wins the second match}\}.\end{displaymath}

We have

\begin{displaymath}\begin{array}{cl}
{\bf P}(A_1)&={\bf P}(A\mbox{ wins the firs...
...t A\mbox{ wins the first match})\\
&=0.5*0.6=0.3,
\end{array}\end{displaymath}


\begin{displaymath}\begin{array}{cl}
{\bf P}(A_2)&={\bf P}(A\mbox{ wins the firs...
...t A\mbox{ wins the first match})\\
&=0.5*0.4=0.2.
\end{array}\end{displaymath}

Since ${\bf P}(A_1)+{\bf P}(A_3)=.5$ (A wins the second match) we get that ${\bf P}(A_3)=0.2$. Hence,

\begin{displaymath}{\bf P}(A\mbox{ wins at least one match})={\bf P}(A_1)+{\bf P}(A_2)+{\bf P}(A_3)=0.7.\end{displaymath}

Exercise 2

a.
As usual, a good start is to write out the probability information stated in the problem.

\begin{displaymath}\begin{array}{lll}
{\bf P}(\mbox{failure})=0.60 & {\bf P}(\mb...
...ajor})=0.30 & {\bf P}(\mbox{good$\vert$major})=0.60
\end{array}\end{displaymath}

We now that ${\bf P}(\mbox{failure})=0.60$ and ${\bf P}(\mbox{poor$\vert$ failure})=0.5$. By the multiplication principle we find

\begin{displaymath}{\bf P}(\mbox{failure and poor})=0.6*0.5=0.3.\end{displaymath}

Repeating this process for all combination of results and ratings we construct the table of joint probability.

    Panel rating
    Poor Fair Good
  Failure 0.30 0.18 0.12
Result Moderate 0.06 0.12 0.12
  Major 0.01 0.03 0.06
Finally,

\begin{displaymath}{\bf P}(\mbox{failure$\vert$good})=\frac{{\bf P}(\mbox{ failure and good})}{{\bf P}(\mbox{good})}=0.12/0.30=0.40\end{displaymath}

b.
The tree diagram is shown below:
For the first branch of a probability tree, we pick the events that have known unconditional probabilities. In this case, that's the introduction results - failure, moderate, or major. For the next branches, we use the ratings - poor, fair, or good - and insert conditional probabilities given the preceding branch. Multiplying the probabilities on all paths completes the tree.

To find the conditional probability of major given poor, we need ${\bf P}(\mbox{major and poor})=0.01$ from the seventh path on the tree. Also, ${\bf P}(\mbox{poor})$ can be obtained by adding the probabilities for all paths including a poor rating. ${\bf P}(\mbox{poor})=0.3+0.06+0.01=.37$. Thus

\begin{displaymath}{\bf P}(\mbox{major$\vert$poor})=0.01/0.37=0.027.\end{displaymath}

Exercise 3

Two observations: Thus, we get that Since 0.3>0.21 and 0.255 it is best for A to shoot in the air. And respectively, ${\bf P}(A\mbox{ wins})=0.3$; ${\bf P}(B\mbox{ wins})=0.175$; ${\bf P}(C\mbox{ wins})=0.35$; ${\bf P}(B\mbox{ wins})=0.175$.

Exercise 4

a.
We must take into account not only the number of deductions, but also the frequencies. We could simply use the definition of the mean. Add up 0 201 times, 1 287 times, and so on, then divide by the total frequency. Or we could regard the data as grouped. In either case

\begin{displaymath}\bar{y}=\frac{0*201+1*287+\cdots+12*3}{201+287+\dots+3}=2.477.\end{displaymath}

By calculator, s=1.834. Because there are 1533 observations, it would make virtually no difference whether we divided by n or n-1. In fact, calculations show $\sigma=1.833$, calculated by dividing by n.

Recall that about 68$\%$ of the data should fall within one standard deviation of the mean. The range $\bar{y}\pm s$ is 0.643 to 4.311. It includes the values 1, 2, 3, and 4 and contains 1134 observations. This is 1134/1533=0.740 or 74$\%$ of the data. The Empirical Rule works surprisingly well, despite the skewness of the data.

b.
There is 1533 forms, of which 1332 claim at least 1 deduction.

\begin{displaymath}{\bf P}(\mbox{at least 1 deduction})=1332/1533=0.869.\end{displaymath}

We note that 287+364+332=983 forms claim at least 1 and at most 3 deductions, so

\begin{displaymath}{\bf P}(\mbox{at most 1$\vert$at least 1})=\frac{{\bf P}(\mbo...
...}{{\bf P}(\mbox{at least 1})}=\frac{983/1533}{1332/1533}=0.738.\end{displaymath}

Alternatively, there are 1332 forms with at least one deduction, of which 983 claim at most three deductions. Therefore,

\begin{displaymath}{\bf P}(\mbox{at most 1$\vert$at least 1})=\frac{983}{1533}=0.738.\end{displaymath}

Exercise 5

a.
Because the firm holds 30$\%$ of the market, we assume that the probability of winning any one bid is 0.30. The company can win $y=3,2,1, \mbox{ or } 0$ bids. To have Y come out equal to 3, the company must win all the bids.

\begin{displaymath}{\bf P}(Y=3)={\bf P}(WWW)=0.3*0.3*0.3=0.027.\end{displaymath}

Similarly,

\begin{displaymath}{\bf P}(Y=0)={\bf P}(LLL)=0.7*0.7*0.7=0.343.\end{displaymath}

The event Y=1 can occur in three different ways, corresponding to the choice of which of the three bids wins.

\begin{displaymath}{\bf P}(Y=1)={\bf P}(WLL)+{\bf P}(LWL)+{\bf P}(LLW)=0.3*0.7*0.7+0.7*0.3*0.7+0.7*0.7*0.3=0.441.\end{displaymath}

Similarity,

\begin{displaymath}{\bf P}(Y=2)={\bf P}(WWL)+{\bf P}(WLW)+{\bf P}(LWW)=0.189.\end{displaymath}

We assumed that the probability of winning was the 0.3 on all three bids and that whether or not one bid won didn't change the probability that any other bid would win. If there are no systematic difference among the bids, we can't think of any reason why the probability would change. Barring collusion, there's no obvious reason for dependence, either. The assumptions seem at least fairly reasonable.

b.

\begin{displaymath}{\bf E}(Y)=0*{\bf P}(Y=0)+\cdots+3*{\bf P}(Y=0)=0.9\end{displaymath}

and

\begin{displaymath}\mbox{Var}(Y)=0^2*(0.343)+1^2*(0.441)+2^2*(0.189)+3^2*(0.027)-(0.9)^2=0.63.\end{displaymath}

c.
The expected value shouldn't change. On average, the company should still win 30$\%$ of all its bids, and 30$\%$ of 3 is 0.9, as we found in the the previous exercise. What should change is the variance. The statement of the exercise indicates that there's higher probability that the company will either win all its bids or lose all its bids, that is, that Y will come out either 3 or 0. Therefore, there must be a smaller probability that Y will come out 1 or 2. If there is higher probability on the more extreme values, the variance must be larger.

Exercise 6

a.
The value of Y1 depends on the number of Reds on four consecutive rolls. This number has a binomial distribution. The probability distribution of Y1 is shown below:
Y1 0 16 32 48 64
${\bf P}$ 1/16 4/16 6/16 4/16 1/16
b.
We lose only if we get five Reds in a row, otherwise we win 1 dollar. The probability distribution of Y2 is shown below:
Y2 1 33
${\bf P}$ 1/32 31/32
c.
First criterion: Expected Value. Both have expectation of $\$$32, so same on this criterion.

Second criterion: Median. Median of Y1 is 32 but median of Y2 is 33, so Y2 is better.



 
next up previous
Next: About this document ...
Wenxin Mao
1999-10-05