2.1

We note that \(Z\) is continuous, so we can interchange \(\leq\) and \(<\) as we want. Answers from using the normal table:

  1. 0.023

  2. 0.999

  3. 0.000 (Using R, more precisely: 0.0000425)

2.2

Regarding \(W\) we can use the formulas for mean and standard deviation in the compendium, and the fact that \(W\) becomes normal, allows us to use standardization to answer the questions.

We can note that in all three cases, the mean is the same (960), since this is not affected by correlations. The standard deviations on the other hand will differ, and it may be interesting to see how. Note that in all cases, question 2 is answered by the general formula

\[b = \mu_w + z_{0.05} \cdot \sigma_w = 960 + 1.645\cdot \sigma_w\;,\]

i.e. only \(\sigma_w\) will differ in the three cases. This can be seen if you solve the question of finding \(b\) such that

\(P[W \leq b] = 0.95\) for a general normal variable with parameters \(\mu_w, \sigma_w\). Below we give the results of this in the three cases.

  1. We get \(\sigma_w = 97.6, \ b = 1121\).

  2. Now \(\sigma_w = 72.1, \ b = 1079\).

  3. Now \(\sigma_w = 57.3, \ b = 1054\).

We see that as correlation goes from 0.90 downwards through 0.40, 0 and eventually to -0.40, the standard deviation

\(\sigma_w\) decreases. Accordingly, the 5% worst case cost comes down from 1121 to 1054. By definition of \(W\) the risk comes from variation in both \(X\) and \(Y\). In a case where \(\rho_{xy}\) is high positive, scenarios where both costs are high become relatively probable and total risk is high. When the correlation drops, we will more often see high \(X\) together with low \(Y\) and vice versa. More of the individual risk in \(X\) and \(Y\) is cancelled out, as the correlation drops.

2.3

  1. \(\mu_1 = 20, \sigma_1 = \sqrt{8} = 2.83\).

  2. \(\mu_2 = 20, \sigma_2 = 4\).

  3. \(\mu_3 = 0, \sigma_3 = \sqrt{8} = 2.83\). (Note that here \(Y = X - Y = aX + bY\) with \(a = 1, b = -1\))

  4. \(\mu_4 = 15, \sigma_4 = 2\). (You can treat 5 as a random variable with 0 variance).

  5. \(\mu_5 = 25, \sigma_5 = \sqrt{8}\). (This is the same as previous question, just replacing \(X\) by \(X+Y\).)

2.4

  1. The sample size is large enough (\(n \geq 30\)) to allow the use of so-called z-interval, i.e. we may use the formula \[ \bar{x} \pm z_{_{\alpha/2}}\frac{S}{\sqrt{n}}. \] The resulting interval is [16.51, 17.49].

  2. We find \(\hat{p} = 0.25\), we see that \(n\hat{p}(1 - \hat{p}) = 75\) so we are safe when using the formula from the text. We use \[ \hat{p} \pm z_{_{\alpha/2}}\sqrt{\frac{\hat{p}(1 - \hat{p})}{n} } \] to get the interval [0.207, 0.292]

  3. In case (i) \(ME\) would be smaller. (Because we divide by \(\sqrt{n}\).) In case (ii) \(ME\) would be larger. (Because \(z_{0.005} > z_{0.025}\).)

  4. Apparently, a small number of flats takes exceptionally long time before sale. In such cases the median can be a better choice to describe typical characteristics. The mean can be misleading in the sense that almost all of the sample (and thus probably the population) lies below the mean. This obviously does not happen with the median.

2.5

  1. Use the formula… \[ \bar{x} \pm z_{0.025}\frac{s}{\sqrt{n}} = 164 \pm 1.715. \]

  2. As the value 160 is relatively far below the lower confidence limit for \(\mu\), it is highly unlikely that we have \(\mu < 160\). It appears to be clear that the goal is not reached.

  3. Using the standard formula (noting that the sample is by far large enough) we get \[ \hat{p} \pm z_{0.025} \sqrt{\frac{\hat{p}(1 - \hat{p})}{n}} = 0.125 \pm 0.016 \]

  4. Here (i) is true, while (ii) is false.

2.6

  1. Let \(X\) denote a weeks sale. Then we use that \(X\) is normal. Standardization leads to

\[ P[X < 1050] = P[Z < 1.5] = 0.9332. \]

  1. We need to find \(b\) such that \(P[X > b] = 0.05\). Some calculations based on the standard normal leads to \(b = \mu + z_{0.05} \sigma\) = 1064.5

  2. Let \(Y\) denote the profit for a given week. Then \(Y = -200 + 1.10 X\) is a linear function of \(X\). It follows that

\[ \mu_y = -200 + 1.10 \mu_x = 790, \quad \sigma_y = 1.10 \sigma_x = 110. \]

2.7

  1. The loading time is a \(N(\mu,\sigma)\) variable, so by the empirical rule, \[ \mu \pm 2 \sigma \] gives the approximate interval [3,9]. Using 1.96 rather than 2 gives a slightly more exact interval.

  2. This is found by standardizing the normal variable \(T\) = loading time. The probability is 0.8413.