Statistics 102H: Predictions and CI
Statistics 102H: Predictions and CI
Basic "theory"
Y = fit + noise
If everything goes well: relatively speaking fit has smaller variability
Prediction interval is based on SD of noise
Empirical rule:
+/- 1 = 2/3s of data
+/- 2 = 95% of data
+/- 3 = 99.8% (ivory soap) of the data
What can go wrong?
curvature
non-independence
hetroskadasticity
non-normality
Hence checking assumptions is necessary to believe intervals
Ignore fit inacuracy
start by assuming alpha, beta, sigma are perfictly estimated
What is the guess for y for a new observation? (Y-hat = alpha-hat + beta-hat X.)
Works if there is a lot of data
prediction intervals are basically flat
Try it with idealized regression? (Or use
diamond data
)
Homework
Compute the solution to the least squares equation I presented in class today. In other words, find the alpha and beta that minimize:
Sum(Y
i
- alpha - beta X
i
)
2
You can start looking at the
first short assignment.
Last modified: Mon Jan 27 13:35:48 2003