So if regression doesn't prove causation, what does?

Getting started on your data analysis

  1. Histogram each variable (check for outliers, get to know your data)
  2. Scatter plot your continuous variables

  3. Edit data if necessary (you might have found some strange points)
  4. color code for some of the more important categorial variables (pink/blue say). Look at the previous histograms and scatter plots with the new colors--see if anything striking happens.
  5. Start multiple regression and run full model

  6. Build a reasonable model

  7. check your residuals (linearity via Y vs Y-hat plot), normality via a historgram and a normal probabiltiy plot, hetroskadasticy by looking at residuals vs your important X variables.
  8. Build another model
  9. repeat!

Observational studies vs controled experiments


Last modified: Thu Apr 13 13:10:42 2000