Linear Regression
Stats 67
Least Squares Regression Line
r2 is the coefficient of determination which describes the strength of the line (or how accurate it is)
- Can be used to interpret the data: “_% of the variance in y can be explained by x”
r represents the direction and the strength of the relationship (weak/moderate/strong, negative/posutive)
- “There is a (strength) (nega/posi) relationship betweenn IV and DV. As IV increases, DV (increases/decreases) on average.”
r2 is resistant to outliers while r is not
Formulas for the LSRL:
Must make three assumptions to make a model:
- The experimental units are independent from each other
- The error follows a Normal distribution
- The residual plot should look random
Distributions: