Goodness of Fit

How do we quantify the quality of fit?

The sum of squares of residuals ($S_{r}$) is a quantification of the error between the measured and predicted $y$ values after regression:

$$S_{r} = \sum_{i=1}^{n} (y_{i} - a_{0} - a_{1} x_{i})^{2}$$

The total sum of squares around the mean value is the magnitude of the residual error associated with the dependent variable prior to regression:

$$S_{t} = \sum_{i=1}^{n} (y_{i} - \bar{y})^{2}$$

We can then use the difference between $S_{t}$ and $S_{r}$ to quantify the improvement, or error reduction, gained by describing the data with a straight line rather than with the mean alone. To do this, we define:

Coefficient of Determination:

$$r^{2} = \frac{S_{t} - S_{r}}{S_{t}}$$

Correlation Coefficient: $r$

If we have $r^{2} = 1$, then $S_{r} = 0$: the line explains all of the variability of the data and is therefore a “perfect fit”. On the other hand, $r^{2} = 0$ means $S_{r} = S_{t}$, so the line is no improvement over simply using the mean, which is a poor fit.

Something like $r^{2} = 0.868$ means that 86.8% of the original uncertainty (the variability of $y$ around its mean) is explained by the linear model.
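To make the definitions concrete, here is a minimal sketch in plain Python that fits a least-squares line and evaluates $S_{r}$, $S_{t}$ and $r^{2}$. The helper names `linear_fit` and `goodness_of_fit` and the seven data points are assumptions made for illustration; they are not from these notes.

```python
def linear_fit(x, y):
    """Least-squares line y = a0 + a1*x; returns (a0, a1)."""
    n = len(x)
    x_mean = sum(x) / n
    y_mean = sum(y) / n
    sxy = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y))
    sxx = sum((xi - x_mean) ** 2 for xi in x)
    a1 = sxy / sxx
    a0 = y_mean - a1 * x_mean
    return a0, a1


def goodness_of_fit(x, y, a0, a1):
    """Return (S_r, S_t, r2) for the line y = a0 + a1*x."""
    y_mean = sum(y) / len(y)
    # S_r: squared residuals around the fitted line
    s_r = sum((yi - a0 - a1 * xi) ** 2 for xi, yi in zip(x, y))
    # S_t: squared residuals around the mean (before regression)
    s_t = sum((yi - y_mean) ** 2 for yi in y)
    r2 = (s_t - s_r) / s_t
    return s_r, s_t, r2


# Illustrative data (an assumption, not taken from the notes).
x = [1, 2, 3, 4, 5, 6, 7]
y = [0.5, 2.5, 2.0, 4.0, 3.5, 6.0, 5.5]

a0, a1 = linear_fit(x, y)
s_r, s_t, r2 = goodness_of_fit(x, y, a0, a1)
print(f"S_r = {s_r:.4f}, S_t = {s_t:.4f}, r^2 = {r2:.3f}")
# S_r = 2.9911, S_t = 22.7143, r^2 = 0.868
```

For this data the line removes about 86.8% of the scatter that was present around the mean.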

It’s still important to check results visually, as $r^{2}$ and $r$ can trick you!
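One way to see how $r^{2}$ can mislead is to fit a straight line to data that is not linear at all. The sketch below uses a made-up, exactly quadratic data set ($y = x^{2}$) and a hypothetical helper `fit_and_r2`; both are assumptions for illustration only.

```python
def fit_and_r2(x, y):
    """Fit y = a0 + a1*x by least squares and return (a0, a1, r2)."""
    n = len(x)
    x_mean = sum(x) / n
    y_mean = sum(y) / n
    sxy = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y))
    sxx = sum((xi - x_mean) ** 2 for xi in x)
    a1 = sxy / sxx
    a0 = y_mean - a1 * x_mean
    s_r = sum((yi - a0 - a1 * xi) ** 2 for xi, yi in zip(x, y))
    s_t = sum((yi - y_mean) ** 2 for yi in y)
    return a0, a1, (s_t - s_r) / s_t


# Made-up data: exactly quadratic, so a straight line is the wrong model.
x = list(range(11))
y = [xi ** 2 for xi in x]

a0, a1, r2 = fit_and_r2(x, y)
residuals = [yi - (a0 + a1 * xi) for xi, yi in zip(x, y)]

print(f"r^2 = {r2:.3f}")  # high, despite the wrong model
print("residuals:", [round(e, 1) for e in residuals])
```

The $r^{2}$ value comes out above 0.9, yet the residuals are positive at both ends and negative in the middle. That systematic pattern is obvious in a plot but invisible in the single number.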
