Mean Squared Error (Mse)
- AKA MSE
- AKA L2-loss
- MSE is used in Regression problems
- Sensitive to large errors for the squaring
- Gradient decreases as loss gets closer to 0
[!def] Mean Squared Error Formula
$$
MSE = \frac{1}{n} \sum_{i=1}^n (Y_i - \hat{Y}_i)^2
$$
Pros:
- Convex function, so one Global Minima
- No Local Minima
- If error is big, penalizes higher by squaring them
- Good for predicting outliers as give more weight to them
Cons:
- Bad at Handling Outliers
- Same error for one big error and many small errors