Log-cosh Loss
- Similar to Huber Loss but without the hyperparameter $\delta$
- It also behaves like squared error for small values and absolute error for larger values.
[!def] Log-cosh Loss Formula
$$
L = \sum_i log(cosh(\hat{y_i} - y_i))
$$
Pros:
- Benefits of Huber Loss with twice differentiability
- Fewer computation than Huber Loss
Cons:
- Compared to Huber Loss, more complex Derivative