You don't necessarily want the optimal solution, L2 regularization can give you ...

		elcomet on April 26, 2023 \| parent \| context \| favorite \| on: Double Descent in Human Learning You don't necessarily want the optimal solution, L2 regularization can give you a slightly worse solution on the training set but that will be better at generalization on unseen data.