By minimizing the above equation with respect to the parameters \( \boldsymbol{\theta} \), we can obtain an analytical expression for the optimal values of \( \boldsymbol{\theta} \). We can add a regularization parameter \( \lambda \) by defining a new cost function to be optimized, that is
$$ {\displaystyle \min_{\boldsymbol{\theta}\in {\mathbb{R}}^{p}}}\frac{1}{n}\vert\vert \boldsymbol{y}-\boldsymbol{X}\boldsymbol{\theta}\vert\vert_2^2+\lambda\vert\vert \boldsymbol{\theta}\vert\vert_2^2 $$

which leads to the Ridge regression minimization problem. Equivalently, one can formulate it as a constrained problem where we require that \( \vert\vert \boldsymbol{\theta}\vert\vert_2^2\le t \), with \( t \) a finite number larger than zero. We do not include such a constraint in the discussion here.
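Setting the gradient of this cost function to zero gives the closed-form solution \( \boldsymbol{\theta}_{\mathrm{Ridge}} = (\boldsymbol{X}^T\boldsymbol{X} + n\lambda\boldsymbol{I})^{-1}\boldsymbol{X}^T\boldsymbol{y} \); note that many texts absorb the factor \( n \) into \( \lambda \) and write \( (\boldsymbol{X}^T\boldsymbol{X} + \lambda\boldsymbol{I})^{-1}\boldsymbol{X}^T\boldsymbol{y} \). The following is a minimal Python sketch (variable names and the synthetic data are our own choices, not taken from the text) comparing the analytical ordinary least squares and Ridge solutions:

```python
import numpy as np

# Synthetic data: n samples, p features, known coefficients plus small noise
rng = np.random.default_rng(2024)
n, p = 100, 3
X = rng.normal(size=(n, p))
theta_true = np.array([1.5, -2.0, 0.5])
y = X @ theta_true + 0.1 * rng.normal(size=n)

lmbda = 0.1        # regularization parameter lambda
I = np.eye(p)

# OLS: solve (X^T X) theta = X^T y
theta_ols = np.linalg.solve(X.T @ X, X.T @ y)

# Ridge: solve (X^T X + n*lambda*I) theta = X^T y, matching the
# cost function above with its 1/n prefactor on the data term
theta_ridge = np.linalg.solve(X.T @ X + n * lmbda * I, X.T @ y)

print("OLS:  ", theta_ols)
print("Ridge:", theta_ridge)
```

Using `np.linalg.solve` rather than explicitly inverting \( \boldsymbol{X}^T\boldsymbol{X} + n\lambda\boldsymbol{I} \) is numerically preferable; the added diagonal term also makes the matrix invertible even when \( \boldsymbol{X}^T\boldsymbol{X} \) is singular.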