Regularization#

Impose certain size contraints on weights

Why?#

Might not want too much importance attatched to certain features

L1#

lasso

prefers sparse vectors with some large weights

L2#

ridge

prefers many small weights