## 11.9 Kernel Ridge Regression Revisited

Kernel ridge regression was introduced in Section 11.7. Here, it is restated in its dual representation. Ridge regression in its primal representation can be cast as

$$
\begin{aligned}
\underset{\boldsymbol{\theta},\,\boldsymbol{\xi}}{\text{minimize}}\quad & J(\boldsymbol{\theta},\boldsymbol{\xi})=\sum_{n=1}^{N}\xi_n^2 + C\|\boldsymbol{\theta}\|^2,\\
\text{subject to}\quad & y_n-\boldsymbol{\theta}^{T}\boldsymbol{x}_n=\xi_n,\quad n=1,2,\ldots,N,
\end{aligned}
$$

(11.51)

which leads to the following Lagrangian:

$$
L(\boldsymbol{\theta},\boldsymbol{\xi},\boldsymbol{\lambda})=\sum_{n=1}^{N}\xi_n^2 + C\|\boldsymbol{\theta}\|^2 + \sum_{n=1}^{N}\lambda_n\left(y_n-\boldsymbol{\theta}^{T}\boldsymbol{x}_n-\xi_n\right).
$$

(11.52)

Differentiating with respect to θ and ξn, n = 1,2,…,N, and equating to zero, we obtain

$$
\boldsymbol{\theta}=\frac{1}{2C}\sum_{n=1}^{N}\lambda_n\boldsymbol{x}_n
$$

(11.53)

and

$$
\xi_n=\frac{\lambda_n}{2},\quad n=1,2,\ldots,N.
$$

(11.54)
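For completeness, the dual representation follows by eliminating the primal variables: differentiation with respect to $\xi_n$ gives $\xi_n = \lambda_n/2$, and substituting this together with (11.53) into the equality constraints yields a linear system in the Lagrange multipliers alone,

$$
y_n = \frac{1}{2C}\sum_{m=1}^{N}\lambda_m\,\boldsymbol{x}_m^{T}\boldsymbol{x}_n + \frac{\lambda_n}{2},\quad n=1,2,\ldots,N,
\qquad\Longrightarrow\qquad
\boldsymbol{\lambda} = 2C\left(K + CI\right)^{-1}\boldsymbol{y},
$$

where $K$ is the Gram matrix with entries $K_{nm}=\boldsymbol{x}_n^{T}\boldsymbol{x}_m$. In the kernel version, each inner product $\boldsymbol{x}_m^{T}\boldsymbol{x}_n$ is replaced by a kernel evaluation $\kappa(\boldsymbol{x}_m,\boldsymbol{x}_n)$.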

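The dual solution above can be sketched in a few lines of NumPy. The sketch below (function names are illustrative, and a Gaussian kernel is assumed for concreteness) solves for the dual coefficients $a_n=\lambda_n/(2C)$ via $\boldsymbol{a}=(K+CI)^{-1}\boldsymbol{y}$ and predicts with $f(\boldsymbol{x})=\sum_n a_n\,\kappa(\boldsymbol{x}_n,\boldsymbol{x})$:

```python
import numpy as np

def rbf_kernel(X1, X2, sigma=1.0):
    # Gaussian (RBF) kernel matrix between the rows of X1 and X2
    d2 = ((X1[:, None, :] - X2[None, :, :]) ** 2).sum(-1)
    return np.exp(-d2 / (2.0 * sigma ** 2))

def kernel_ridge_fit(X, y, C=1.0, sigma=1.0):
    # Dual coefficients a = (K + C I)^{-1} y, i.e., a_n = lambda_n / (2C)
    K = rbf_kernel(X, X, sigma)
    return np.linalg.solve(K + C * np.eye(len(X)), y)

def kernel_ridge_predict(X_train, a, X_test, sigma=1.0):
    # f(x) = sum_n a_n * kappa(x_n, x)
    return rbf_kernel(X_test, X_train, sigma) @ a

# Illustrative usage: fit a noiseless sine curve
rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, (40, 1))
y = np.sin(X[:, 0])
a = kernel_ridge_fit(X, y, C=1e-3, sigma=1.0)
pred = kernel_ridge_predict(X, a, X, sigma=1.0)
```

Note that the only matrix inverted has size $N \times N$; the dimensionality of $\boldsymbol{x}$ never enters, which is what makes the kernelized version practical in high (or infinite) dimensional feature spaces.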