linear regression
given a training data set comprising
observations
, where
, together with corresponding target values
, the goal is to predict the value of
for a new value of
with a predictive hypothesis function
:
but to make our hypothesis applicable to more function spaces we extend this linear function with a set of
fixed, non-linear basis functions
such that
for the bias parameter, such that
this gets rid of the restrictions we would be imposing by using linear functions of
, this is however still a linear function of
which eases analysis.
here,
is the weight matrix which we hope would represent a linear transformation that transforms a vector of input features
into an output vector
.
[cite:@ml_bishop_2006 section 3.1 linear basis function models]
we start with a random point in here,
[cite:@ml_bishop_2006 section 3.1 linear basis function models]