Perform backward elimination to remove the least predictive variable from the model, returning the variable to eliminate, the new parameter vector, the new R-squared value and the new F statistic.
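One elimination step can be sketched as follows. This is a minimal, language-neutral illustration in plain Python, not the class's own code. The names `solve`, `fit` and `backward_elim` are hypothetical, and the selection rule (drop the variable whose removal leaves the highest R-squared) is an assumption about what "least predictive" means here. It also assumes column 0 of x is the intercept, x has full column rank, and at least one predictor remains after the drop.

```python
def solve(a, rhs):
    """Solve a * v = rhs by Gaussian elimination with partial pivoting."""
    n = len(a)
    m = [row[:] + [r] for row, r in zip(a, rhs)]        # augmented matrix
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(m[r][c]))
        m[c], m[p] = m[p], m[c]
        for r in range(c + 1, n):
            factor = m[r][c] / m[c][c]
            for k in range(c, n + 1):
                m[r][k] -= factor * m[c][k]
    v = [0.0] * n
    for r in range(n - 1, -1, -1):                      # back substitution
        v[r] = (m[r][n] - sum(m[r][k] * v[k] for k in range(r + 1, n))) / m[r][r]
    return v

def fit(x, y):
    """Least squares via the normal equations; returns (b, r_squared)."""
    cols = len(x[0])
    xtx = [[sum(row[i] * row[j] for row in x) for j in range(cols)] for i in range(cols)]
    xty = [sum(row[i] * yi for row, yi in zip(x, y)) for i in range(cols)]
    b = solve(xtx, xty)
    y_hat = [sum(bi * xi for bi, xi in zip(b, row)) for row in x]
    mean = sum(y) / len(y)
    sse = sum((yi - yh) ** 2 for yi, yh in zip(y, y_hat))
    sst = sum((yi - mean) ** 2 for yi in y)
    return b, 1.0 - sse / sst

def backward_elim(x, y):
    """Try dropping each non-intercept column; keep the drop with the best R-squared."""
    n, cols = len(x), len(x[0])
    best = None
    for j in range(1, cols):                            # never drop the intercept
        reduced = [row[:j] + row[j + 1:] for row in x]
        b, r2 = fit(reduced, y)
        if best is None or r2 > best[2]:
            best = (j, b, r2)
    j, b, r2 = best
    k = cols - 2                                        # predictors left, excluding intercept
    f = (r2 / k) / ((1.0 - r2) / (n - k - 1))           # F statistic of the reduced model
    return j, b, r2, f

# Made-up data: y follows x1 closely, while x2 carries almost no information.
x = [[1.0, 1.0, 2.0], [1.0, 2.0, 1.0], [1.0, 3.0, 2.0],
     [1.0, 4.0, 1.0], [1.0, 5.0, 2.0], [1.0, 6.0, 1.0]]
y = [2.1, 3.9, 6.2, 7.8, 10.1, 11.9]
dropped, b_new, r2_new, f_new = backward_elim(x, y)
```

In this made-up data set the step drops column 2 (x2), since the model with only x1 still explains nearly all of the variance in y.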
Return the fit (parameter vector b, quality of fit rSquared).
Show the flaw by printing the error message.
the method where the error occurred
the error message
Predict the value of y = f(z) by evaluating the formula y = b dot z for each row of matrix z.
the new matrix to predict
Predict the value of y = f(z) by evaluating the formula y = b dot z, e.g., (b0, b1, b2) dot (1, z1, z2).
the new vector to predict
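Both prediction forms described above reduce to dot products. The following is a minimal sketch in plain Python. The function names `predict` and `predict_all` and the sample values are illustrative assumptions, not the class's actual signatures.

```python
def predict(b, z):
    """Evaluate y = b dot z for one vector z, e.g. (b0, b1, b2) dot (1, z1, z2)."""
    return sum(bi * zi for bi, zi in zip(b, z))

def predict_all(b, zs):
    """Evaluate y = b dot z for each row z of the matrix zs."""
    return [predict(b, z) for z in zs]

b = [1.0, 2.0, 3.0]                       # assumed fitted parameters (b0, b1, b2)
y1 = predict(b, [1.0, 4.0, 5.0])          # 1*1 + 2*4 + 3*5
ys = predict_all(b, [[1.0, 0.0, 0.0],
                     [1.0, 1.0, 1.0]])
```

The leading 1 in each z multiplies the intercept b0, which is why new data must carry the same leading column of ones as the training matrix.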
Given a new discrete data vector z, predict the y-value of f(z).
the vector to use for prediction
Retrain the predictor by fitting the parameter vector (b-vector) in the multiple regression equation yy = b dot x + e = [b0, ... bk] dot [1, x1, ... xk] + e using the least squares method.
the new response vector
Train the predictor by fitting the parameter vector (b-vector) in the multiple regression equation y = b dot x + e = [b0, ... bk] dot [1, x1, ... xk] + e using the least squares method.
Compute the Variance Inflation Factor (VIF) for each variable to test for multicollinearity by regressing xj against the rest of the variables. A VIF over 10 indicates that over 90% of the variance of xj can be predicted from the other variables, so xj is a candidate for removal from the model.
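The threshold follows from the usual definition VIF_j = 1 / (1 - R²_j), where R²_j is the R-squared from regressing xj on the other columns: R²_j above 0.9 gives a VIF above 10. The following plain Python sketch illustrates the computation. The helper names `solve`, `r_squared` and `vif` are hypothetical, the data is made up, and it assumes column 0 is the intercept and that no xj is an exact linear combination of the others (which would give a division by zero).

```python
def solve(a, rhs):
    """Solve a * v = rhs by Gaussian elimination with partial pivoting."""
    n = len(a)
    m = [row[:] + [r] for row, r in zip(a, rhs)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(m[r][c]))
        m[c], m[p] = m[p], m[c]
        for r in range(c + 1, n):
            factor = m[r][c] / m[c][c]
            for k in range(c, n + 1):
                m[r][k] -= factor * m[c][k]
    v = [0.0] * n
    for r in range(n - 1, -1, -1):
        v[r] = (m[r][n] - sum(m[r][k] * v[k] for k in range(r + 1, n))) / m[r][r]
    return v

def r_squared(x, y):
    """R-squared of the least squares fit of y on the columns of x."""
    cols = len(x[0])
    xtx = [[sum(row[i] * row[j] for row in x) for j in range(cols)] for i in range(cols)]
    xty = [sum(row[i] * yi for row, yi in zip(x, y)) for i in range(cols)]
    b = solve(xtx, xty)
    y_hat = [sum(bi * xi for bi, xi in zip(b, row)) for row in x]
    mean = sum(y) / len(y)
    sse = sum((yi - yh) ** 2 for yi, yh in zip(y, y_hat))
    sst = sum((yi - mean) ** 2 for yi in y)
    return 1.0 - sse / sst

def vif(x):
    """VIF for each non-intercept column j: regress xj on all other columns."""
    result = []
    for j in range(1, len(x[0])):
        others = [row[:j] + row[j + 1:] for row in x]   # keeps the intercept column
        xj = [row[j] for row in x]
        result.append(1.0 / (1.0 - r_squared(others, xj)))
    return result

# x2 is nearly a multiple of x1, so both VIFs come out far above 10.
collinear = [[1.0, 1.0, 2.1], [1.0, 2.0, 3.9], [1.0, 3.0, 6.2],
             [1.0, 4.0, 7.8], [1.0, 5.0, 10.1], [1.0, 6.0, 11.9]]
vif_collinear = vif(collinear)

# x1 and x2 are uncorrelated, so each VIF is 1 (no inflation).
orthogonal = [[1.0, -1.0, -1.0], [1.0, -1.0, 1.0], [1.0, 1.0, -1.0], [1.0, 1.0, 1.0]]
vif_orthogonal = vif(orthogonal)
```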
The Regression class supports multiple linear regression. In this case, 'x' is multi-dimensional [1, x1, ... xk]. Fit the parameter vector 'b' in the regression equation
y = b dot x + e = b0 + b1 * x1 + ... + bk * xk + e
where 'e' represents the residuals (the part not explained by the model). Use least squares (minimizing the sum of squared residuals) to fit the parameter vector
b = x_pinv * y
where 'x_pinv' is the pseudo-inverse. By default QR Decomposition (more robust) is used to compute 'x_pinv', with Gaussian Elimination as an option (set useQR to false).
see.stanford.edu/materials/lsoeldsee263/05-ls.pdf
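For a matrix x with full column rank and factorization x = Q R, the pseudo-inverse is x_pinv = R^-1 Q^T, so b can be found by solving R b = Q^T y with back substitution. The sketch below illustrates this in plain Python using modified Gram-Schmidt. It is an assumption-laden illustration: the class's own QR routine may use a different algorithm (for example Householder reflections), and the name `fit_qr` and the sample data are made up.

```python
def fit_qr(x, y):
    """Least squares fit b = x_pinv * y via a thin QR factorization of x."""
    k = len(x[0])
    q = [[row[j] for row in x] for j in range(k)]       # columns of x, orthonormalized in place
    r = [[0.0] * k for _ in range(k)]                   # upper triangular factor
    for j in range(k):
        for i in range(j):
            # Project out the already-orthonormal direction q[i] (modified Gram-Schmidt).
            r[i][j] = sum(a * c for a, c in zip(q[i], q[j]))
            q[j] = [c - r[i][j] * a for a, c in zip(q[i], q[j])]
        r[j][j] = sum(c * c for c in q[j]) ** 0.5
        q[j] = [c / r[j][j] for c in q[j]]
    qty = [sum(a * yi for a, yi in zip(q[j], y)) for j in range(k)]   # Q^T y
    b = [0.0] * k
    for i in range(k - 1, -1, -1):                      # back substitution on R b = Q^T y
        b[i] = (qty[i] - sum(r[i][j] * b[j] for j in range(i + 1, k))) / r[i][i]
    return b

# Made-up data generated exactly from y = 1 + 2 * x1 + 3 * x2 (no residuals).
x = [[1.0, 1.0, 2.0], [1.0, 2.0, 1.0], [1.0, 3.0, 4.0],
     [1.0, 4.0, 3.0], [1.0, 5.0, 5.0]]
y = [9.0, 8.0, 19.0, 18.0, 26.0]
b = fit_qr(x, y)                                        # approximately [1, 2, 3]
```

The Gaussian Elimination option mentioned above would instead solve the normal equations (x^T x) b = x^T y directly. That route squares the condition number of x, which is the usual reason QR is considered the more robust default.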