使用批量梯度下降的错误权重

Question

我正在使用二维数据进行线性回归，但无法获得回归线的正确权重。下面的代码似乎有问题，因为回归线的计算权重不正确。使用太大的数据值（x 约为 80000）会导致权重为 NaN。将数据从 0 缩放到 1，会导致权重错误，因为回归线与数据不匹配。

function [w, epoch_batch, error_batch] = batch_gradient_descent(x, y)

% number of examples
q = size(x,1);

% learning rate
alpha = 1e-10;

w0 = rand(1);
w1 = rand(1);

curr_error = inf;
eps = 1e-7;

epochs = 1e100;
epoch_batch = 1;
error_batch = inf;
for epoch = 1:epochs
    prev_error = curr_error;
    curr_error = sum((y - (w1.*x + w0)).^2);
    w0 = w0 + alpha/q * sum(y - (w1.*x + w0));
    w1 = w1 + alpha/q * sum((y - (w1.*x + w0)).*x);
    if ((abs(prev_error - curr_error) < eps))
        epoch_batch = epoch;
        error_batch = abs(prev_error - curr_error);
        break;
    end
end

w = [w0, w1];

你能告诉我我在哪里犯了错误吗，因为对我来说，经过数小时的尝试，它似乎是正确的。

数据：

这是绘制数据的代码：

figure(1)
% plot data points
plot(x, y, 'ro');
hold on;
xlabel('x value');
ylabel('y value');
grid on;

% x vector from min to max data point
x = min(x):max(x);
% calculate y with weights from batch gradient descent
y = (w(1) + w(2)*x);
% plot the regression line
plot(x,y,'r');

可以使用较小的学习率 alpha = 1e-10 找到未缩放数据集的权重。但是，当将数据从 0 缩放到 1 时，我仍然无法获得匹配的权重。

scaled_x =

scaled_y_en =

Answer 1

问题出在 w1，因为您赋予它的权重过大。你不应该给 w0 和 w1 相同的学习步骤，因为一个不乘以 x。

如果我用 alpha^4/q 代替 alpha/q（因为是随机选择），那么它会收敛：

使用批量梯度下降的错误权重

Wrong weights using batch gradient descent

matlab

machine-learning

linear-regression

gradient-descent