Consider the following example. Because the measurements are conditionally independent given theta, formula (1) can be broken down into formula (2).
Note that we did not solve Ax=b. In general it has no exact solution because of noise, so instead we solved the least-squares problem argmin_x ||Ax-b||^2.
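To make the difference between solving Ax=b and minimising the squared residual concrete, here is a minimal sketch in plain Python. The matrix A, the vector b, and the helper names least_squares_2 and residuals are made up for illustration and are not from the example above. It assumes A has exactly two columns, so the normal equations (A^T A) x = A^T b reduce to a 2x2 system that can be solved by hand.

```python
def least_squares_2(A, b):
    """Solve argmin_x ||Ax - b||^2 for a two-column A via the normal equations."""
    # Entries of the symmetric 2x2 matrix A^T A.
    s00 = sum(row[0] * row[0] for row in A)
    s01 = sum(row[0] * row[1] for row in A)
    s11 = sum(row[1] * row[1] for row in A)
    # Entries of the vector A^T b.
    t0 = sum(row[0] * bi for row, bi in zip(A, b))
    t1 = sum(row[1] * bi for row, bi in zip(A, b))
    det = s00 * s11 - s01 * s01
    if det == 0:
        raise ValueError("columns of A are linearly dependent")
    # Closed-form inverse of the 2x2 system.
    return [(s11 * t0 - s01 * t1) / det, (s00 * t1 - s01 * t0) / det]


def residuals(A, x, b):
    """Return Ax - b, one entry per measurement."""
    return [row[0] * x[0] + row[1] * x[1] - bi for row, bi in zip(A, b)]


# Hypothetical data: three noisy measurements, two unknowns (overdetermined).
A = [[1.0, 0.0], [1.0, 1.0], [1.0, 2.0]]
b = [1.1, 1.9, 3.2]

x = least_squares_2(A, b)
res = residuals(A, x, b)
print("x =", x)
print("residuals =", res)
```

The residuals are not zero, so x does not satisfy Ax=b exactly. It is only the x that makes the sum of squared residuals as small as possible. For larger problems a library routine such as numpy.linalg.lstsq would normally be used instead of hand-written normal equations.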
Note that the smaller the variance of a measurement, the larger its coefficient, so that measurement influences the solution more.
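The effect of the variance on the coefficient can be sketched with the simplest case: a scalar theta measured directly several times. This assumes independent Gaussian noise with known variances, which may be narrower than the example above. Under that assumption each measurement gets the coefficient 1/variance. The function name fuse and the numbers are hypothetical.

```python
def fuse(measurements, variances):
    """Inverse-variance weighted estimate of a scalar theta.

    Assumes each measurement is theta plus independent zero-mean Gaussian
    noise with the given known variance.
    """
    # Smaller variance gives a larger weight.
    weights = [1.0 / v for v in variances]
    weighted_sum = sum(w * z for w, z in zip(weights, measurements))
    return weighted_sum / sum(weights)


# Hypothetical data: two measurements of the same quantity.
z = [10.0, 12.0]

# First measurement is the more precise one, so the estimate sits near 10.
theta_a = fuse(z, [1.0, 4.0])

# Swap the variances and the estimate moves towards 12 instead.
theta_b = fuse(z, [4.0, 1.0])

print(theta_a, theta_b)
```

With equal variances the weights cancel and the estimate is the plain average of the measurements.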