SLIDE 1
Notes on Error Propagation in Linear Systems
CS3220 Summer 2008 - Jonathan Kaldor Up to this point, we have talked about solving n × n linear systems Ax = b where A is of full rank. We have talked about, in passing, ”difficult” systems to solve and made oblique mentions of matrices that are ”nearly singular”, but to this point we have assumed that as long as our matrix is not strictly singular, we can compute the answer by applying the LU
- factorization. Although this is true in a mathematical sense, what we will look at today
are linear systems that, although technically nonsingular and thus invertible, have solutions that are highly dependent on small changes in the values of the known components (i.e. the entries of A and b). The technical term used to describe this is called ”conditioning” - we say that a problem is well-conditioned when the answer does not change much for small perturbations in the inputs, and correspondingly a problem is ill-conditioned when small changes in the inputs produce large changes in the answer. Before we get into the details, why do we need to consider error? To begin with, oftentimes
- ur inputs come from experimental measurements, and there may be sources of error or
inaccuracies; for instance, we may only be confident in our inputs to 3 or 4 significant
- digits. Beyond that, the introduction of floating point numbers introduces additional, albeit
relatively small, errors. Although the propagation of error due to floating point arithmetic is an interesting topic in its own right, for the most part we will be considering errors in the input numbers only. Lets look at an example to make this concrete. Take the linear system 1 1 1 1.0001
- x =
1 1
- We can see that the answer to this system is x =
1
- . Suppose that our right hand side
is perturbed slightly, so instead we have b′ = 1.0001 0.9999
- . Our solution then becomes x′ =
3.0001 −2
- . Introducing an error of magnitude approximately 0.00014 in our inputs has
resulted in a change of magnitude 2 to 3 in each of the components of the computed answer. Even if we compute the introduced error in a relative sense - x′−x2
x2
compared to b′−b2
b2
- we see that we have introduced a small relative error in our right hand side that results in