The cvx suite for MATLAB can solve the (seemingly innocent) optimization problem below, but it is rather slow for the large, full matrices I'm working with. I'm hoping this is because using cvx is overkill, and that the problem actually has an analytic solution, or that a clever use of some built-in MATLAB functions can more quickly do the job.
Background: It is well-known that both `x1 = A\b` and `x2 = pinv(A)*b` solve the least-squares problem

    minimize norm(A*x - b)

with the distinction that `norm(x2) <= norm(x1)`. In fact, `x2` is the minimum-norm solution to the problem, so `norm(x2) <= norm(x)` for all possible solutions `x`.
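If it helps to see the minimum-norm property concretely, here is a small NumPy sketch of the same facts (this is my own illustration, not part of the question; `np.linalg.pinv` plays the role of MATLAB's `pinv`):

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((3, 5))   # underdetermined: 3 equations, 5 unknowns
b = rng.standard_normal(3)

# x2 = pinv(A) @ b is the minimum-norm solution (MATLAB: x2 = pinv(A)*b)
x2 = np.linalg.pinv(A) @ b

# Any x2 + n with n in the null space of A solves the system equally well,
# but has strictly larger norm.
_, _, Vt = np.linalg.svd(A)
n = Vt[-1]                        # a null-space direction of A
x_other = x2 + 0.7 * n

res2 = np.linalg.norm(A @ x2 - b)
res_other = np.linalg.norm(A @ x_other - b)
print(res2, res_other)            # equal residuals (up to roundoff)
print(np.linalg.norm(x2), np.linalg.norm(x_other))  # x2 has the smaller norm
```

Because the null-space direction `n` is orthogonal to the row space containing `x2`, adding it leaves the residual unchanged while increasing `norm(x)`.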
Defining `D = norm(A*x2 - b)` (equivalently, `D = norm(A*x1 - b)`), `x2` solves the problem

    minimize norm(x)
    subject to norm(A*x - b) == D
Problem: I'd like to find the solution to

    minimize norm(x)
    subject to norm(A*x - b) <= D + threshold

In words, I don't need `norm(A*x - b)` to be as small as possible, just within a certain tolerance: I want the minimum-norm solution `x` that gets `A*x` within `D + threshold` of `b`.
I haven't been able to find an analytic solution to this problem (something like the pseudoinverse in the classic least-squares case), either on the web or by hand. I've been searching for things like "least squares with nonlinear constraint" and "least squares with threshold".
Any insights would be greatly appreciated, but I suppose my real question is: What is the fastest way to solve this "thresholded" least-squares problem in MATLAB?
Interesting question. I do not know the answer to your exact question, but a working solution is presented below.
Define `res(x) := norm(A*x - b)`.
As you state, `x2` minimizes `res(x)`. In the overdetermined case (typically `A` having more rows than columns), `x2` is the unique minimizer. In the underdetermined case, it is joined by infinitely many others*. However, among all of these, `x2` is the unique one that minimizes `norm(x)`.
To summarize, `x2` minimizes (1) `res(x)` and (2) `norm(x)`, and it does so in that order of priority. In fact, this characterizes (fully determines) `x2`.
But another characterization of `x2` is

    x2 := limit_{e-->0} x_e

where

    x_e := argmin_{x} J(x;e)
    J(x;e) := res(x)^2 + e * norm(x)^2

It can be shown that

    x_e = (A'A + e*I)^{-1} A' b        (eqn a)
It should be appreciated that this characterization of `x2` is quite magical: the limit exists even when `(A'A)^{-1}` does not, and the limit somehow preserves priority (2) from above.
Of course, for finite (but small) `e`, `x_e` will not minimize `res(x)` (instead it minimizes `J(x;e)`). In your terminology, the difference is the threshold. I will rename it to

    gap := res(x_e) - min_{x} res(x)
Decreasing the value of `e` is guaranteed to decrease the value of the gap. Reaching a specific gap value (i.e. the threshold) is therefore easy to achieve by tuning `e`.**
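As a sanity check on this claim, here is a NumPy sketch (my own, not part of the answer) of the Tikhonov-regularized solve, showing the gap shrinking monotonically as `e` decreases:

```python
import numpy as np

rng = np.random.default_rng(1)
A = rng.standard_normal((20, 5))  # overdetermined example
b = rng.standard_normal(20)

x2 = np.linalg.pinv(A) @ b
res_min = np.linalg.norm(A @ x2 - b)

def x_ridge(e):
    # Tikhonov-regularized solution: minimizes norm(A*x-b)^2 + e*norm(x)^2
    n = A.shape[1]
    return np.linalg.solve(A.T @ A + e * np.eye(n), A.T @ b)

gaps = []
for e in [1.0, 0.1, 0.01, 0.001]:
    xe = x_ridge(e)
    gaps.append(np.linalg.norm(A @ xe - b) - res_min)

print(gaps)   # decreasing toward 0 as e shrinks
```

In practice one would tune `e` (e.g. by bisection) until the gap matches the desired threshold.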
This type of modification (adding `norm(x)` to the `res(x)` minimization problem) is known as regularization in the statistics literature (this particular form is Tikhonov regularization, or ridge regression), and is generally considered a good idea for stability, both numerically and with respect to parameter values.
*: Note that `x1` and `x2` only differ in the underdetermined case.

**: It does not even require any heavy computations, because the inverse in (eqn a) is easily computed for any (positive) value of `e` once the SVD of `A` has been computed.
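To make footnote ** concrete, here is a NumPy sketch (again my own illustration) of reusing a single SVD across many values of `e`. With `A = U*S*V'`, the regularized solution reduces to scaling the singular values, so each new `e` costs only a few vector operations:

```python
import numpy as np

rng = np.random.default_rng(2)
A = rng.standard_normal((20, 5))
b = rng.standard_normal(20)

# One SVD up front...
U, s, Vt = np.linalg.svd(A, full_matrices=False)
Utb = U.T @ b

def x_ridge_svd(e):
    # x_e = V * diag(s / (s^2 + e)) * U' * b -- cheap for each new e
    return Vt.T @ ((s / (s**2 + e)) * Utb)

# ...agrees with the direct solve of (A'A + e*I) x = A'b
e = 0.05
x_direct = np.linalg.solve(A.T @ A + e * np.eye(A.shape[1]), A.T @ b)
print(np.allclose(x_ridge_svd(e), x_direct))
```

This is what makes the tuning loop over `e` cheap: the expensive factorization is done once, outside the loop.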