Questions about the Lsqnonlin function

Question

ma Jack on 2 Jul 2022

0
Link

Direct link to this question

https://www.mathworks.com/matlabcentral/answers/1752230-questions-about-the-lsqnonlin-function

Commented: ma Jack on 6 Jul 2022

Accepted Answer: John D'Errico

Open in MATLAB Online

2022/7/2

Hi all,

Are there any experts here who use the Lsqnonlin function?I'm trying to use the Lsqnonlin function to optimize a more complex function so that its result converges to 0 (i.e. I expect to find an input value that makes the function result most closely converge to 0),But it reports an error, it says error using eig function because the input matrix contains NAN or inf, I don't know how to fix it, can anyone give me an answer? thanks in advance!

By the way the function I want to optimize is a determinant, I found that as long as I reduce the size of the determinant to 4*4, Lsqnonlin does not report an error, but as soon as I make the size of the determinant bigger, it reports an error.

The following is the error information：

Wrong use of eig

The input matrix contains NaN or Inf.

error trust (line 33)

[V,D] = eig(H);

error trdog (line 132)

st = trust(rhs,MM,delta);

error snls

error lsqncommon (line 169)

snls(funfcn,xC,lb,ub,flags.verbosity,options,defaultopt,initVals.F,initVals.J,caller,

...

error lsqnonlin (line 258)

lsqncommon(funfcn,xCurrent,lb,ub,options,defaultopt,allDefaultOpts,caller,...

2022/7/3

Hi all,

This is my code, it is a function file not a script file, where Zero_seek_Main is the main function, the other functions are sub-functions (in order to construct the matrix).

When I reduce the size of the GvXY matrix (4*4 and below), lsqnonlin does not report an error, which is a bit strange.

Looking forward to any reply.

function []=Zero_seek_Main()
clear;clc;close all
%=====================
d=500*1e-9;
rx=d/4;
ry=rx;
num_Re=5;
num_Im=5;
%====================
%==========%
lsqoptions = optimoptions(@lsqnonlin,'Display','none');
%==========%
Real_w0=linspace(0.8,1.2,num_Re);
Imag_w0=linspace(-0.2,0.2,num_Im);
Result_stoXY=zeros(num_Re*num_Im,1);
count=0;
for uu=1:num_Re
        
        for vv=1:num_Im
            count=count+1;
            
            mark=count/(num_Re*num_Im)
            
            Re_w=Real_w0(uu);
            Im_w=Imag_w0(vv);
            w=Re_w+1j*Im_w;
            
            
            FUNXY=@(x)det(GvXY(x,d,rx,ry));
            
            %======================
            [find_x,~,~,~]=lsqnonlin(FUNXY,w,[],[],lsqoptions);
            Result_stoXY(count,1)=find_x;
            %======================
        end
end
     
figure
Result_index=[1:length(Result_stoXY)].';
plot(Result_index,Result_stoXY,'bo')
            
            
            
end
function [ModeXY]=GvXY(w,d,rx,ry)
c0=299792458;%light
wsp=3.742*1e15;%rad/s
eps0=8.8541*1e-12;
k=w*wsp/c0;
M1=RegSigGtXY(k,d);%size:2*2
M2=SigGtXY(k,rx,ry,d);%size:2*2
ModeXY=[M1,M2,M2,M2;...
                  M2,M1,M2,M2;...
                  M2,M2,M1,M2;...
                  M2,M2,M2,M1];%size:(2*4,2*4)
ModeXY=(k^2)/eps0.*ModeXY;
ModeXY(isnan(ModeXY))=0;%set NAN to 0
ModeXY(isinf(ModeXY))=0;%set INF to 0
%ModeXY=ModeXY(1:4,1:4);%If I enable this command. Then lsqnonlin will not 
%report an error, if I increase the 
%size of the matrix (for example, 5*5) lsqnonlin will report an error
end
function [o4]=RegSigGtXY(k,d)
N=1;
index_sto=zeros((2*N+1)^2-1,2);
cc=0;
for m=-N:N
    for n=-N:N
        
        if m==0&&n==0
         cc=cc;
        else
        cc=cc+1;
        
        index_sto(cc,1:2)=[m,n];
        end
        
    end
end
Sum_sto11=zeros((2*N+1)^2-1,1);%xx
Sum_sto22=zeros((2*N+1)^2-1,1);%yy
Sum_sto12=zeros((2*N+1)^2-1,1);%xy
Sum_sto21=zeros((2*N+1)^2-1,1);%yx
parfor ii=1:size(index_sto,1)
m=index_sto(ii,1);
n=index_sto(ii,2);
%==================
nR=abs(m*d+1j*n*d);
Rx=m*d;
Ry=n*d;
B=BB(k,nR);
A=AA(k,nR);
term=exp(1j*k*nR)/(4*pi*nR);
%==================
Sum_sto11(ii,1)=B/(nR^2)*(Rx^2)*term+A*term;
Sum_sto22(ii,1)=B/(nR^2)*(Ry^2)*term+A*term;
Sum_sto12(ii,1)=B/(nR^2)*Rx*Ry*term;
Sum_sto21(ii,1)=B/(nR^2)*Ry*Rx*term;
end
o4=zeros(2,2);
o4(1,1)=sum(Sum_sto11);
o4(2,2)=sum(Sum_sto22);
o4(1,2)=sum(Sum_sto12);
o4(2,1)=sum(Sum_sto21);
end
function [o3]=SigGtXY(k,rx,ry,d)
N=1;
index_sto=zeros((2*N+1)^2,2);
cc=0;
for m=-N:N
    for n=-N:N
        cc=cc+1;
        
        index_sto(cc,1:2)=[m,n];
    end
end
Sum_sto11=zeros((2*N+1)^2,1);%xx
Sum_sto22=zeros((2*N+1)^2,1);%yy
Sum_sto12=zeros((2*N+1)^2,1);%xy
Sum_sto21=zeros((2*N+1)^2,1);%yx
parfor ii=1:size(index_sto,1)
m=index_sto(ii,1);
n=index_sto(ii,2);
%=====================
nR=abs(m*d+n*d*1j+rx+ry*1j);
Rx=m*d+rx;
Ry=n*d+ry;
Term=exp(1j*k*nR)/(4*pi*nR);
A=AA(k,nR);
B=BB(k,nR);
%=====================
Sum_sto11(ii,1)=B/(nR^2)*(Rx^2)*Term+A*Term;
Sum_sto22(ii,1)=B/(nR^2)*(Ry^2)*Term+A*Term;
Sum_sto12(ii,1)=B/(nR^2)*Rx*Ry*Term;
Sum_sto21(ii,1)=B/(nR^2)*Ry*Rx*Term;
end
o3=zeros(2,2);
o3(1,1)=sum(Sum_sto11);
o3(2,2)=sum(Sum_sto22);
o3(1,2)=sum(Sum_sto12);
o3(2,1)=sum(Sum_sto21);
end
function [o1]=AA(k,R)
o1=1+(1j*k*R-1)/((k*R)^2);
end
function [o2]=BB(k,R)
o2=(3-3*1j*k*R-(k*R)^2)/((k*R)^2);
end

2022/7/3 pm

Hi all,

Some people have asked me why I require the determinant of the matrix, and the explanation is as follows:

Obviously, it is a characteristic equation,But note that the eigenvector [a;b;c;d] is unknown, and omega is also unknown, in order to ensure that the eigenvector [a;b;c;d] is not [0;0;0;0], so I need to make det(Gv(w)-lam(w). *eye(4))=0, or converge to 0, because in practice I found it impossible to find the right omega to make det=0.And also I found that fsolve is not as effective as lsqnonlin, or not effective at all.

Dear experienced experts, how do you solve the above problems in your actual projects? It should be very common in practice, right?

14 Comments
Show 12 older commentsHide 12 older comments

Torsten on 5 Jul 2022

I don't understand what you mean.

Searching for a vector x=[a b c d] with norm 1 excludes the case of the trivial solution x = 0.

ma Jack on 6 Jul 2022

Sorry sir, I didn't describe the problem clearly, the vectors [a;b;c;d] are completely unknown (no expressions), only Gv(w) and lambda(w) are known (with specific formulas)

Sign in to comment.

Sign in to answer this question.

Answer 1

John D'Errico on 2 Jul 2022

3
Link

Direct link to this answer

https://www.mathworks.com/matlabcentral/answers/1752230-questions-about-the-lsqnonlin-function#answer_998740

Edited: John D'Errico on 2 Jul 2022

Open in MATLAB Online

This is NOT a problem with lsqnonlin, but in your understanding of determinants and why they are a bad thing to use.

You misunderstand the problem. You want to compute a determinant. Well, you THINK you want to do that. But determinants are a classically BAD thing to work with. They are numerically unstable for a variety of reasons. For example, do you agree that the determinant of

A = eye(1200);

has no infs or NaNs in it? Does it return something meaningful for the determinant?

det(A)
ans = 1

WOW!. It does. And that is exactly what I would have expected. Next, if we just multiply A by 2, does B have any infs or NaNs in it? (I hope not.)

B = 2*A;
det(B)
ans = Inf

Surely B had nothing in it that caused the problem. Could it have been lsqnonlin?

How about C?

C = A/2;
det(C)
ans = 0

Are any of those matrices singular, thus A, B, or C? Of course not. In fact, they are all extremely well behaved.

So, it must have been lsqnonlin that caused this problem? But, wait. I never used lsqnonlin here. I wonder what happened?

The problem in the above examples is that determinants have nasty scaling problems. But they are also bad things to use to use in general, even for smaller matrices. For example, consider this simple, smaller matrix.

D = rand(7,6)*1000;
D = D*rand(6,7);

I KNOW, ABSOLUTELY KNOW that this matrix is singular. It must have a zero determinant, right?

det(D)
ans = -4.9378e+03

And that was just a 7x7 matrix, composed of moderately small entries. Nothing too large. Nothing terribly small. But the determinat came out being pretty darn large, compared to the zero number you would be looking to find.

The point being that determinants are NOT a good way to decide if a matrix is singular. They are NOT a good thing to use for whatever your task may be.

Yes, I know, your teachers told you to use determinants. Their teachers told them the same thing. But nobody ever explained that in terms of numerical compuutations, the things are just useless, at least when double precision arithmetic is used to compute them. In fact, they are not just useless, but actively bad because they lead people to do silly things like this. That was the class your teachers probably forgot to give, even if they knew somewhere in the backs of their minds that sometimes there are problems. And of course, everything works great when you compute the determinant of a 3x3 integer matrix in that homework assignment for class. So what could possibly go wrong?

Don't believe me? Try this one, this time using integers. The nice thing is now I can use sym to compute the determinant too.

E = randi(10,[10,9])*randi(10,[9,10])
E = 10×10
 200   271   311   399   261   412   424   239   373
 161   222   248   275   220   269   260   227   289
 182   216   238   240   177   282   227   150   271
 222   299   307   392   298   431   408   300   380
 188   299   283   378   255   412   391   259   377
 139   170   236   226   131   232   272   148   290
 164   242   275   261   187   294   288   225   336
 246   349   361   458   334   460   436   310   442
 178   239   289   346   225   330   346   260   341
 235   258   291   345   256   325   294   255   344

Again, EXACT INTEGERS. But the way I created the matrix, I KNOW that it is singular. It has rank at most 9, for a 10x10 matrix since it is the product of 10x9 and 9x10 matrices.

Now compute the determinant in double precision arithmetic, and then using symbolic tools.

det(E)
ans = -26.3364
det(sym(E))
ans = 0

Do you see the difference? You CANNOT effectively use a determinant for such a task, searching for a zero. They are poorly behaved, almost always.

Again, ABSOLUTELY NOTHING to do with lsqnonlin.

We can understand what is happening, if we look at an LU decomposition of these matrices.

[L,U] = lu(E);
format long g
diag(U)
ans = 10×1
                       319
          56.0909090909091
          27.6457273794221
          42.1032597818726
         -96.2713298870711
          -30.455195763015
           56.635421538508
         -20.1280502645681
         -29.7973765940712
     -1.26970747187275e-14

Remember that L and U are lower and upper triangular matrices. As well, the diagonal elements of L are all exactly 1. So the determinant of E is just the product of the diagonal elements of U. Do you see that one of those numbers is not exactly zero, but only close to it. As close as we can get in double precision arithmetic.

prod(diag(U))
ans = 
         -26.3364250073676

So even though we know that E is singular, the product of those diagonal elements is pretty far away from zero. And so was the determinant.

Again, I don't know why you think you need to compute a determinant here. You don't want to do it. No matter what someone told you, or what you remember from some barely remembered class, you don't want to do it. Perhaps you can show what you are trying to do, then we could offer some advice on how to do it better. Lacking that, all I can say is DON'T DO WHAT YOU ARE DOING.

1 Comment
Show -1 older commentsHide -1 older comments

ma Jack on 3 Jul 2022

Thank you very much for your detailed answer sir, I can't believe that all these are true, it seems that my math knowledge is still too poor. Also I have updated my answer, it includes my code and why I asked for a value so that det(Matrix) = 0 (or tends to 0).

Sign in to comment.

Questions about the Lsqnonlin function

14 Comments
Show 12 older commentsHide 12 older comments

Accepted Answer

1 Comment
Show -1 older commentsHide -1 older comments

More Answers (0)

See Also

Categories

Tags

Products

Release

Community Treasure Hunt

Questions about the Lsqnonlin function

14 Comments Show 12 older commentsHide 12 older comments

Accepted Answer

1 Comment Show -1 older commentsHide -1 older comments

More Answers (0)

See Also

Categories

Tags

Products

Release

Community Treasure Hunt

14 Comments
Show 12 older commentsHide 12 older comments

1 Comment
Show -1 older commentsHide -1 older comments