5. Positive Definite and Semidefinite Matrices

MIT OpenCourseWare

zhlédnutí 151 319

Přidat do
- Můj playlist
- Přehrát později
Sdílet

Sdílet

Vložit

Velikost videa:

Zobrazit ovladače přehrávání

Automatické přehrávání

Přehrát

čas přidán 15. 05. 2019
MIT 18.065 Matrix Methods in Data Analysis, Signal Processing, and Machine Learning, Spring 2018
Instructor: Gilbert Strang
View the complete course: ocw.mit.edu/18-065S18
CZcams Playlist: • MIT 18.065 Matrix Meth...
In this lecture, Professor Strang continues reviewing key matrices, such as positive definite and semidefinite matrices. This lecture concludes his review of the highlights of linear algebra.
License: Creative Commons BY-NC-SA
More information at ocw.mit.edu/terms
More courses at ocw.mit.edu

Komentáře • 101

@georgesadler7830 Před 2 lety ⁺¹³
DR. Strang thank you for another classic lecture and selection of examples on Positive Definite and Semidefinite Matrices.
@spoopedoop3142 Před 3 lety ⁺⁹
For everyone asking about the bowl and eigenvalues analogy:
Let X= (x,y) be the input vector (so that I can write X as a vector) and consider the energy functional f(X)=X^t S X. What would happen if we evaluate on the eigenvalues?
First, why would I think to do this? The eigenvectors of the matrix give the "natural coordinates" to express the action of the matrix as a linear transformation, which then gives rise to all the "completing the square" type problems with quadratic forms in usual LA classes. The natural coordinates rotate the quadratic so it doesn't have off-diagonal terms. This means the function changes from something like f(x,y)=3x^2+6y^2+4xy to something like f(x,y)=(x^2+y^2)=(||X||^2), where ||X||^2 denotes the squared norm. So the functional looks like a very nice quadratic in this case, like the ones you may learn how to draw in a multivariate calc course.
Going back to the current calculation which f(X)=X^tSX: if we evaluate in the eigen-directions, then our function becomes f(X_1)=X_1^t S X_1=X_1 lambda_1 X_1= lambda_1 ||X_1||^2 (a nice quadratic) and
f(X_2)=X_2^t S X_2=X_2 lambda_2 X_2= lambda_2 ||X_2||^2 (another nice quadratic). The eigenvalues lambda_1, lambda_2 become scaling coefficients in the eigen-directions. A large scaling coefficient means we have a steep quadratic and a small coefficient means we have a quadratic that is stretched out horizontally.
If the eigenvalue is close to zero, the quadratic functional will almost look like a horizontal plane (really, the tangent plane will be horizontal) and hence not be invertible, so any solver will have difficulty finding a solution due to infinitely many approximate solutions. Since the solver will see a bunch of feasible directions, it will bounce around the argmin vector without being able to confidently declare success. Poor solver. Of course, these are purely mathematical problems; rounding error will probably mitigate the search even further.
Edit: changed "engenvalue" to "eigenvector" in 2nd paragraph.
@mariomariovitiviti Před 3 lety ⁺³⁴
listening to Strang is like getting a brain massage
@CrazyHorse151 Před 3 lety ⁺²
Im only half through one lecture and I already love him. :'D
@PremiDhruv Před 11 měsíci
I was going through a headache, after 15 minutes of his lecture it got evaporated.
@emanueleria8151 Před 2 měsíci
Sure
@sable2x Před 4 lety ⁺⁷³
I wish Strang was my grandfather
@NguyenAn-kf9ho Před 3 lety ⁺⁸
maybe he s not because he will be sad if his grandson s stupid and cannot inverse a matrix.... just kidding XD
@prajwalchoudhary4824 Před 3 lety
@@NguyenAn-kf9ho lol
@hxqing Před 2 lety
Wishing he was and isn't ?
Better wishing he is.
@marekdude Před 3 lety ⁺³³
Positive Semi-Definite matricies: 38.01
@jonasblom6177 Před 2 lety ⁺¹
thanks man
@guythat779 Před 2 lety ⁺¹
What a king
@Luke-ty2ur Před 2 lety ⁺¹
u the man.
@miguiprytoluk Před 2 lety
38:01
@AJ-et3vf Před 2 lety ⁺²
Awesome video sir! Thank you!
@JulieIsMe824 Před 3 lety ⁺⁴
Sooo love Prof. Strang!!
@user-or7ji5hv8y Před 3 lety
Very comprehensive. Thanks
@mohamedlaminbangura3303 Před 4 lety ⁺¹
Great work
@amysun6080 Před 3 lety ⁺⁷
Lecture starts at 2:50
@alexandersanchez6337 Před 2 lety ⁺³
This professor is the platonic version of a professor
@olsela3073 Před 3 lety ⁺¹
Well thanks prof.
@imranq9241 Před 2 lety ⁺²
These are great lectures! Is the autograder and programming assignment available somewhere?
@parthmalik1 Před rokem ⁺¹
yes when u get admitted to MIT u can take up the class and partake in assignments
@binnypatel7061 Před 4 lety ⁺¹
Thanks a lot !
@samirroy1412 Před 4 lety ⁺⁵
I am doing a project on this topic it really helped me a lot..thank you
@samirroy1412 Před 3 lety
@@vishalyadav2958 yes
@samirroy1412 Před 3 lety
U r doing phd or post grad?
@samirroy1412 Před 3 lety
U can follow horn n johnson and strang book... it's relatively easier to understand
@quirkyquester Před 3 lety ⁺⁴
came here from 18.06 fall 2011 Singular value decomposition taught by Professor Strang
@heretoinfinity9300 Před 3 lety ⁺¹
Where was the energy equation mentioned in previous lectures?
@mingliu1940 Před rokem
Thanks professor.
@meenaammma Před 3 lety ⁺¹
Amazing
@justsomerandomguy933 Před 4 lety ⁺⁷
Staring at 22:00, should not we follow in the opposite of the gradient direction to reach minima? Gradient gives the steepest ascent directions as far as I know.
@zma4543 Před 4 lety ⁺²
I think you are right
@jenkinsj9224 Před 2 lety ⁺¹
Yes
@lolololo8606 Před 4 lety ⁺¹
you ate the best
@mdmobarakhossain8993 Před 3 lety ⁺¹
Love you sir .love from India .
@hangli1622 Před rokem ⁺¹
at 41:20, why the rank 1 matrix has 2 zero eigenvalues? because 3 - 1 = 2? does the professor mean that number of zero eigenvalues always equals to nullity of that matrix?
@sriharsha580 Před 4 lety ⁺³
@32:00, Prof mentions "if the eigenvalues are far apart, that's when we have problems". What does he mean by that?
@nguyennguyenphuc5217 Před 4 lety ⁺⁴
He means difference between eigenvalues, |lambda1 - lambda2|, is big, then we have the case where "the bowl is long and thin" he mentions right before that.
@gabrielmachado5708 Před 4 lety ⁺¹
@@nguyennguyenphuc5217, yes, it looks like it would make it easier to miss the point and bounce back and forth around the minimum
@debralegorreta1375 Před 3 lety ⁺³
@@gabrielmachado5708 right. it the bowl is narrow and your descent is slightly off you'll start climbing again.... so we take baby steps.
@MLDawn Před rokem
at 14:18, the energy can also so be EQUAL to 0 (not JUST bigger than 0)! Then does this not mean that the matrix is positive SEMI definite as opposed to positive definite?
@Hank-ry9bz Před měsícem
20:49 gradient descent
@lazywarrior Před 2 lety ⁺¹
who's that eager student answering every question for everyone else on every class?
@aklinvex Před 4 lety ⁺⁸
Who could possibly dislike this?
@allandogreat Před 4 lety ⁺³
who can't understand that.
@quanyingliu7168 Před 5 lety ⁺²
At 41min, Why is the number of nonzero eigenvalues the same as rank(A)?
@fustilarian1 Před 5 lety ⁺⁵
The eigenvectors with non zero eigenvalues must be mapped to somewhere within the column space, in all other directions outside the column space it collapses to 0, bear in mind that the null space vectors are also solutions to Ax=\lambda x where \lambda is 0.
@MoodyG Před 5 lety ⁺³
The answer is at 41:17 ... you notice how we can decompose the matrix into a weighted sum of its eigenvectors.. the weights being the eigenvalues obviously, and since Rank(A) is by definition the number of linearly independent vectors in the column space of A, i.e., it is the same as the number of non-zero terms in the decomposition, which is in turn the number of non-zero eigenvalues
@quanyingliu7168 Před 5 lety
@@fustilarian1 Thanks for your explanation. That's very helpful.
@csl1384 Před 5 lety ⁺⁴
Where can I find the online homework? I can't find it in OCW.
@mitocw Před 5 lety ⁺⁹
The homework can be found in the Assignments section of the course on MIT OpenCourseWare at: ocw.mit.edu/18-065S18. Best wishes on your studies!
@unalcachofa Před 5 lety ⁺³
@@mitocw
are the Julia language online asigmants mentioned also available somewhere? I see only problems from the textbook in the Assignments section of the OCW
@mitocw Před 4 lety ⁺¹
julialang.org/
@StuckNoLuck Před 4 lety ⁺²
@@mitocw Where can we locate the programming assignments?
@nicko6419 Před 4 lety
@@mitocw I have a question about the czcams.com/video/xsP-S7yKaRA/video.html
Where can I find this lab work about convolution?
On MIT OpenCourceWare at
ocw.mit.edu/courses/mathematics/18-065-matrix-methods-in-data-analysis-signal-processing-and-machine-learning-spring-2018/assignments/
I can find only book assignments
ocw.mit.edu/courses/mathematics/18-065-matrix-methods-in-data-analysis-signal-processing-and-machine-learning-spring-2018/assignments/MIT18_065S18PSets.pdf#page=7
Could you help me? Thanks!
@anubhav2198 Před 5 lety ⁺⁴
At 28:00 what is the intuition behind shape of the bowl and large/small eigenvalues? He made it sound like a quite obvious statement.
Also at 36:50, given that S and Q-1SQ are similar implies they have same eigen values. However, how do you show S and Q-1SQ are similar?
OK I figured out the 36:50 part. It is the spectral theorem which sir had covered in previous class. S = Q (lambda) Q-1.
Lambda = Q-1 S Q. As, lambda is defined as the matrix of eigen values of S, this implies that S and Q-1 S Q are similar.
Please explain the part at 28:00 . Thanks!
@ramman405 Před 5 lety ⁺¹³
Regarding similarity you don't need the spectral theorem, just to remember that we say that A and B are similar if there exists an invertible matrix M such that
A = M^(-1) * B * M
You can immediately verify that if A = Q^(-1) * S* Q, B = S, and M=Q, then the equation is satisfied so A=Q^(-1) *S* Q and B=S are similar.
Regarding the bowl statement, it should be pretty clear when the eigenvectors are [1,0] and [0,1]. In that case the energy function is given by:
[x,y] * S * [x,y]^T = x^2 * lambda1 + y^2 * lambda2.
So in the xz-plane it is just the quadratic function scaled by lambda1. In the yz-plane it is just the quadratic function scaled by lambda2 (and in general it is a linear combination of the two). If either eigenvalue is much larger than the other the scalings will be disproportionate and therefore we will get a bowl with a steep slope in the direction of the large eigenvalue, and pretty flat slope in the direction of the small eigenvalue.
However the whole point of diagonalization is that basically we can treat any diagonalizable matrix like the diagonal matrix of its eigenvalues as long as we do the appropriate orthogonal base change (or equivalently work in the correct coordinate system), so really we already know that the general bowl will be an orthogonal transformation of the bowl described above and therefore itself be a narrow valley bowl.
Concretely, if v1,v2 is an orthonormal basis of eigenvectors of S, with associated eigenvalues lambda1,lambda2, then the energy function is
v^T QDQ^T v
where
Q is the orthonormal matrix whose columns are v1,v2.
D is the diagonal matrix with elements lambda1,lambda2.
We can write v as a unique linear combination of the eigenvectors (it is a basis after all):
v= x * v1 + y * v2
Then the energy function evaluates to:
v^T QDQ^T v = v^T QD [x,y]^T
= v^T Q[lambda1 * x, lambda2 * y]
= v^T (lambda1 * x * v1 + lambda2 * y * v2)
= lambda1 * x^2 + lambda2 * y^2,
so again it is a bowl which in the direction of v1 is a 1-dimensional quadratic scaled by lambda1, and in the direction of v2 is a 1-dimensional quadratic scaled by lambda2. So if lambda1 is huge the slope in the direction v1 will be steep. Same as before, just from the point of view of the coordinate system given by the eigenvectors (v1,v2).
@jenkinsj9224 Před 2 lety
@@ramman405 thanks
@CM-Gram Před 4 lety ⁺²
What is meant by energy whe X^t S X multiplication is carried?
@spoopedoop3142 Před 3 lety
Are you asking why this quadratic form is called energy?
@CM-Gram Před 3 lety
@@spoopedoop3142 yes exactly
@bryanlau8712 Před 3 lety ⁺¹
@@CM-Gram Kinetic energy is 1/2mv^2, where v is the velocity vector, and potential energy is 1/2kx^2, where x is the position vector.
@mehmetozer5675 Před 3 lety ⁺¹
I am here to leave a like to the legend.
@suprithashetty9016 Před 3 lety
Voice ❤️
@jeeveshjuneja445 Před 5 lety ⁺³
I think the shape of the bowl will change when we add (x^T)b at 17:00 . Am I right???
@jeevanel44 Před 5 lety ⁺³
It will shift or tilt the bowl in X axis direction. You can try the vizualizer al-roomi.org/3DPlot/index.html
@hardikho Před 3 lety
@@jeevanel44 Hey, sorry to bother you a year later - what expression would I input to receive the bowl shown here?
@yeshuip Před 3 lety ⁺¹
hello, could anyone explains me the difference between energy function ans snorm taught by professor in lecture 8
@rayvinlai7268 Před 2 lety
Hopefully I can still love science at this age
@iwtwb8 Před 2 lety
Does he mean "a * a^T" near the end of the video?
@suprithashetty9016 Před 3 lety
Duster ❤️
@ML_n00b Před měsícem
when does he prove 3?
@vukasinspasojevic1521 Před rokem
Can we find homeworks/labs online?
@mitocw Před rokem
The course materials are available on MIT OpenCourseWare at: ocw.mit.edu/18-065S18. Best wishes on your studies!
@suprithashetty9016 Před 3 lety
Mic ❤️
@suprithashetty9016 Před 3 lety
Chalk ❤️
@hakimmecene2230 Před 4 lety
Hi , I need cours about matrices polynomial please .
@suprithashetty9016 Před 3 lety
Math ❤️
@xc2530 Před rokem
10:00 energy
19:00 convex
@xc2530 Před rokem
14:00 deep learning
@xc2530 Před rokem
24:00 gradient descent
@xc2530 Před rokem
27:00 eigenvalue tells the shape of the bowl
@xc2530 Před rokem
38:00 semi def pos
@jan-heinzwiers581 Před 3 lety ⁺¹
always a minus fault .... sqrt(68) not sqrt(60) , so one eigenvalue neg , yes .... 🤣😊 But now does Matlab opposite , to mine abc formula : (8 +/- sqrt(68))/2 for eigenvalues 🙄
@jan-heinzwiers581 Před 3 lety
Octave : -0.12311 , 8.12311 agrees with abc formula
@jan-heinzwiers581 Před 3 lety
Matlab too 😀
@suprithashetty9016 Před 3 lety
Accent ❤️
@anynamecanbeuse Před 3 lety
No we don't have to use gradient decent in this case
@kevinchen1820 Před 2 lety
20220517簽
@shuyuliu4016 Před 3 lety ⁺²
看着他越来越老唉时光
@allandogreat Před 2 lety
what is Convext? like that ....hahah
@o.y.930 Před 4 lety ⁺¹³
i hope this professor doesnt get any sexual assault charges with that much winking because his lectures are awesome.
@alexandrek2555 Před rokem
🤣
@freeeagle6074 Před rokem
I guess not, unless the air gets personified and files a case.
@cubegears Před rokem
wow hes old now....
@aubrey1008 Před 4 lety
I see that this professor does not take question in class. . Maybe if you email him.
@naterojas9272 Před 4 lety
Maybe no one raise their hand

Další v pořadí

Automatické přehrávání