Path: blob/master/material/linear-algebra-diagonalization.ipynb
934 views
Ortogonal matrices
En forma matricial, la transformación del sistema de coordenas inicial al sistema de coordenados rotado por un ángulo está dado por
Theorem 1
If is Hermitic, e.g: Then exists an unitary matrix e.g: such that where is the diagonalized mass matrix.
Corollary 1
If is symmetric, e.g: Then exists an ortogonal matrix , e.g: such that where is the diagonalized mass matrix.
Eigenvector problem
Note that each eigenvector, corresponding to each column of the matrix, is associated to each eigenvalue where is the -th column of the matrix .
In this way, if we interchange the columns of the diagonalization matrix , the order of the eigenvalues also change This property is very important because usually the diagonalizationn alghoritm gives not the desired ordering of the eigenvalues and eigenvectors. It is recommended to use the np.c_
method for the eigenvector reoirdering
However, the Theorem only guarantees existence. Tor really calculate the diagonalization matrix we must establish the eigenvector problem:
We can use the unitary propery to write or Therefore, the eigenvalue equation is just: ParseError: KaTeX parse error: Expected & or \\ or \cr or \end at end of input: \begin{align} % \boldsymbol{A}\,\boldsymbol{V}_i=&\lambda_i\boldsymbol{V}_i \nonumber\\ \boldsymbol{A}\,\boldsymbol{V}_i-\lambda_i\boldsymbol{V}_i=&\boldsymbol{0} \nonumber\\ (\boldsymbol{A}-\lambda_i \,\boldsymbol{I})\boldsymbol{V}_i=&\boldsymbol{0}\,, %ParseError: KaTeX parse error: Expected 'EOF', got '\end' at position 2: \̲e̲n̲d̲{align} where is the -th column of the matrix , and is the identity matrix.
To avoid the trivial solution , we require that does not have an inverse, or equivalently
Example
From arXiv:2010.06458:
In the three flavor neutrino oscillation, the neutrino flavor states are linear superposition of mass eigenstates where are the elements of the lepton mixing matrix known as PMNS (Pontecorvo-Maki-Nakagawa-Sakita) matrix such that The time evolution follows where is the energy associated with the mass eigenstates This is a superposition state.
The observables associated to the three neutrinos are the entries of the unitary matrix , and the eigenvalues associated to each eigenvector , , . The normal ordering is . The unitary matrix can be parameterized in terms of three mixing angles, , , and a complex phase, , such that where and . Thus, we can write as so that
After decades of experimental efforts with thousands of millions of dollars in investment and two recent Nobel prizes, most of the parameters are already measured (see PDG22020 ):
where is the squared mass difference between eigenvalues and ; and is really where is the speed of light in vacuum in natural units with , and .
To a better measurement of for example, a new large experiment called DUNE, and with a cost of around million of dollars, is in construction in the United States
Theorem 2: Singular value decomposition (SVD)
See SVD (where the Hermetique-conjugate is denoted with "*" instead that with "")
A general complex matrix can be diagonalized by a bi-diagonal transformation such that where is the diagonalized mass matrix.
Demostration
Since is hermitic and is diagonal, then in fact there exists an unitary matrix . Similarly there exists an unitary matrix which diagonalizes
Scipy implementation
The implementation in scipy is based in the inverted relation
The implementation is trough the module scipy.linalg.svd
:
Eigenvectors and eigenvalues
If we make
We know that there exists a bi-diagonal transformación such that ParseError: KaTeX parse error: Expected & or \\ or \cr or \end at end of input: …gin{equation} % \boldsymbol{A}\,\boldsymbol{U}=\boldsymbol{V}\boldsymbol{A}_{\text{diag}} \end{equation}$$ ParseError: KaTeX parse error: Expected & or \\ or \cr or \end at end of input: …ldsymbol{V}_i % \end{equation}$$ not sum upon . Here
are called eigenvalues
and are the eigenvectors
We can use this to check the proper order of the eigenvalues
Transformation of a linear system
We start again with the matrix equation, capitol bold letters denotes matrices where is an matrix.
We know that there exists a bi-diagonal transformación such that So, by doing standard operations we have where or
If , and , the solution of the system is given by
Note that and the final solution is Therefore
Example
A suitable way to introduce this method is applying it to some basic problem. To do so, let's take the result of the Example 1:
As the matrix is symmetric and
Check if all eigenvalues are different from zero:
Also as
Theorem 1 in numpy
WARNING: Only works for Hermitic matrices!
We first check the proper order of the diagonalization
Since
The final solution is:
check with np.linalg.inv(A_diag)
Activity: Usar np.lingalg.solve
We can now check some properties.
For the case
Obtain for and, in the proper order
Extract the first eigenvector
associated with the eigenvalue
Check:
Which means the eigenvalue associated to the "operator" acting on the eigenvector
The diagonalization matrix can be rebuild from the eigenvectors
is rebuild with
or with: np.hstack((V0,V1,V2))
Note that a sign of an egigenvalues can be changed:
Eigenvector reordering
We can use this to check the proper order of the eigenvalues.
The order of eigenvalues can now be changed by changing the order of the eigenvectors and redifining the diagonalization matrix. For example, from small to large.
Then the proper order in the eigenvalues can be obtained
Once in the proper order, the mixing angles can be obtained so that
Implementation of an algorithm for reordering
Returns the indices that would sort an array.
can be implemented in general with a comprehension list
For rebuild
And for reorder to index
or:
Implementation as function
Activity: Build a function that diagonalize symmetric matrices with the eigenvalues in increasing order in the eigenvalues as a replacement of np.linalg.eig
General matrix
Example: Consider the following general real matrix without any specific symmetry
In this case we need to make the bidiogalization process of Theorem 2 with the Singular value decomposition (SVD) implemented in Scipy
This is important to stablish that the eigenvectors are determined until ordering and permutations
For a general matrix they are just ortogonal matrices
We define below a orthogonal matrix:
Note that, for , after
Interchange and
,
we get
Note that after
Interchange and
,
we get .
Since the required trasformations in and are the same, the Theorem 2 is unnafected, only the order in the eigenvalues change to the normal ordering. In Fact
Activity: Solve the system for the previous matrix and
Interpretations of Theorem 2
Single Diagonalization matrix
In Theorem 2 establishes a unique set of a matrices: with its eigenvalues matrix, , and their eigenvectors matrices and . However, for a fixed set of eigenvalues and eigenvectors we can have several possibilities of matrices. For example, if we fix so that
Therefore: Under this conditions, for a fixed set of eigenvalues and eigenvectors (associated to the matrix ) there is a unique matrix
Example Let in the previous example. Find the matrix which gives rise to the same eigenvalues
Finally, we can use the full procedure of hermitic matrices to obtain the bidiagonal matrices
Example: Diagonalize the matrix by using the Theorem 1 instead
→ is obtained with <ALT GR>+i
or, similarly: ↓←
Actividad: Make the algorithm of reordering for theorem 2
Mixed terms
Let:
Consider the quadratic equation The quadratic equation is in terms of: , y
We can simplify this expression if we change to a new basis in which is diagonal, in such a case the crossed term would disappear. The rotation from is defined by where is the rotation matrix Therefore
In the new basis
such that , where
In this basis, the quadratic equation is in terms of eigenvalues and mixing angle, . Therefore, there are not longer mixed terms.
The diagonalization of quadratic equations can be straightforwardly generalized to -th degree equations in terms of matrices
Example: Electroweak interactions
To understand the electromagnetic and weak fundamental interactions, the mathematical formulation need to be done in one basis where the photon field, denoted with a symbol , is still not well defined. Instead, the field , the precursor of , appears along with the weak field , the precursor of the electroweak field . In the mathematical basis we have then and there, the symmetric mass matrix is calculated as (Details here: PDF)
Checking that the determinant is zero
This imply that one egivanlue is zero
which means that the matrix rank, the number of non-zero eigenvalues, is 1
To make the change to the phyisical basis, through the rotation matrix the following transformation need to be established such that is the diagonalization matrix of
with the normal ordering: , and
→ Diagonalization matrix
→ Ortogonal matrix
→ Rotation matrix
To obtain the eigensystem, we use
which in fact satisfy
Since the first (zero) eigenvalue is the one associated to , we can interpret directly the rotation matrix without changing the order of the eigenvectors
Note that in fact, the absolute value of the first element of the first eigenvector is the larger one and corresponds to the component along the -axis.
Therefore
The eigenvalue associated to is
corresponding to the mass in units of . As a reference, the proton mass is approximately
The physical observable associated to the weak mixing angle, , is (see PDF, along with the boson mass, )
Corolary 1 Theorem II
For a unique unitary matrix and a set of unique of eigenvalues there exists an infite set of mass matrices , associated to the infinite sets of matrices , such that where is the diagonalized mass matrix. In particular, if is hermitic then there is only one possibility for and therefore is unique.
Corolary 2 Theorem II
An unique hermitic mass matrix, , can be generated from an infinite set of arbitrary matrices , such that
Example: Neutrino mass matrix
Returning back to the neutrino mixing discussion, it is worth noticing that in the mathematical basis the mass matrix, is non-diagonal. However, if we assume that it is hermitic, then an unitary tranformation (rotation in the symmetric case), ,can be defined to diagonal basis as which is identified as the diagonalization matrix
According to the corrolary there is a set of infinite matrices complatible with an unique and the eigenvalues
Casas-Ibarra parameterization
Consider a symmetric matrix . We can assumme without lost of generality that this can be generated from a matrix such that Theorem 1 gurantees that exists an ortogonal matrix such that where where are the eigenvalues of . Therefore where Therefore, exists an ortogonal arbitrary matrix , such that
In this way, the matrix can be parameterized in terms of as
By using the previous equations, build a matrix with the following conditions
is an orthogonal matrix with a rotation angle as a random number between . Use your identification number as the seed of the random number generator.
The eigenvalues are and .
is a diagonalization matrix with mixing angle
Build the matrix and check that has the proper eigenvalues and eigenvectors
From the equation for
The symmetric matrix is
After the reordering of the eigenvalues, we would obtain the original and therefore
In this way, there is an infinity set of symmetric matrices which have the same eigenvalues and eigenvectors
THDM CP-even masses and mixing
Some times, the convention to identify the large projections in the non-diagonal basis.
In this case we have the rotation between an interaction basis and a physical basis defined as Since the diagonalization of a symmetric matrix can be obtained analitically, here we build a symmetric matrix from the eigenvalues, and , and the rotation angle, , and check with the numerical results by using a benchmark point:
Here, the eigenvalue is defined as the one with the eigenvector with the larger projection upon
The formula for the mass matrix in terms of the previous parameters can be found in [hep-ph/0207010,arXiv:1507.00933]. We assumme here that . See Appendix D of [hep-ph/0207010] for full formulas
In this way, from the BP point, we can build the symmetric mass :
By using the proper basis Therefore Since then . In this way, we must obtain from to avoid ambiguities with a global sign in the second eigenvector
The eigenvector with large projection in the second component along ( Interaction basis → Mass basis ) is the associated with the standard model Higgs mass,
Analytical diagonalization from [arXiv:1507.00933]
Activity
Let Choose the diagonalization method (A/B)
A:
B: