(from: wikipedia)
The whitening transformation is a decorrelation method which transforms a set of random variables having the covariance matrix Σ into a set of new random variables whose covariance is aI, where a is a constant and I is the identity matrix. The new random variables are uncorrelated and all have variance 1. The method is called "whitening" because it transforms the input matrix to the form of white noise, which by definition is uncorrelated and has uniform variance. It differs from decorrelation in that the variances are made to be equal, rather than merely making the covariances zero. That is, where decorrelation results in a diagonal covariance matrix, whitening produces a scalar multiple of the identity matrix.
Definition
Define X to be a random vector with covariance matrix Σ and mean 0. The matrix Σ can be written as the outer product of X and XT:
(注,我初一看以為寫錯了,按我的想法好像寫成 E[XiXj] 更靠譜,我又想了想,后來明白了,E[XiXj]這樣寫的話,括號里面成了兩個變量,而實際應該是一個變量(總體),前后可以是兩個變量樣本值。所以 E[XXT]這樣寫是有道理的,只是注意不要誤解吧。還可以這樣寫 E[XYT],X和Y是同分布,但是這是表示兩個一般變量協方差的常用方法,顯然和這里X是向量的環境不符合。)
Define Σ1/2 as
Define the new random vector Y = Σ-1/2X. The covariance of Y is
Thus, Y is a white random vector.
Based on the fact that the covariance matrix is always positive semi-definite, Σ1/2 can be derived using eigenvalue decomposition:
Where the matrix Λ1/2 is a diagonal matrix with each element being square root of the corresponding element in Λ. To show that this equation follows from the prior one, multiply by the transpose: