Sapere Aude

Notes on MPPCA denoising

Introduction


This note is for the paper titled as Denoising of diffusion MRI using random matrix theory, which was published on the NeuroImage Journal on 2016.

Considering the publish data, this paper only compared the proposed method with two denoising methods used in denoising diffusion MRI: the non-local mean(ANLM), total variation(TV/TGV). And the authors concluded the shortcomings of these two denoising approaches:

ANLM: loss of spatial resolution of the image (blur) and introduction of additional partial volume effects that lead to complications in further quantitative analyses or to biases in diffusion modeling.

TV: dependency on a regularization term, introduction of reconstruction artifacts, and the fact that thermal noise is not the sole source of local variations. Indeed, fine anatomical details might be removed as well by this non-selective technique.

Then the authors pointed out the main problem of PCA denoising. They wrote:

The number of signal-carrying principle components, i.e. the number of components that significantly contribute to the description of the underlying diffusion process, is unknown and is expected to depend on imaging factors such as resolution, b-value and SNR. Hence, an objective criterion to discriminate between the signal-carrying and noise-only components has been missing... Commonly used criteria include thresholding of the eigenvalues associated with the principal components by an empirically set value.

Based on that, they objectified the threshold for PCA denoising by exploiting the fact that noise-only eigenvalues are expected to obey the universal Marchenko-Pastur law, a result of the random matrix theory for noisy covariance matrices. There is a blog by Terrence Tao describing the topics in random matrix theory.

The question is what's the Marchenko-Pastur law and how we use it to objectify the threshold for PCA?

Method


Marchenko-Pastur distribution

A redundant M×N data matrix X is the one that can be synthesized by a combination of a few, P{M,N} linearly independent sources, or principal components, derived via the singular value decomposition of X:

X=NUΛVT

where U and V are unitary matrices whose columns are the left-singular and right-singular vectors of X. Without loss of generality, we assume M < N. The diagonal elements Λ1,1,...ΛM,M of the M×N matrix Λ are the singular values, with Λi,i=λi being the ith eigenvalue of the M×M matrix:

Y=1NXXT=UΛ2UT

Marcheko-Pastur law: If X denotes m×n random matrix whose entries are independent identically distributed (iid.) random variables with mean 0 and variance σ2 < let Y=1NXXT and let λ1,λ2,...,λm be the eigenvalues of Y. Finally, consider the random measure

μm(A)=1m#{λjA},A

counting the number of eigenvalues in the subset A included in .

Assume that m,n so that the ratio m/nγ(0,+). Then μmμ, where

p(λ|σ,γ)=(λ+λ)(λλ)2πγλσ2,λ[λ,λ+],withλ±=σ2(1±γ)2

Hence, the M~=MP samllest nonzero eigenvalues λP+1λM obey the MP distribution. And γ=M~/N and σ the noise level. If M~P. Note that the width of the MP bulk spectrum equals:

λ+λ=4γσ2

and the expectation value of an MP distribution is given by:

λλ+p(λ|σ,γ)λdλ=σ2

The distribution edge λ+ distinguishes between noise- and significant signal carrying principle components. Nullifying all λλ+,ΛΛ~, and reconstructing the matrix results in a denoised matrix:

X~=NUΛ~VT

Using the expectation value, we can calculate the variance accumulated in the omitted eigenvalues is give by:

var(X)=E(XXT)E(X)2=E(Y)=1Mp+1Mλi=M~Mσ2;E(X)=0;

As the omitted and residual principal components are linearly uncorrelated, the variance σ2~ of the residual noise, contained within the P significant components, can be give by:

σ2~=σ2var(X)=PMσ2

The SNR after denoising should thus scale with M/P.

Denoising algorithm

Now it's clear that we should know the λ+, or know the P. The noise level σ will be given as a additional product. In this paper, the number P~ of significant components is given by incrementally increasing p until

i=p+1Mλi(Mp)σ2~(p).

The update of σ using the equation mentioned to calculate the spectrum length:

σ2~(p)=λp+1λM4γp.

Once the P~ is determined, the estimated noise level σ:

σ2~=i=P~+1MλiMP~.

Discussion


#MRI