A nonlinear principal component decomposition

Florian Gunsilius

Published on 28 March 2017

This Working Paper has been replaced by Independent nonlinear component analysis

The idea of summarizing the information contained in a large number of variables by a small number of "factors" or "principal components" has been widely adopted in economics and statistics. This paper introduces a generalization of the widely used principal component analysis (PCA) to nonlinear settings, thus providing a new tool for dimension reduction and exploratory data analysis or representation. The distinguishing features of the method include (i) the ability to always deliver truly independent factors (as opposed to the merely uncorrelated factors of PCA); (ii) the reliance on the theory of optimal transport and Brenier maps to obtain a robust and efﬁcient computational algorithm and (iii) the use of a new multivariate additive entropy decomposition to determine the principal nonlinear components that capture most of the information content of the data.