Abstract:Principal Component Analysis (PCA) and its nonlinear extension Kernel PCA (KPCA) are widely used across science and industry for data analysis and dimensionality reduction. Modern deep learning tools have achieved great empirical success, but a framework for deep principal component analysis is still lacking. Here we develop a deep kernel PCA methodology (DKPCA) to extract multiple levels of the most informative components of the data. Our scheme can effectively identify new hierarchical variables, called deep principal components, capturing the main characteristics of high-dimensional data through a simple and interpretable numerical optimization. We couple the principal components of multiple KPCA levels, theoretically showing that DKPCA creates both forward and backward dependency across levels, which has not been explored in kernel methods and yet is crucial to extract more informative features. Various experimental evaluations on multiple data types show that DKPCA finds more efficient and disentangled representations with higher explained variance in fewer principal components, compared to the shallow KPCA. We demonstrate that our method allows for effective hierarchical data exploration, with the ability to separate the key generative factors of the input data both for large datasets and when few training samples are available. Overall, DKPCA can facilitate the extraction of useful patterns from high-dimensional data by learning more informative features organized in different levels, giving diversified aspects to explore the variation factors in the data, while maintaining a simple mathematical formulation.

Matrix-based Kernel Principal Component Analysis for Large-Scale Data Set.

Incomplete Cholesky Decomposition Based Kernel Principal Component Analysis For Large-Scale Data Set

A Novel Kernel Possibitistic Fuzzy C-Means Clustering Algorithm For Large Scale Data Sets

Improved <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" id="M1"><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:math>-<mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" id="M2"><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:math> PCA Algorithm Using Artificial Sparsity in Dynamic MRI

Robust Principal Component Analysis Based on Maximum Correntropy Criterion

To Solve Kernel Principal Component Analysis Using Iterative Method

Adaptive Kpca Modeling of Nonlinear Systems

An Iterative Algorithm for Robust Kernel Principal Component Analysis

Adaptive kernel subspace method for speeding up feature extraction

KPCA Based on Feature Samples for Fault Detection

Nonlinear Component Analysis for Large-Scale Data Set Using Fixed-Point Algorithm

Efficient Iterative Dynamic Kernel Principal Component Analysis Monitoring Method for the Batch Process with Super-large-scale Data Sets

Nonlinear Process Monitoring Using Improved Kernel Principal Component Analysis

A Locality Preserving Approach for Kernel PCA

Deep Kernel Principal Component Analysis for Multi-level Feature Learning

Manifold Principle Component Analysis for Large-Dimensional Matrix Elliptical Factor Model

A Covariance-Free Iterative Principal Component Analysis for High Dimensional and Large Scale Data

Sparse Kernel Extreme Components Analysis

Kernel Additive Principal Components

Robust and Sparse Kernel PCA and Its Outlier Map

An Infinite Dimensional Analysis of Kernel Principal Components