Sparse component analysis: A new tool for data mining

Pando Georgiev, Fabian Theis, Andrzej Cichocki, Hovagim Bakardjian

Research output: Chapter in Book/Report/Conference proceedingChapterpeer-review

30 Citations (Scopus)


In many practical problems for data mining the data X under consideration (given as (m × N)-matrix) is of the form X = AS, where the matrices A and S with dimensions mxn and nx N respectively (often called mixing matrix or dictionary and source matrix) are unknown (m ≤ n ≤ N). We formulate conditions (SCA-conditions) under which we can recover A and S uniquely (up to scaling and permutation), such that S is sparse in the sense that each column of S has at least one zero element. We call this the Sparse Component Analysis problem (SCA). We present new algorithms for identification of the mixing matrix (under SCAconditions), and for source recovery (under identifiability conditions). The methods are illustrated with examples showing good performance of the algorithms. Typical examples are EEC and fMRI data sets, in which the SCA algorithm allows us to detect some features of the brain signals. Special attention is given to the application of our method to the transposed system XT = ST AT utilizing the sparseness of the mixing matrix A in appropriate situations. We note that the sparseness conditions could be obtained with some preprocessing methods and no independence conditions for the source signals are imposed (in contrast to Independent Component Analysis). We applied our method to fMRI data sets with dimension (128 × 128 × 98) and to EEC data sets from a 256-channels EEC machine.

Original languageEnglish
Title of host publicationSpringer Optimization and Its Applications
PublisherSpringer International Publishing
Number of pages26
Publication statusPublished - 2007
Externally publishedYes

Publication series

NameSpringer Optimization and Its Applications
ISSN (Print)1931-6828
ISSN (Electronic)1931-6836


  • Blind Signal Separation
  • Clustering
  • Sparse Component Analysis


Dive into the research topics of 'Sparse component analysis: A new tool for data mining'. Together they form a unique fingerprint.

Cite this