Bayesian Robust Tensor Factorization for Incomplete Multiway Data

Qibin Zhao, Guoxu Zhou, Liqing Zhang, Andrzej Cichocki, Shun Ichi Amari

Результат исследований: Вклад в журналСтатьярецензирование

133 Цитирования (Scopus)

Аннотация

We propose a generative model for robust tensor factorization in the presence of both missing data and outliers. The objective is to explicitly infer the underlying low-CANDECOMP/PARAFAC (CP)-rank tensor capturing the global information and a sparse tensor capturing the local information (also considered as outliers), thus providing the robust predictive distribution over missing entries. The low-CP-rank tensor is modeled by multilinear interactions between multiple latent factors on which the column sparsity is enforced by a hierarchical prior, while the sparse tensor is modeled by a hierarchical view of Student-$t$ distribution that associates an individual hyperparameter with each element independently. For model learning, we develop an efficient variational inference under a fully Bayesian treatment, which can effectively prevent the overfitting problem and scales linearly with data size. In contrast to existing related works, our method can perform model selection automatically and implicitly without the need of tuning parameters. More specifically, it can discover the groundtruth of CP rank and automatically adapt the sparsity inducing priors to various types of outliers. In addition, the tradeoff between the low-rank approximation and the sparse representation can be optimized in the sense of maximum model evidence. The extensive experiments and comparisons with many state-of-the-art algorithms on both synthetic and real-world data sets demonstrate the superiorities of our method from several perspectives.

Язык оригиналаАнглийский
Номер статьи7120147
Страницы (с-по)736-748
Число страниц13
ЖурналIEEE Transactions on Neural Networks and Learning Systems
Том27
Номер выпуска4
DOI
СостояниеОпубликовано - апр. 2016
Опубликовано для внешнего пользованияДа

Fingerprint

Подробные сведения о темах исследования «Bayesian Robust Tensor Factorization for Incomplete Multiway Data». Вместе они формируют уникальный семантический отпечаток (fingerprint).

Цитировать