Modeling Dyadic Data with Binary Latent Factors

Ted Meeds; Zoubin Ghahramani; Radford Neal; Sam Roweis

Modeling Dyadic Data with Binary Latent Factors

Ted Meeds ,
Zoubin Ghahramani ,
Radford Neal ,
Sam Roweis

Neural Information Processing Systems | December 2006

Download BibTex

We introduce binary matrix factorization, a novel model for unsupervised matrix decomposition. The decomposition is learned by fitting a non-parametric Bayesian probabilistic model with binary latent variables to a matrix of dyadic data. Unlike bi-clustering models, which assign each row or column to a single cluster based on a categorical hidden feature, our binary feature model reflects the prior belief that items and attributes can be associated with more than one latent cluster at a time. We provide simple learning and inference rules for this new model and show how to extend it to an infinite model in which the number of features is not a priori fixed but is allowed to grow with the size of the data.