Back to main page

DEMO: Online Latent Dirichelt Allocation


Here, we present a demo of topic model, Online Latent Dirichlet Allocation (Online LDA) [1]. A sample from the book "Pride and Prejudice":

olda's example

The words in the same colors are assigned to the same topic.

The model is trained with Wikipedia articles in 2010. Vocabulary size is 10,000 (listed in here), and the number of latent topics is 100. You can estimate latent topics assigned to the words in the vocabulary via the online topic model.

How to use:


[1] Matthew D. Hoffman, David M. Blei, and Francis Bach, "Online Learning for Latent Dirichlet Allocation", in Neural Information Processing Systems (NIPS), 2010.