Category Archives: Concepts and Techniques

Model Selection

| May 8, 2014 2:56 pm |

(First of all, I inform you that this post are being edited now…) Bootstrapping Bootstrapping […]

Pachinko Allocation

| November 11, 2013 4:11 pm |

Pachinko allocation model (PAM) captures arbitrary, nested, and possibly sparse correlations between topics using a […]

Missing data

| November 8, 2013 3:15 pm |

In statistics, missing data (or missing values) occur when no data value is stored for […]

Correlated Topic Models, CTM

| October 4, 2013 7:12 pm |

Motivation A limitation of Latent Dirichlet Allocation (LDA) is the inability to model topic correlation. […]

Gibbs Sampling for Topic Models

| August 27, 2013 11:56 am |

Introduction Suppose that we are under the situation that we want to discover the topics […]

Latent Dirichlet Allocation, LDA

| July 30, 2013 4:14 am |

Motivations LDA overcomes the problems in PLSA model by treating the topic mixture weights as […]

Probabilistic Latent Semantic Analysis, PLSA

| June 12, 2013 12:22 pm |

Topic model 문서들의 집합에서 topic들을 찾아내기 위한 모델로, 눈에 보이는 observation, 즉, given data에 대해 […]

Collaborative Filtering, CF

| May 23, 2013 12:08 pm |

Collaborative Filtering Collaborative filtering (CF) is a technique used by some recommender systems. In general, […]

Frequent Pattern Analysis

| April 10, 2013 2:20 pm |

Frequent pattern is a pattern that occurs frequently in a data set. Association Rules Find […]

MinHash

| April 9, 2013 2:07 pm |

In computer science, MinHash (or the min-wise independent permutations locality sensitive hashing scheme) is a […]

PageRank

| March 13, 2013 5:51 pm |

Motivation Before introducing the basic concept of PageRank, we should first consider why we need […]