Pathway LDA is a probabilistic model extended from Latent Dirichlet Alllocation, a probabilistic model for extracting topics in text mining, to incorporate the information of pre-defined pathways (factors) while learning pathways that are responsible for the cellular processes.