Skip to content

nedamhs/TopicModeling_LDA

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

TopicModeling_LDA

Latent Dirichlet Allocation (LDA) applied to 2,000 PubMed abstracts using collapsed Gibbs sampling to discover 20 latent biomedical topics.

output quality is evaluated using topic coherence metrics (c_v & c_npmi), indicating interpretable biomedical topics.

See the full implementation and results in LDA.ipynb.

Learned Topics

Word Clouds

Acknowledgment

Collapsed Gibbs sampling implementation borrowed & adapted from CS179 course material by Prof. Alexander Ihler.

About

Topic modeling of PubMed abstracts with LDA using collapsed Gibbs sampling.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors