Milestone II MADS

LDA MMDS

Bubble = topic
The larger the bubble, the higher percentage of the number of sentences in the corpus is about that topic.
The more distance between the bubbles the more different they are.
Blue bars = the frequency of each word in the corpus
If no topic is selected, the blue bars of the most frequently used words will be displayed.
Red bars = estimated number of times a given term was generated by a given topic
The word with the longest red bar is the word that is used the most by the sentences belonging to that topic.
This was made with the pyLDAvis Library