Skip to content

Latest commit

 

History

History
15 lines (13 loc) · 345 Bytes

README.md

File metadata and controls

15 lines (13 loc) · 345 Bytes

QA-Clustering

  • Clustering the answers by following methods

    1. LDA (latent Dirichlet allocation)
      • gensim module
      • scikit module
    2. Hierarchical
    3. Incremental
  • We can weight the words in two ways

    1. Bag of Words
    2. TF-IDF
  • Measuring similarities between the words

    1. Euclidean Distance
    2. Edit Distance