Topic modeling using word2vec & LDA

Content

The objective is to extract information and value from large volumes of textual data using Natural Language Processing (NLP). This notebook uses the word2vec algorithm to represent the words of several documents and to study the similarities between them. It then combines word2vec with LDA (Latent Dirichlet Allocation), an unsupervised learning algorithm, to perform topic modeling: the documents are grouped by topic and the keywords of each document are listed.

Requirements

Python version 3.9.7

File details

nlp-topic-modeling
- The Jupyter notebook (.ipynb file) that contains the code.
data
- The folder that contains the data.

Here is the project layout:

- project
    > nlp-topic-modeling
        - nlp-topic-modeling.ipynb
        > data 
            - papers.csv
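Given this layout, the data file could be read from the notebook folder roughly as follows. This is a sketch using only the standard library; the relative path follows the tree above, while the helper name `load_rows` is hypothetical and the column names are not assumed (they are taken from the file's header row).

```python
import csv
from pathlib import Path

# Path relative to the notebook folder, following the layout above.
DATA_PATH = Path("data") / "papers.csv"

# Long text fields can exceed the csv module's default field size limit,
# so raise it (2**31 - 1 fits in a C long on all common platforms).
csv.field_size_limit(2**31 - 1)


def load_rows(path):
    """Read a CSV file into a list of dicts keyed by the header row."""
    with open(path, newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle))


# Only attempt to load when the file is actually present.
if DATA_PATH.exists():
    rows = load_rows(DATA_PATH)
    columns = list(rows[0].keys()) if rows else []
    print(f"{len(rows)} rows, columns: {columns}")
```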

Features

My profile • My GitHub • Original Kaggle dataset

