Using NLP techniques for answering questions about COVID-19 using Kaggle Dataset

This is a summary of what was done, the full report is at NLP_COVID_Presentation.

Introduction and Scope

COVID-19 Open Research Dataset
- ~13 GB
- Over 135,000 scholarly articles
- Including over 68,000 with full text
First goal: What do we know about COVID-19 symptoms?
Second goal: How can we cluster papers into coherent groups?

First goal

Words representation

Chosen method: GloVe – Global Vectors for word representation
1. Generate corpus using the provided dataset
2. Create word vectors
3. Measure cosine distance

Words clustering

Main idea: Cluster words represented by their vector using k-means algorithm

Words cloud

Symptoms:

Organs:

Medications:

Second goal

Create feature vector for each paper using BOW model
Cluster vectors into coherent groups
Visualize clusters in a 2D plot

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
code		code
images		images
.gitignore		.gitignore
NLP_COVID_Presentation.pptx		NLP_COVID_Presentation.pptx
README.md		README.md
myplot.png		myplot.png
myplot2.png		myplot2.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Using NLP techniques for answering questions about COVID-19 using Kaggle Dataset

Introduction and Scope

First goal

Words representation

Words clustering

Words cloud

Symptoms:

Organs:

Medications:

Second goal

t-SNE with no labels:

t-SNE with k-means labels:

Interactive plot

About

Releases

Packages

Contributors 2

Languages

pavalucas/NLP_techniques_COVID-19_Kaggle_Dataset

Folders and files

Latest commit

History

Repository files navigation

Using NLP techniques for answering questions about COVID-19 using Kaggle Dataset

Introduction and Scope

First goal

Words representation

Words clustering

Words cloud

Symptoms:

Organs:

Medications:

Second goal

t-SNE with no labels:

t-SNE with k-means labels:

Interactive plot

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages