reproduction_wlwf

Installation

Clone this repository.
Install the dependencies:

cd reproduction_wlwf
pip install -r requirements.txt

Format your data in the following trees

deputes
├── lr
│   ├── 20220620.csv
│   ├── 20220621.csv
│   ├── 20220622.csv
    ...
├── majority
│   ├── 20220620.csv
│   ├── 20220621.csv
│   ├── 20220622.csv
    ...
├── nupes
│   ├── 20220620.csv
│   ├── 20220621.csv
│   ├── 20220622.csv
    ...
└── rn
    ├── 20220620.csv
    ├── 20220621.csv
    ├── 20220622.csv
    ...

media
├── 20220620.csv
├── 20220621.csv
├── 20220622.csv
    ...

supporters
├── lr
│   ├── 20220620.csv
│   ├── 20220621.csv
│   ├── 20220622.csv
    ...
...
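
To check that a folder follows the layout above before running the scripts, a small helper along these lines may be useful. It is a minimal sketch and not part of this repository: the function name `list_daily_files` is hypothetical, and it assumes that every daily file is named with eight digits followed by `.csv`, as in the trees shown.

```python
import re
from pathlib import Path

# Assumption: daily files are named YYYYMMDD.csv, e.g. 20220620.csv
DAILY_FILE = re.compile(r"^\d{8}\.csv$")


def list_daily_files(root):
    """Map each sub-folder of `root` to the sorted names of its daily CSV files.

    Files stored directly in `root` (as in the media tree) are listed
    under the empty string "".
    """
    root = Path(root)
    groups = {}
    for path in sorted(root.rglob("*.csv")):
        if not DAILY_FILE.match(path.name):
            continue  # skip files that do not follow the naming scheme
        group = path.parent.relative_to(root).as_posix()
        if group == ".":
            group = ""
        groups.setdefault(group, []).append(path.name)
    return groups
```

Calling `list_daily_files("your/path/to/folder/deputes/")` should return one entry per group (lr, majority, nupes, rn), while the media folder should return a single entry under `""`.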

The CSV files should have the following columns: id, local_time, text, user_screen_name, user_id, retweeted_id. As in the example below, retweeted_id is left empty when the tweet is not a retweet.

id                  local_time          text                 user_screen_name user_id             retweeted_id
1587218214638985216 2022-11-01T00:01:26 RT @UEFrance: 🆕 Es… trudigoz         347374931           1587030788331159553
1587355550840414208 2022-11-01T09:07:09 RT @midy_paul: #Sai… midy_paul        1090311673985056770 1587112047480918018
1587374936288632833 2022-11-01T10:24:11 Cérémonies du Souve… Bannier_G        866695760905154560  <empty>
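
The column requirement can be verified before running the pipeline. The following is a minimal sketch, not code from this repository: `missing_columns` is a hypothetical helper, and it assumes the files are comma-separated, UTF-8 encoded, and start with a header row.

```python
import csv

# Column names taken from the list above
REQUIRED_COLUMNS = [
    "id",
    "local_time",
    "text",
    "user_screen_name",
    "user_id",
    "retweeted_id",
]


def missing_columns(csv_path):
    """Return the required columns that are absent from the file's header row."""
    # Assumption: comma-separated, UTF-8, first row is the header
    with open(csv_path, newline="", encoding="utf-8") as f:
        header = next(csv.reader(f), [])
    return [column for column in REQUIRED_COLUMNS if column not in header]
```

An empty list means the header contains every expected column; extra columns are not reported by this check.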

Create document-term matrix for a given public

See the examples below.

congress:

python 01-create-dtm.py congress your/path/to/folder/deputes/

The results will be saved in data_prod/dfm/congress-....txt

media:

python 01-create-dtm.py media your/path/to/folder/media/

The results will be saved in data_prod/dfm/media-....txt

supporters:

python 01-create-dtm.py supporter your/path/to/folder/supporters/

The results will be saved in data_prod/dfm/supporter-....txt

Etc.
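
For readers unfamiliar with the term, a document-term matrix has one row per document and one column per vocabulary term, each cell holding the number of times the term occurs in the document. The toy sketch below illustrates the idea only; it is not what 01-create-dtm.py does, the function `build_dtm` is hypothetical, and the whitespace tokenisation is a deliberate simplification.

```python
from collections import Counter


def build_dtm(documents):
    """Return (vocabulary, matrix) where matrix[i][j] counts term j in document i."""
    # Simplification: lowercase and split on whitespace
    tokenized = [doc.lower().split() for doc in documents]
    vocabulary = sorted({token for doc in tokenized for token in doc})
    matrix = []
    for doc in tokenized:
        counts = Counter(doc)
        matrix.append([counts[term] for term in vocabulary])
    return vocabulary, matrix


vocabulary, matrix = build_dtm(["retraites et climat", "climat climat"])
print(vocabulary)  # ['climat', 'et', 'retraites']
print(matrix)      # [[1, 1, 1], [2, 0, 0]]
```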

Encoding with Sentence-BERT

python 01_encode_with_sbert.py data_source/tweets_from_deputes data_prod/embeddings/deputes/
