GitHub - andreip/twitter-authorities: Find authorities for Twitter topics. [Licenta][Undergraduate thesis]

Summary

Find authorities for Twitter topics. Needs to download tweets (fetching stage), then it can compute based on fetched tweets. Requirements:

the implementation uses mongodb to store downloaded tweets
see requirements.txt for more, but main packages used are: pymongo, tweepy, scikit-learn, numpy, scipy

See results from datasets included in dump/licenta folder. Results are in .html files in dump/licenta/results/ folder. These results have been obtained with the current code, by using only Retweet Impact and Mention Impact features, see paper/ folder for way more details and how we combined the metrics and analyzed results.

Usage

populate mongoDB database with tweets

use existing tweets in dump/ folder. Import by using the dump/import.sh script like so:
- have a mongod running instance before running the script
- run the script from root folder like $ ./dump/import.sh and you're done.
- verify in mongod that you have successfully imported the data and the features+metrics:
```
> db.getCollection("halep").count()  # 2736`
> db.getCollection("ukraine gas russia").count()  # 16203
```
or fetch it yourself like say for topic "ukraine gas russia":

# tells it to fetch a maximum of 100 pages x 100 per page => 10,000 tweets.
$ ./main.py fetch "ukraine gas russia" 100

after we've added tweets in mongoDB, we can compute authorities for the collected tweets:

# 2nd param should be a valid collection in mongoDB containing tweets.
# should give us the main authorities by processing all tweets from mongo collection "ukraine gas russia".
./main.py compute "ukraine gas russia"

Name		Name	Last commit message	Last commit date
Latest commit History 120 Commits
dump		dump
helpers		helpers
migrate		migrate
paper		paper
render_results		render_results
tests		tests
.gitignore		.gitignore
Makefile		Makefile
README.md		README.md
config.rc		config.rc
constants.py		constants.py
main.py		main.py
patch_tweepy.py		patch_tweepy.py
preprocess_words.py		preprocess_words.py
requirements.txt		requirements.txt
stopwords.txt		stopwords.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Summary

Usage

About

Releases

Packages

Languages

andreip/twitter-authorities

Folders and files

Latest commit

History

Repository files navigation

Summary

Usage

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages