Analysis conducted for the tech challenge
- Snapshot fetching
- Tweet Gathering
- Crypto-Analysis
- Pandas
- Numpy
- Requests
- BeautifulSoup
- matplotlib
Due to the size of the challenge we prefered using the simple to use library requests.
- Pandas
- Numpy
- Pillow / PIL (new version)
- Wordcloud
- Scikit-learn (machine learning part)
- Pickle
- NLTK
- matplotlib
We collect tweets which concers our list of crypto of interest. We process and clean them in order to be able to filter them into ham or spam We filter them using a pre-trained filter model used to filter spams (own model) (Count vectorizer => tfidf Transformer => Multinomial Naïve Bayes) We then apply a sentiment analysis based on the
Results could be greatly improved by giving the algo a better consistant data. Due to the Tweepy / Twitter API restriction, free API provides us only with Streaming data.
Time serie analysis performed on our cryptucrrencies of interest.
- Pandas
- Matplotlib
- Numpy
- Cryptocurrency (Own libraries)