Sentiment Analysis and Natural Language Processing for Bitcoin and Ethereum
There's been a lot of hype in the news lately about cryptocurrency, so I took a stock, so to speak, of the latest news headlines regarding Bitcoin and Ethereum to get a better feel for the current public sentiment around each coin.
I applied natural language processing to understand the sentiment in the latest news articles featuring Bitcoin and Ethereum. I also applied fundamental NLP techniques to better understand the other factors involved with the coin prices such as common words and phrases and organizations and entities mentioned in the articles.
Completed tasks:
Used the newsapi to pull the latest news articles for Bitcoin and Ethereum and create a DataFrame of sentiment scores for each coin.
In this section, I used NLTK and Python to tokenize the text for each coin, and :
- Lowercase each word
- Remove punctuation
- Remove stop words
Next, looked at the ngrams and word frequency for each coin.
- Used NLTK to produce the ngrams for N = 2.
- Listed the top 10 words for each coin.
Finally, generated word clouds for each coin to summarize the news for each coin.