Skip to content

πŸš€ Welcome to my Kaggle submission for "Natural Language Processing with Disaster Tweets." In this challenge, we explore tweets, using NLP to distinguish between those about real disasters and those that aren't. The goal is to build a robust model for accurate disaster-related tweet prediction. πŸ† Impressive F1 score of 0.79926 on the public leader

Notifications You must be signed in to change notification settings

elmezianech/Kaggle-Competition-NLP-disasterTweets

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

4 Commits
Β 
Β 
Β 
Β 

Repository files navigation

Kaggle-Competition-NLP-disasterTweets

Competition Overview:

πŸš€ Welcome to my submission for the Kaggle competition "Natural Language Processing with Disaster Tweets." In this challenge, we delve into the fascinating world of tweets, leveraging Natural Language Processing (NLP) to distinguish between tweets about real disasters and those that aren't. The objective is to create a robust model that excels in predicting disaster-related content.

Achievements:

πŸ† Attained an impressive F1 score of 0.79926 on the public leaderboard. 🌟 Demonstrated the efficacy of SVM in accurately identifying disaster-related tweets.

Link : https://www.kaggle.com/code/elmezianech/notebook86470c7043

Solution Highlights:

πŸ” Data Exploration & Preprocessing:

Meticulous handling of missing values, including the creation of a 'has_location' binary feature. Strategic handling of NaN values in the 'keyword' column, ensuring data integrity.

✨ Text Preprocessing Mastery:

Utilized NLTK for advanced text preprocessing, encompassing URL removal, special character handling, punctuation removal, tokenization, and stemming.

🌐 Feature Extraction with TF-IDF:

Extracted meaningful features from processed text using TF-IDF (Term Frequency-Inverse Document Frequency) vectorization. Ensured the model comprehends the significance of each word in tweets.

πŸš„ Modeling with SVM:

Employed a robust Support Vector Machine (SVM) model for classification. Fine-tuned hyperparameters using GridSearchCV, optimizing the SVM configuration.

Next Steps:

Open to collaborative discussions and feedback for continuous improvement. πŸ™Œ Happy coding! πŸš€πŸ“Š #NLP #KaggleCompetition #DisasterTweetsPrediction #DataScienceWin

About

πŸš€ Welcome to my Kaggle submission for "Natural Language Processing with Disaster Tweets." In this challenge, we explore tweets, using NLP to distinguish between those about real disasters and those that aren't. The goal is to build a robust model for accurate disaster-related tweet prediction. πŸ† Impressive F1 score of 0.79926 on the public leader

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published