Games of Thrones - Data Science Projects

This repository contains data science projects centered around the Game of Thrones universe. The projects explore and analyze text data from the show, leveraging machine learning and natural language processing (NLP) techniques. The goal is to gain insights from dialogues, characters, and other textual elements.

Overview

This project is a data-driven exploration of the Game of Thrones world, focusing on the unique language used by characters. By applying text processing, sentiment analysis, and word frequency techniques, we uncover the patterns in the characters' dialogue. It is ideal for those who want to explore text mining and NLP in a fun and practical way.

Projects

1. Unique Character Dialogues

This project extracts and processes unique dialogue for each major character.
Goal: Understand the distinct speech patterns of different characters.
Core Tasks: Text parsing, data cleaning, and outputting results to individual files.

2. Sentiment Analysis

Analyzes the sentiment (positive, negative, neutral) of various characters' dialogues.
Tools: TextBlob, VADER.
Goal: Understand how the sentiment of the characters' language evolves throughout the series.

3. Word Frequency Analysis

Identifies frequently used words and phrases by the characters.
Goal: Discover the most important or repeated themes.
Tools: NLTK, Pandas.

4. Named Entity Recognition (NER)

Detects and categorizes named entities like people, places, and organizations in Game of Thrones texts.
Goal: Build a list of key entities mentioned in the dialogues.
Tools: SpaCy.

Installation

To use this repository, follow these steps:

Clone the repository:

git clone https://github.com/jarvismayur/Games-of-Thornes---Data-Scince-Projects.git

Navigate to the project directory:

cd Games-of-Thornes---Data-Scince-Projects

Set up a Python virtual environment and install the required libraries:

# Create virtual environment
python -m venv venv

# Activate the virtual environment
# On Windows:
venv\Scripts\activate
# On macOS/Linux:
source venv/bin/activate

# Install dependencies
pip install -r requirements.txt

Technologies Used

Python: Main programming language used.
Pandas: Data manipulation.
NLTK: Natural Language Processing.
SpaCy: Named Entity Recognition (NER).
TextBlob: Sentiment analysis.
Matplotlib & Seaborn: Data visualization.
Jupyter Notebook: For project development.

Usage

You can run any of the projects in this repository by navigating to the project folder and executing the corresponding Jupyter notebook or Python script. For example, to analyze character dialogues:

cd Unique-Character-Dialogues/
jupyter notebook dialogue_analysis.ipynb

Contributing

If you'd like to contribute to this repository:

Fork the repository.
Create a new branch (git checkout -b feature-branch).
Make your changes and commit them (git commit -m 'Add new feature').
Push to the branch (git push origin feature-branch).
Create a pull request.

License

This project is licensed under the Apache License 2.0. See the LICENSE file for more information.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.gitattributes		.gitattributes
ARYA.txt		ARYA.txt
BRAN.txt		BRAN.txt
CASSEL.txt		CASSEL.txt
CATELYN.txt		CATELYN.txt
CERSEI.txt		CERSEI.txt
GARED.txt		GARED.txt
JAIME.txt		JAIME.txt
JON.txt		JON.txt
LICENSE		LICENSE
NED.txt		NED.txt
README.md		README.md
ROBB.txt		ROBB.txt
ROBERT.txt		ROBERT.txt
ROYCE.txt		ROYCE.txt
SANSA.txt		SANSA.txt
SEPTA MORDANE.txt		SEPTA MORDANE.txt
THEON.txt		THEON.txt
Untitled.ipynb		Untitled.ipynb
conv.txt		conv.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Games of Thrones - Data Science Projects

Table of Contents

Overview

Projects

1. Unique Character Dialogues

2. Sentiment Analysis

3. Word Frequency Analysis

4. Named Entity Recognition (NER)

Installation

Technologies Used

Usage

Contributing

License

About

Languages

License

jarvismayur/Games-of-Thornes---Data-Scince-Projects

Folders and files

Latest commit

History

Repository files navigation

Games of Thrones - Data Science Projects

Table of Contents

Overview

Projects

1. Unique Character Dialogues

2. Sentiment Analysis

3. Word Frequency Analysis

4. Named Entity Recognition (NER)

Installation

Technologies Used

Usage

Contributing

License

About

Topics

Resources

License

Stars

Watchers

Forks

Languages