This project applies agglomerative clustering to group Pokemon based on their Attack and HP stats.
Agglomerative clustering is an unsupervised machine learning technique that builds clusters bottom-up: each Pokemon starts in its own cluster, and the two closest clusters are merged repeatedly. Applied to the Attack and HP statistics, it reveals groups of Pokemon with similar values in these two attributes.
- Data preprocessing of Pokemon statistics
- Implementation of agglomerative clustering algorithm
- Visualization of clusters using scatter plots
- Analysis of cluster characteristics and compositions
The project uses the Pokemon Stats Dataset and the Pokemon Images Dataset, both available from [G-drive Source]. The stats dataset contains various attributes for each Pokemon; only Attack and HP are used for the clustering analysis.
- Data cleaning and normalization
- Feature selection (Attack and HP)
- Application of agglomerative clustering
- Determination of optimal number of clusters
- Visualization of results
- Interpretation of cluster meanings
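
The steps above can be sketched roughly as follows. This is a minimal illustration, not the code in `cluster_pokemon.py`: the column names `Attack` and `HP`, the helper names `cluster_attack_hp` and `plot_clusters`, the Ward linkage, the candidate range of cluster counts, and the use of the silhouette score to choose the number of clusters are all assumptions made for the example. The demo data is synthetic, not the real Pokemon dataset.

```python
import numpy as np
import pandas as pd
from sklearn.cluster import AgglomerativeClustering
from sklearn.metrics import silhouette_score
from sklearn.preprocessing import StandardScaler


def cluster_attack_hp(df, candidate_ks=range(2, 7)):
    """Scale Attack/HP, pick a cluster count by silhouette score, label rows.

    Hypothetical helper: assumes columns named "Attack" and "HP".
    """
    # Data cleaning: drop rows with a missing Attack or HP value.
    clean = df.dropna(subset=["Attack", "HP"]).copy()

    # Normalization: zero mean, unit variance, so neither stat dominates.
    X = StandardScaler().fit_transform(clean[["Attack", "HP"]])

    # Try each candidate cluster count and keep the best silhouette score
    # (one possible selection criterion, assumed here).
    best_k, best_score, best_labels = None, -1.0, None
    for k in candidate_ks:
        model = AgglomerativeClustering(n_clusters=k, linkage="ward")
        labels = model.fit_predict(X)
        score = silhouette_score(X, labels)
        if score > best_score:
            best_k, best_score, best_labels = k, score, labels

    clean["cluster"] = best_labels
    return clean, best_k


def plot_clusters(labeled):
    """Scatter plot of Attack vs. HP, colored by cluster label."""
    import matplotlib.pyplot as plt  # imported lazily; only needed for plotting

    fig, ax = plt.subplots()
    ax.scatter(labeled["Attack"], labeled["HP"], c=labeled["cluster"], cmap="viridis")
    ax.set_xlabel("Attack")
    ax.set_ylabel("HP")
    return fig


# Synthetic stand-in data: three loose groups plus one row with a missing value.
rng = np.random.default_rng(0)
centers = [(40, 40), (80, 120), (130, 70)]
points = np.vstack([rng.normal(c, 5.0, size=(30, 2)) for c in centers])
demo = pd.DataFrame(points, columns=["Attack", "HP"])
demo.loc[len(demo)] = [np.nan, 50.0]

labeled, best_k = cluster_attack_hp(demo)

# Interpretation: mean Attack and HP per cluster.
summary = labeled.groupby("cluster")[["Attack", "HP"]].mean()
```

To try the same sketch on the real data, `demo` could be replaced with a DataFrame loaded via `pd.read_csv`, adjusting the column names to match the actual file. Calling `plot_clusters(labeled)` returns a Matplotlib figure that can be shown or saved.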
- Python 3.x
- NumPy
- Pandas
- Scikit-learn
- Matplotlib
- Seaborn
- Clone the repository:
git clone https://github.com/itsmarmot/Pokemon-Clustering.git
- Navigate to the project directory:
cd Pokemon-Clustering
- Install the required dependencies:
pip install -r requirements.txt
- Run the project script:
python cluster_pokemon.py
- Incorporate additional Pokemon attributes for more complex clustering
- Compare results with other clustering algorithms (e.g., K-means, DBSCAN)
- Develop a recommendation system based on cluster analysis