Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Blood Donation Prediction #522

Closed
wants to merge 2 commits into from
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
797 changes: 797 additions & 0 deletions Blood Donation Prediction/Model/blood_donation_prediction.ipynb

Large diffs are not rendered by default.

58 changes: 58 additions & 0 deletions Blood Donation Prediction/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,58 @@
# Blood Donation Analysis

## Project Overview

This project aims to create a predictive model that can accurately identify donors likely to donate blood again. The dataset used for this analysis is available on [Kaggle](https://www.kaggle.com/code/mmmarchetti/predicting-blood-donations).

## Dataset

- **Dataset Link:** [Blood Donation Dataset](https://www.kaggle.com/code/mmmarchetti/predicting-blood-donations)
- **Attributes:**
- Recency (months)
- Frequency (times)
- Monetary (c.c. blood)
- Time (months)

## Approach

1. **Data Understanding and Exploration:**
- Load and explore the dataset.
- Check for missing values, outliers, and data types.
- Visualize the distribution of each feature and the target variable.

2. **Data Preprocessing:**
- Handle missing values.
- Encode categorical variables if needed.
- Scale or normalize numerical features.
- Split the dataset into training and testing sets.

3. **Model Selection and Training:**
- Implement various machine learning models and deep learning methods.
- Train each model on the training dataset.

4. **Model Evaluation:**
- Evaluate each model's performance using metrics like accuracy, ROC AUC, and classification reports.
- Utilize cross-validation for robust performance assessment.

5. **Hyperparameter Tuning:**
- Optimize hyperparameters for selected models to improve performance.
- Use techniques like grid search or randomized search.

6. **Model Comparison:**
- Compare the performance of different models and identify the most effective ones.

7. **Deep Learning:**
- Implement at least one or two deep learning models using TensorFlow/Keras.
- Train and evaluate the deep learning models.

8. **Final Model and Conclusion:**
- Select the best-performing model based on evaluation metrics.
- Summarize the findings and draw conclusions about predictors of blood donation.

## How to Run the Code

1. Clone the repository:

```bash
git clone https://github.com/your-username/blood-donation-analysis.git
cd blood-donation-analysis
Binary file added Blood Donation Prediction/dataset/tranfusion.zip
Binary file not shown.
Loading
Loading