ML-Poverty-Prediction

Introduction

This repository houses the ml-poverty-prediction project, which predicts poverty and malnutrition prevalence using different machine learning (ML) techniques. The project is based on the research paper "Multivariate Random Forest Prediction of Poverty and Malnutrition Prevalence" (research paper). Our approach diverges from the paper by applying a variety of other ML techniques, with the aim of improving prediction accuracy and model effectiveness.

A synthesized version of the paper is also available as a research article, which can be accessed here (research article).

Project Overview

Objective

The primary objective of this project is to develop an ML model that accurately predicts poverty and malnutrition prevalence, using the data from the aforementioned research while exploring different ML methodologies.

Goals

Validation and Enhancement: We aim to validate the findings of the original research and seek ways to enhance the accuracy of poverty and malnutrition predictions.
Model Comparison: A key goal is to compare the effectiveness of various ML models in predicting poverty and malnutrition prevalence, thus contributing to the broader field of socio-economic predictive analytics.

Research Approach and Methodology

1. Initial Development Phase

In the initial phase of the project, we deliberately develop models before reading the paper. This involves:

Learning Experience: We are intentionally developing our initial machine learning models without consulting the research paper titled "Multivariate Random Forest Prediction of Poverty and Malnutrition Prevalence". The purpose of this approach is to foster an unbiased learning and discovery process, allowing our team to explore and test various methodologies based on our existing knowledge and hypotheses.
Model Development: Our team will develop and train machine learning models using the available data, focusing on predicting poverty and malnutrition prevalence. We aim to explore a range of algorithms and techniques, distinct from those used in the aforementioned research paper.

2. Integration and Refinement Phase

Once the initial models are developed and evaluated, our methodology will evolve to include:

Research Integration: After the initial development phase, we will carefully study the "Multivariate Random Forest Prediction of Poverty and Malnutrition Prevalence" paper. This will provide us with new insights, methodologies, and potential improvements that could be applied to our models.
Model Refinement: Leveraging the knowledge gained from the paper, we will refine, adjust, and possibly retrain our models. This phase is critical for integrating best practices, novel techniques, and insights gleaned from the paper into our existing framework.
Comparative Analysis: The outcomes of our models will be compared against the findings in the research paper. This will not only validate our initial models but also help in understanding the effectiveness of different machine learning approaches in predicting poverty and malnutrition.
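The comparison step can be illustrated with a small sketch. It is not the project's actual pipeline: the data is synthetic, the two candidate models (a mean baseline and a one-feature linear regression) are stand-ins chosen for brevity, and the helper names `fit_mean`, `fit_linear`, and `cross_validated_rmse` are hypothetical. It only shows the general idea of scoring several models on the same cross-validation folds so that their errors are comparable.

```python
import math

def fit_mean(xs, ys):
    """Baseline: always predict the mean of the training targets."""
    mean_y = sum(ys) / len(ys)
    return lambda x: mean_y

def fit_linear(xs, ys):
    """One-feature least-squares regression (stand-in for a real model)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx if sxx else 0.0
    intercept = mean_y - slope * mean_x
    return lambda x: intercept + slope * x

def cross_validated_rmse(fit, xs, ys, k=5):
    """Score a fitting function with k-fold cross-validation (RMSE)."""
    n = len(xs)
    squared_errors = []
    for fold in range(k):
        # Every k-th sample, starting at `fold`, is held out for testing.
        test_idx = set(range(fold, n, k))
        train_x = [xs[i] for i in range(n) if i not in test_idx]
        train_y = [ys[i] for i in range(n) if i not in test_idx]
        model = fit(train_x, train_y)
        squared_errors.extend((ys[i] - model(xs[i])) ** 2 for i in test_idx)
    return math.sqrt(sum(squared_errors) / len(squared_errors))

# Synthetic data (assumption): a linear trend plus small alternating noise.
xs = [float(i) for i in range(20)]
ys = [0.05 + 0.02 * x + (0.01 if i % 2 else -0.01) for i, x in enumerate(xs)]

candidates = {"mean baseline": fit_mean, "linear regression": fit_linear}
scores = {name: cross_validated_rmse(fit, xs, ys) for name, fit in candidates.items()}

# Print the candidates from lowest to highest error.
for name, score in sorted(scores.items(), key=lambda item: item[1]):
    print(f"{name}: RMSE = {score:.4f}")
```

Using identical folds for every candidate keeps the comparison fair. The same loop could score any model that exposes a fit-then-predict interface.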

Goal of This Approach

The rationale behind this two-phase approach is to enhance the learning experience of our team, encourage innovative thinking, and ultimately develop a robust predictive model that can be benchmarked against established research. This method ensures a comprehensive understanding and application of machine learning techniques in the realm of socioeconomic predictions.

For further details or inquiries about our research approach, feel free to reach out as outlined in the Contact Information section.

Data Utilization

(Here, you would detail the types of data used, how you plan to gather, process, and utilize this data. Be specific about data sources, preprocessing steps, and how this data will be split for training and testing purposes.)

Data from the paper here.

Preprocessing steps
Train test split
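As a placeholder until the split is documented, the following is a minimal sketch of a reproducible train/test split using only the Python standard library. The record layout, the 80/20 ratio, and the function name `train_test_split` are assumptions made for illustration. The project may instead rely on a library routine such as scikit-learn's `train_test_split`.

```python
import random

def train_test_split(rows, test_fraction=0.2, seed=42):
    """Shuffle rows reproducibly and split them into (train, test) lists."""
    if not 0.0 < test_fraction < 1.0:
        raise ValueError("test_fraction must be between 0 and 1")
    shuffled = list(rows)  # copy so the caller's data is not reordered
    random.Random(seed).shuffle(shuffled)
    n_test = max(1, round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]

# Hypothetical records: (region_id, feature_value, poverty_rate)
records = [(i, i * 0.5, 0.1 + 0.01 * i) for i in range(10)]
train, test = train_test_split(records, test_fraction=0.2, seed=0)
print(len(train), len(test))
```

Fixing the seed makes the split repeatable across runs, which matters when several models are compared on the same held-out data.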

Methodology

(Describe the different ML techniques you plan to use. This section should detail why these methods were chosen, how they differ from the original research paper's methodology, and their expected impact on the project's goals.)

AutoML libraries
Quantum ML libraries
Neural nets

Challenges and Solutions

(Identify potential challenges in data gathering, model development, and implementation. Also, discuss the strategies you will employ to overcome these challenges.)

Impact Assessment

(Explain how you will measure the success of the project. Include metrics for both technical performance, such as model accuracy, and real-world impact, such as improvements in predicting poverty and malnutrition.)
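For the technical side of the assessment, common regression metrics could serve as a starting point. The sketch below computes RMSE, MAE, and R² with the standard library only. The choice of these three metrics and the sample values are assumptions for illustration, not results or decisions from this project.

```python
import math

def rmse(y_true, y_pred):
    """Root mean squared error: penalizes large errors more strongly."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))

def mae(y_true, y_pred):
    """Mean absolute error: average size of the prediction error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def r_squared(y_true, y_pred):
    """Coefficient of determination: share of variance explained."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    if ss_tot == 0:
        raise ValueError("R^2 is undefined when all true values are identical")
    return 1.0 - ss_res / ss_tot

# Made-up prevalence values (fractions of population), for illustration only.
observed = [0.10, 0.20, 0.30, 0.40]
predicted = [0.12, 0.18, 0.33, 0.37]

print(f"RMSE: {rmse(observed, predicted):.4f}")
print(f"MAE:  {mae(observed, predicted):.4f}")
print(f"R^2:  {r_squared(observed, predicted):.4f}")
```

Because the paper predicts several indicators jointly, these metrics would likely be reported per target variable rather than as one pooled number.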

Future Directions

(Outline any potential future developments or extensions of this project. How could this work be scaled, or what other areas could it potentially impact?)

Tech stack

Zotero + Elicit - literature review
Obsidian - notes
Eraser - graphs
GitHub - repo
Gitpod - cloud dev environment

Linux, Python, Kedro, Docker, etc.

Run and discover the experiments

Run pipeline locally

kedro run

kedro viz run

kedro jupyter lab --ServerApp.allow_remote_access=True

Run pipeline in Docker image

pip install kedro-docker

kedro docker init

kedro docker build

kedro docker run

kedro docker cmd --docker-args="-p=4141:4141" kedro viz --host=0.0.0.0

Note: the Docker image contains only the pipeline, not the data, so no datasets will be visible inside it.

Contributing

(Provide guidelines on how others can contribute to your project. This may include instructions for submitting issues, pull requests, and contact information for direct communication.)

License

(State the license under which your project is released, if applicable.)

Acknowledgements

(Give credit to individuals, organizations, or papers that have contributed significantly to your project.)

Contact Information

(Provide your contact information or that of the main contributors for further inquiries or collaboration.)
