Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Google Analytics Capstone Project #474

Closed
abhisheks008 opened this issue Jan 2, 2024 · 9 comments · Fixed by #687
Closed

Google Analytics Capstone Project #474

abhisheks008 opened this issue Jan 2, 2024 · 9 comments · Fixed by #687
Assignees
Labels
Assigned 💻 Issue has been assigned to a contributor Intermediate Points 30 - SSOC 2024 SSOC

Comments

@abhisheks008
Copy link
Owner

ML-Crate Repository (Proposing new issue)

🔴 Project Title : Google Analytics Capstone Project
🔴 Aim : Create an analysis model for the Google analytics using machine learning.
🔴 Dataset : https://www.kaggle.com/datasets/fredericxiong/google-analytics-capstone-project
🔴 Approach : Try to use 3-4 algorithms to implement the models and compare all the algorithms to find out the best fitted algorithm for the model by checking the accuracy scores. Also do not forget to do a exploratory data analysis before creating any model.


📍 Follow the Guidelines to Contribute in the Project :

  • You need to create a separate folder named as the Project Title.
  • Inside that folder, there will be four main components.
    • Images - To store the required images.
    • Dataset - To store the dataset or, information/source about the dataset.
    • Model - To store the machine learning model you've created using the dataset.
    • requirements.txt - This file will contain the required packages/libraries to run the project in other machines.
  • Inside the Model folder, the README.md file must be filled up properly, with proper visualizations and conclusions.

🔴🟡 Points to Note :

  • The issues will be assigned on a first come first serve basis, 1 Issue == 1 PR.
  • "Issue Title" and "PR Title should be the same. Include issue number along with it.
  • Follow Contributing Guidelines & Code of Conduct before start Contributing.

To be Mentioned while taking the issue :

  • Full name :
  • GitHub Profile Link :
  • Participant ID (If not, then put NA) :
  • Approach for this Project :
  • What is your participant role? (Mention the Open Source Program name. Eg. HRSoC, GSSoC, GSOC etc.)

Happy Contributing 🚀

All the best. Enjoy your open source journey ahead. 😎

@abhisheks008 abhisheks008 added the Up-for-Grabs ✋ Issues are open to the contributors to be assigned label Jan 2, 2024
@abhisheks008 abhisheks008 added Intermediate Points 30 - SSOC 2024 IWOC2024 IWOC 2.0 Open Source Event labels Jan 11, 2024
@Shrutakeerti
Copy link

Hi, @abhisheks008! I would like to take up this issue.
Full name : Shrutakeerti Datta
GitHub Profile Link : https://github.com/Shrutakeerti
Participant ID (If not, then put NA) : N/A
Approach for this Project :

  1. Data Collection and Preparation : Preprocessing the data and ensuring integrity and consistency
  2. EDA: Analyzing the data patterns and identifying the KPIs
  3. Generating the accessible insights through data visualizations
    4)Feature Engineering and Selection and Model training ,Evaluation by selecting the Data's that has been trained
  4. Model Deployment and testing the model for further accuracy

What is your participant role? IWOC2024

@abhisheks008
Copy link
Owner Author

Issue assigned to you @Shrutakeerti

@abhisheks008 abhisheks008 added Assigned 💻 Issue has been assigned to a contributor and removed Up-for-Grabs ✋ Issues are open to the contributors to be assigned labels Jan 12, 2024
@abhisheks008 abhisheks008 added Up-for-Grabs ✋ Issues are open to the contributors to be assigned and removed Assigned 💻 Issue has been assigned to a contributor Intermediate Points 30 - SSOC 2024 IWOC2024 IWOC 2.0 Open Source Event labels Feb 12, 2024
@abhisheks008
Copy link
Owner Author

Unassigned as the open source event ended up.

@why-aditi
Copy link
Contributor

Full name : Aditi Kala
GitHub Profile Link : https://github.com/why-aditi
Participant ID (If not, then put NA) : N/A
Approach for this Project :
Data Collection and Preparation -> EDA -> Model Training -> Model Validation -> Comparing the performance metrics of various models
What is your participant role? SSOC'24

@abhisheks008
Copy link
Owner Author

Full name : Aditi Kala GitHub Profile Link : https://github.com/why-aditi Participant ID (If not, then put NA) : N/A Approach for this Project : Data Collection and Preparation -> EDA -> Model Training -> Model Validation -> Comparing the performance metrics of various models What is your participant role? SSOC'24

Can you share a brief about the planned models?

@fspzar123
Copy link
Contributor

fspzar123 commented Jun 24, 2024

Full name : Filbert Shawn
GitHub Profile Link : https://github.com/fspzar123
Participant ID (If not, then put NA) : N/A
Approach for this project:
Initial processing:

  • Address any values that are missing.
  • If required, standardize or normalize the data.

Illustration:

  • Plot each region's time series data to identify trends and patterns.
  • To see how several regions relate to one another, use correlation charts.
  • Model Selection: Exponential Smoothing, XGBoost, LSTM.
  • Divide the data into sets for testing and training.
  • Utilize various models and assess each one's performance with metrics such as MAE, RMSE, and MAPE.
  • Cross-validation should be done to make sure the model is robust.

Adjusting Hyperparameters:

  • Make use of grid search or random search to fine-tune the parameters of the models I have selected.

Predicting:

  • Utilize the trained model to predict future values and compare them with the test set.

Results visualization:

  • Plot the actual numbers against the projected values to see how well your model is working.

What is your participant role? SSOC 24

@why-aditi
Copy link
Contributor

Full name : Aditi Kala GitHub Profile Link : https://github.com/why-aditi Participant ID (If not, then put NA) : N/A Approach for this Project : Data Collection and Preparation -> EDA -> Model Training -> Model Validation -> Comparing the performance metrics of various models What is your participant role? SSOC'24

Can you share a brief about the planned models?

I'm thinking of using Random Forest, ARIMA, LSTM at the moment

@abhisheks008
Copy link
Owner Author

Full name : Aditi Kala GitHub Profile Link : https://github.com/why-aditi Participant ID (If not, then put NA) : N/A Approach for this Project : Data Collection and Preparation -> EDA -> Model Training -> Model Validation -> Comparing the performance metrics of various models What is your participant role? SSOC'24

Can you share a brief about the planned models?

I'm thinking of using Random Forest, ARIMA, LSTM at the moment

Cool then go ahead. Try to implement 5-6 models for this problem statement.

Assigned this issue to you.

@abhisheks008 abhisheks008 added Assigned 💻 Issue has been assigned to a contributor Intermediate Points 30 - SSOC 2024 SSOC and removed Up-for-Grabs ✋ Issues are open to the contributors to be assigned labels Jun 26, 2024
Copy link

github-actions bot commented Jul 6, 2024

Hello @why-aditi! Your issue #474 has been closed. Thank you for your contribution!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Assigned 💻 Issue has been assigned to a contributor Intermediate Points 30 - SSOC 2024 SSOC
Projects
None yet
Development

Successfully merging a pull request may close this issue.

4 participants