Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Predicting Airbnb Listing Prices in New York City #671

Closed
awesohame opened this issue Jun 19, 2024 · 8 comments · Fixed by #725
Closed

Predicting Airbnb Listing Prices in New York City #671

awesohame opened this issue Jun 19, 2024 · 8 comments · Fixed by #725
Assignees
Labels
Assigned 💻 Issue has been assigned to a contributor Intermediate Points 30 - SSOC 2024 SSOC

Comments

@awesohame
Copy link

ML-Crate Repository (Proposing new issue)

🔴 Project Title : Predicting Airbnb Listing Prices in New York City
🔴 Aim : Predict Airbnb listing prices based on various features.
🔴 Dataset : https://www.kaggle.com/datasets/dgomonov/new-york-city-airbnb-open-data
🔴 Approach :

  1. Exploratory Data Analysis (EDA):

    • Analyze data distribution, patterns, and anomalies.
    • Visualize relationships between features and price.
  2. Data Preprocessing:

    • Handle missing values.
    • Drop unnecessary columns.
    • Encode categorical variables.
    • Scale numerical features.
  3. Model Training and Evaluation:

    • Split data into training and testing sets.
    • Train regression models:
      • Linear, Decision Tree, Random Forest, Gradient Boosting.
    • Evaluate with metrics like MAE, MSE, R2.
    • Compare model performance to select best fit.

📍 Follow the Guidelines to Contribute in the Project :

  • You need to create a separate folder named as the Project Title.
  • Inside that folder, there will be four main components.
    • Images - To store the required images.
    • Dataset - To store the dataset or, information/source about the dataset.
    • Model - To store the machine learning model you've created using the dataset.
    • requirements.txt - This file will contain the required packages/libraries to run the project in other machines.
  • Inside the Model folder, the README.md file must be filled up properly, with proper visualizations and conclusions.

🔴🟡 Points to Note :

  • The issues will be assigned on a first come first serve basis, 1 Issue == 1 PR.
  • "Issue Title" and "PR Title should be the same. Include issue number along with it.
  • Follow Contributing Guidelines & Code of Conduct before start Contributing.

To be Mentioned while taking the issue :

  • Full name : Soham Aversekar
  • GitHub Profile Link : https://github.com/awesohame
  • Participant ID (If not, then put NA) : NA
  • Approach for this Project : Perform Exploratory Data Analysis (EDA) to understand data distribution and visualize feature relationships. Preprocess data by handling missing values, dropping unnecessary columns, encoding categorical variables, and scaling numerical features. Train multiple regression models, evaluate using metrics like MAE, MSE, and R2, and select the best-performing model.
  • What is your participant role? (Mention the Open Source Program name. Eg. HRSoC, GSSoC, GSOC etc.) : VSoC

Happy Contributing 🚀

All the best. Enjoy your open source journey ahead. 😎

@awesohame awesohame added the Up-for-Grabs ✋ Issues are open to the contributors to be assigned label Jun 19, 2024
Copy link

Thank you for creating this issue! We'll look into it as soon as possible. Your contributions are highly appreciated! 😊

@awesohame
Copy link
Author

Please assign me @awesohame this issue under VSoC.

@abhisheks008
Copy link
Owner

Can you implement 6-7 models for this project?

@awesohame
Copy link
Author

@abhisheks008 Yes, I can train 6-7 models

@abhisheks008
Copy link
Owner

Assigned @awesohame

@abhisheks008 abhisheks008 added Assigned 💻 Issue has been assigned to a contributor MEDIUM VSOC - 10 Points VSoC’24 Vinyasa Summer of Code (VSOC) and removed Up-for-Grabs ✋ Issues are open to the contributors to be assigned labels Jun 21, 2024
@abhisheks008 abhisheks008 added Up-for-Grabs ✋ Issues are open to the contributors to be assigned and removed Assigned 💻 Issue has been assigned to a contributor MEDIUM VSOC - 10 Points VSoC’24 Vinyasa Summer of Code (VSOC) labels Jul 20, 2024
@adi271001
Copy link
Contributor

adi271001 commented Jul 27, 2024

Full name : Aditya D
GitHub Profile Link : https://github.com/adi271001
Participant ID (If not, then put NA) : NA
Approach for this Project :

  1. Data Cleaning
  2. Data preprocessing
  3. EDA
  4. Feature Engineering
  5. Modelling and evaluation
  6. Hyperparameter Tuning
  7. Feature Importance
    What is your participant role? (Mention the Open Source Program name. Eg. HRSoC, GSSoC, GSOC etc.) : SSOC Contributor
    @abhisheks008 please assign this to me if still open

@abhisheks008
Copy link
Owner

Assigned @adi271001

@abhisheks008 abhisheks008 added Assigned 💻 Issue has been assigned to a contributor Intermediate Points 30 - SSOC 2024 SSOC and removed Up-for-Grabs ✋ Issues are open to the contributors to be assigned labels Jul 28, 2024
@abhisheks008 abhisheks008 linked a pull request Aug 2, 2024 that will close this issue
12 tasks
Copy link

github-actions bot commented Aug 2, 2024

Hello @adi271001! Your issue #671 has been closed. Thank you for your contribution!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Assigned 💻 Issue has been assigned to a contributor Intermediate Points 30 - SSOC 2024 SSOC
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants