Build a multiple linear regression model by performing EDA and do necessary transformations and select the best model using Python. Dataset Name - 50_startups data. Dataset Name - 50_startups data. Do transformations for getting better predictions of profit and make a table containing R^2 value for each prepared model.
R&D Spend -- Research and devolop spend in the past few years Administration -- spend on administration in the past few years Marketing Spend -- spend on Marketing in the past few years State -- states from which data is collected Profit -- profit of each state in the past few years
- Follow the Machine Learning Life Cycle Steps.
- Write proper Insights/Inference on each analysis
- Do proper EDA on the columns and the data with graphs using seaborn and interpret the graphs.
- Save the graphs in one folder and zip the folder.
- Zip the datasets folders for 50_startups data.
- Write proper print statements while writing the code. Proper rounding of the numbers is also required.
- Prepare a presentation for both the projects seperately. The presentation should contain -
- Objective
- Solution
- Business Impact