Skip to content

Commit

Permalink
Merge pull request #590 from stackaway/main
Browse files Browse the repository at this point in the history
LinkedIn Poll Data Analysis
  • Loading branch information
abhisheks008 committed Feb 24, 2024
2 parents 6c581aa + f974c9b commit d6d1add
Show file tree
Hide file tree
Showing 12 changed files with 772 additions and 0 deletions.
Original file line number Diff line number Diff line change
@@ -0,0 +1,101 @@
Quiz_number,Total_Views,Total_Responses,Right_Answers,Total_Likes,Avg_right,Max_Right
1,134100,3486,1424,84,41%,Yes
2,172686,5669,4093,139,72%,Yes
3,113495,2128,1175,74,55%,Yes
4,73924,347,142,33,41%,Yes
5,100538,1559,917,54,59%,Yes
6,83232,769,246,51,32%,Yes
7,87765,1244,603,65,48%,Yes
8,56598,469,134,31,29%,Yes
9,98162,2081,1119,67,54%,Yes
10,56957,557,205,31,37%,No
11,73472,1039,817,24,79%,Yes
12,72205,994,637,35,64%,Yes
13,53785,435,184,34,42%,Yes
14,63371,616,253,29,41%,Yes
15,152686,3053,647,81,21%,No
16,92686,2294,1720,58,75%,Yes
17,50796,648,131,32,20%,No
18,50632,520,252,18,48%,Yes
19,43791,522,318,22,61%,No
20,52736,1064,804,45,76%,Yes
21,48729,733,453,31,62%,Yes
22,50790,864,578,25,67%,Yes
23,37380,644,403,27,63%,Yes
24,46674,523,173,28,33%,No
25,46154,1022,793,35,78%,Yes
26,54535,664,401,36,60%,Yes
27,48071,450,327,30,73%,Yes
28,41423,966,835,28,86%,Yes
29,67950,1606,1367,48,85%,Yes
30,61879,1218,652,49,54%,Yes
31,84481,835,378,34,45%,Yes
32,53133,633,412,37,65%,Yes
33,70442,735,624,55,85%,Yes
34,49672,403,61,29,15%,Yes
35,57934,506,276,27,55%,Yes
36,38206,260,90,23,35%,Yes
37,30522,274,166,16,61%,Yes
38,39351,378,133,18,35%,Yes
39,33210,400,277,24,69%,Yes
40,59164,933,720,35,77%,Yes
41,44940,429,235,22,55%,Yes
42,49975,512,344,18,67%,Yes
43,36452,315,194,11,76%,Yes
44,51477,813,593,35,73%,Yes
45,44614,443,363,28,82%,Yes
46,49772,645,442,16,69%,Yes
47,36931,316,59,20,19%,No
48,34718,321,64,30,20%,No
49,35950,222,129,18,58%,Yes
50,45536,570,414,26,73%,Yes
51,65696,903,527,40,58%,Yes
52,33024,313,135,22,43%,Yes
53,58197,913,698,34,76%,Yes
54,48735,614,408,41,66%,Yes
55,49525,578,346,29,60%,Yes
56,49472,546,249,38,46%,Yes
57,49523,380,54,32,14%,No
58,50450,426,215,24,50%,Yes
59,90300,1746,1448,68,83%,Yes
60,31090,190,133,25,70%,Yes
61,80224,1388,899,55,65%,Yes
62,47464,506,343,36,68%,Yes
63,56559,706,514,36,73%,Yes
64,55279,875,617,22,71%,Yes
65,54905,849,691,39,81%,Yes
66,30078,352,53,23,15%,No
67,43791,617,491,22,80%,Yes
68,42955,495,133,19,27%,No
69,58314,871,688,46,79%,Yes
70,44089,597,318,23,53%,Yes
71,41417,451,255,28,57%,Yes
72,45161,517,253,26,49%,Yes
73,47645,498,291,43,58%,Yes
74,37878,344,255,16,74%,Yes
75,27687,216,74,16,34%,Yes
76,44673,847,587,26,69%,Yes
77,45839,563,149,30,26%,No
78,53115,455,94,17,21%,No
79,125486,1653,1036,62,63%,Yes
80,35211,357,173,24,48%,Yes
81,34632,402,266,17,66%,Yes
82,33647,298,99,11,33%,No
83,72097,1042,523,47,50%,Yes
84,49314,627,446,35,71%,Yes
85,68008,906,557,58,61%,Yes
86,24307,157,37,10,24%,No
87,62538,824,101,41,12%,No
88,56659,1241,875,34,71%,Yes
89,60886,1290,1067,37,83%,Yes
90,37436,353,183,22,52%,Yes
91,100582,1525,731,35,48%,Yes
92,85187,1385,576,40,42%,Yes
93,76218,770,500,35,65%,Yes
94,74333,1235,851,46,69%,Yes
95,100386,2001,1414,52,71%,Yes
96,111743,1385,769,64,56%,Yes
97,53757,504,355,30,70%,Yes
98,84803,1182,488,62,41%,No
99,72729,730,339,50,46.44%,Yes
100,84737,1674,855,57,51.08%,Yes
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added LinkedIn Poll Data Analysis/Images/heat_map.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added LinkedIn Poll Data Analysis/Images/smv.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
586 changes: 586 additions & 0 deletions LinkedIn Poll Data Analysis/Model/Linkedln_poll_analysis.ipynb

Large diffs are not rendered by default.

80 changes: 80 additions & 0 deletions LinkedIn Poll Data Analysis/Model/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,80 @@
**LINKEDIN POLL DATA ANALYSIS**

**GOAL**

To analyse the dataset and see the popularity of the Linkedln Poll.
<!-- Write the main goal of project and what's the purpose of it -->

**DATASET**

<!-- Add a link to dataset and from where it's taken. -->
https://www.kaggle.com/datasets/kalilurrahman/linkedin-poll-data

**DESCRIPTION**

<!-- Brief description about the project -->
The dataset compiles quiz data gathered from LinkedIn Polls, providing insights into various topics.

**WHAT I HAD DONE**

<!-- Write down the step by step procedure of how project works using points. -->
* Analyzed data, extracted insights, and generated relevant visualizations.
* Process data to prepare it for machine learning model training.
* Trained default-parameter models:
* Linear Regression
* Decision Tree
* Random Forest
* SVM

* In this, Linear Regression performed best with R2_score: 0.8196 and MSE: 0.0000. (Refer : `LinkedIn_poll_analysis.ipynb`)

**MODELS USED**

<!-- List out all the algorithms or models used in this project -->
<!-- Why have you choosed that algorithms should also be stated -->
* Linear Regression - Chosen for its simplicity, interpretability, and ease of implementation.
* Decision Tree - Chosen for its ability to handle non-linear relationships and its interpretability.
* Random Forest - Aggregates the predictions of multiple decision trees trained on random subsets of the data and features.
* SVM - Chosen for its effectiveness in high-dimensional spaces and its ability to handle non-linear relationships.

**LIBRARIES NEEDED**

<!-- Add all the libraries needed in this project in points -->
* Pandas V2.0.3
* Numpy V1.24.3
* Matplotlib V3.7.2
* Scikit-learn V1.3.2
* Seaborn version: 0.12.2

**VISUALIZATION**

<!-- INCLUSION OF IMAGES OF THE VISUALIZATION IS MUST (RESULT OF EDA). -->
![All Features](../Images/heat_map.png "All Features")


![Total Responses](../Images/tot_responses.png "Total Responses")

**ACCURACIES**

<!-- Add all the algorithms used with their accuracies and results -->

* Linear Regression - Mean Squared Error (MSE): 0.0000, R-squared (R2) Score: 0.8196
* Decision Tree - Mean Squared Error (MSE): 0.0000, R-squared (R2) Score: 0.4230
* Random Forest - Mean Squared Error (MSE): 0.0000, R-squared (R2) Score: 0.6487
* SVM - Mean Squared Error (MSE): 0.0001, R-squared (R2) Score: -2.3062


**CONCLUSION**

<!-- What's the conclusion derived from this project and also showcase the accuracy results if it's applicable. Be briefer -->
We analyze, preprocess, and visualize the features. Then, we calculate the quiz popularity based on views and answers received. Finally, we explore quiz popularity using various ML models along with other parameters.

Linear Regression model is the best fit - Mean Squared Error (MSE): 0.0000, R-squared (R2) Score: 0.8196.

**YOUR NAME**

<!-- Add your name at the end of the file, along with social media handles if applicable. -->
*Churnika S Mundas*


[![LinkedIn](https://img.shields.io/badge/linkedin-%230077B5.svg?style=for-the-badge&logo=linkedin&logoColor=white)](https://www.linkedin.com/in/churnika-mundas-64767b246/) [![GitHub](https://img.shields.io/badge/github-%23121011.svg?style=for-the-badge&logo=github&logoColor=white)](https://github.com/stackaway)
5 changes: 5 additions & 0 deletions LinkedIn Poll Data Analysis/requirements.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
* Pandas V2.0.3
* Numpy V1.24.3
* Matplotlib V3.7.2
* Scikit-learn V1.3.2
* Seaborn version V0.12.2

0 comments on commit d6d1add

Please sign in to comment.