-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Graded Assignment-5 (Jan Term 2024):- Data Visualization Tools #26
Comments
Name - Shri Krishna Pandey Dataset Description
PurposeTo find the relation between Horsepower and mileage (mpg) based on number of Cylinders. Visualization type : Scatterplot
Visualizations1. Microsoft ExcelDetail about the graph: X-axis (MPG) is in range of 5-50, Y-axis (Horsepower) is in range of 0-250 and dots are colored based on number of cylinder (i.e. 3,4,5,6,8) 2. DataWrapper.deDetail about the graph: X-axis (MPG) is in range of 0-50, Y-axis (Horsepower) is in range of 0-250 and dots are colored based on number of cylinder (i.e. 3,4,5,6,8) 3. R-Studio / GG-PlotDetail about the graph: X-axis (MPG) is in range of 0-49, Y-axis (Horsepower) is in range of 0-240 and dots are colored based on number of cylinder (i.e. 3,4,5,6,8) 4. Tableau desktopDetail about the graph: X-axis (MPG) is in range of 5-50, Y-axis (Horsepower) is in range of 0-240 and dots are colored based on number of cylinder (i.e. 3,4,5,6,8) 5. Power BIDetail about the graph: X-axis (MPG) is in range of 5-50, Y-axis (Horsepower) is in range of 45-240 and dots are colored based on number of cylinder (i.e. 3,4,5,6,8) |
Name: Debapriyo Saha The five visualization tools that are being used for plotting are the following:
Dataset used: The dataset contains details of different cars, along with it's different features and also it's miles per gallon (mpg) values. x-axis: Acceleration Color scale is used by Origin and Size of the bubbles are based on the acceleration value (more the acceleration larger is the size of each bubble) 1) Orange Plot2) DataWrapper Plot3) Plotly4) Seaborn & Matplotlib5) Flourish |
One Chart Using 5 ToolsName: Soumya V Namboodiripad Dataset: auto-mpg (Automobile Dataset) Variables Used: mpg, horsepower Type of Chart: Scatterplot 1. Excel2. Datawrapper3. Tableau Pubic4. Flourish5. Matplotlib |
Name: Manaswita Mandal The five visualization tools that I have used for my analysis are:
Data: The dataset contains details of different cars, along with it's different features and also it's miles per gallon (mpg) values. I tried to find some correlation between some of the variables to find whether they are affecting the mpg of different cars.
Following charts are plotted: Bubble Chart using Seaborn:Bubble Chart using Power BiBubble Chart using FlourishBubble Chart using GGPlotBubble Chart Using PlotlyCharts explanation:
Observation:
Inference:
|
Name: Kaushik V Fuel efficiency vs Model Year - How has the distribution changed and has it improved over the years?Observation from the chart: One can see the median fuel efficiency has significantly improved over the years and also the range band has gotten shorter, indicating, irrespective of the brand and specs, car manufacturers have tried to become more fuel efficient across categories. |
For this assinment, I used a dataset The 5 tools that I have used are following:
I learned that different tools are better for different jobs. Some are good for making simple charts fast, others are better for when you need to make something very detailed, and some are best when you want to put a chart on a website. Which tool to use depends on what we need the chart to do. |
Puravasu Jaideep Sesha Dataset : Data Variables used:
Tools Used:
VisualizationsI wanted to try out Datawrapper, but they do not offer boxplots, since they are not very well known to the general public. They offer alternatives like range plot to make up for it. 😊 |
Name: Ajeet Kumar, Scatter Plot: Relation between MPG (miles per gallon ) and Horsepower on the number of Cylinders Variable Used: x-axis: MPG Tools Used: Excel
|
Name - Chandana Nisankara I have chosen Mpg(miles per gallon) and weight from the data , to understand how a car's weight impacts its fuel efficiency. Heavier cars have lower Mpg due to increased energy requirements. Tools / Applications that i have chosen to represent the data are : |
Variables used
PlotsI have done a scatter plot on the given dataset with the three features mentioned above. The tools Matplotlib, Plotly and Flourish were very easy to use. In the case of Google Sheets, it was not possible to color the dots based on the value of the number of cylinders, this prompted the split of the feature in X-axis on the basis of number of cylinders |
Name: Priyanka Nathani Dataset details: My approach: Choice of graph: Tools used:
1. Excel 2. Tableau 3. Power BI 4. Datwrapper 5. Matplotlib 6. RawGraphs Chart Explanation:
Insights into tools:
2. Tableau
6. RawGraphs P.s. I had uploaded the graphs much before the deadline. I has missed the sentence that either excel or matplotlib can be used. I just learnt about it today. Therefore, today I uploaded the 6th graph using RawGraphs. I request you to kindly consider the submission of this graph as well. Thank you and Regards, Priyanka Nathani |
Name - Rajkishore Nandi The five visualization tools used are :
Variables Used :
Plot : Plotted scatter plot with Weight on X-axis, Mpg(Miles per Gallon) on Y-axis and colour gradient for Acceleration to find out the correlation between the three variables. GGPlot using R Plotly Power Bi Flourish Matplotlib |
Name: Sarthak KhandelwalRoll No.: 21f1004405Altair (Python): This scatterplot visualizes the relationship between acceleration and mileage (mpg) based on the number of cylinders using Altair, a Python library for statistical visualization. Each point represents a car in the dataset, with the X-axis showing horsepower, the Y-axis showing mileage, and colors indicating the number of cylinders. The plot provides a clear overview of how the mileage varies with acceleration across different cylinder configurations. Flourish: The Flourish scatterplot displays the correlation between acceleration and mileage (mpg) categorized by the number of cylinders in the car. Each data point represents a car, with acceleration on the X-axis, mileage on the Y-axis, and cylinder count indicated by color. The interactive nature of the plot allows users to hover over points for specific data values and explore how mileage relates to acceleration across different cylinder configurations. Power BI: The Power BI scatterplot illustrates the relationship between acceleration and mileage (mpg) based on the number of cylinders in the car. Each point on the plot represents a car in the dataset, with acceleration plotted on the X-axis, mileage on the Y-axis, and cylinder count represented by color. Users can interact with the plot to filter data or drill down into specific details, making it a versatile tool for exploring the relationship between these variables. Seaborn (Python): This Seaborn scatterplot visualizes the correlation between acceleration and mileage (mpg) categorized by the number of cylinders in the car. Each point represents a car, with acceleration on the X-axis, mileage on the Y-axis, and cylinder count indicated by color. The plot provides insights into how mileage changes with acceleration across different cylinder configurations, with Seaborn's built-in styling and aesthetics enhancing the presentation of the data.
Tableau: The Tableau scatterplot depicts the relationship between acceleration and mileage (mpg) based on the number of cylinders in the car. Each point on the plot represents a car in the dataset, with acceleration plotted on the X-axis, mileage on the Y-axis, and cylinder count differentiated by color. Tableau's intuitive interface allows users to explore the data dynamically, enabling interactive analysis and visualization of how mileage varies with acceleration across different cylinder configurations. |
Name : Trivikram UmanathRollno: 21f1005359Scatter Plot : Relationship between mpg and Weight and Coloured by Model YearTools Used- Matplotlib Variables used- Dataset details: My approach: Choice of graph: We observe as the Years pass the hypothesis is validated i.e with years cars become faster,lighter and stronger and we can see it clearly here. Here are the trends as per the Tools. 1)Matplotlib2)Seaborn3)Pandas Plot4)Flourish5)DataWrapper |
Name : Manish Kumar Draw a scatter plot between horsepower and acceleration using these different tools 1 Google sheet2 power BI |
Title: Scatter plot for vehicle Weight and MPG with the number of cylinders
Here we will see the plotting of the vehicle Weight and MPG (Miles per Gallon) with the number of cylinders in that vehicle by using scatter plot and observing the differences among different visualization tools for the same plot. Data Used: auto-mpg.csvFeatures Used:
Tools Used:
Here I have taken the dark backgrounds for all plots because as this is posted in GITHUB, it's entire background is in the dark. So, it will be visually compelling when someone observes the plot without much stress to the eyes. Also, the color for data points are chosen in a way that it can easily notice. 1. MatplotlibIn Matplotlib, the entire plot comes with very thick borders and labels. The data points are also a little blurry in nature. I have used the spring color palette for this plot. 2. SeabornIn Seaborn, the quality of the plot is very good. The data points are also in appropriate size with a decent opacity in the centers which makes the data points differentiable and can be noticed easily. I tried increasing the dpi in matplotlib but the image quality still looks the same but for the seaborn, it works perfectly fine. For seaborn also, I have used the spring color palette. 3. PlotlyIn Plotly, it mainly offers interactive plots. Although the quality of the plot looks a little low but the features that it has with the interactive zooming, box select, lasso select the data points that we are interested in. All these are visualization libraries that are mentioned above. All these libraries have the color palette inbuilt but most of the color palettes are with light color on one end and darker on the other end. As I have selected the darker background, light color palettes will be well suitable for my visualizations. Hence, I preferred the spring palette in Matplotlib and Seaborn. But unfortunately, there is no spring palette in Plotly. As I want to show the differences among various tools, I wanted it to be in the same palette for every tool. For this issue, I have taken the RGB values for the spring palette from Matplotlib and Seaborn and have used those values manually in Plotly and other tools that we are going to see next. 4. Google SheetsGoogle Sheets are mainly useful for quick and simple generated charts. In sheets, the scatter plot simply takes two variables and plots it with a single color. There is no direct option to consider the 3rd feature and color it accordingly. To achieve it, we have to split the mpg feature into multiple features according to the cylinders. 5. FlourishFlourish is well popular for visualizations. It is very easy to implement. Copy pasting the data in the desired visualization template and can customize the chart according to our needs. But still it offers only a limited number of visualizations. Conclusion: After these plotting in different tools and observed that if we are using visualization libraries, I would recommend Seaborn or if we want interactive visualization from the libraries, we can go for Plotly. From the visualization tools, google sheets can be useful only for some quick plots by analyzing or filtering the datasets. We can achieve some rough visualizations from Google Sheets and can apply those into bigger visualization tools like Flourish. If we are planning for more unique and different visualizations which are not offered by any of the tools above, we can go for some other tools like Tableau, Power BI etc. |
Bar Plot of Average MPG per Cylinder Type of a Vehicle using 5+ Charting Libraries/ LanguagesDetailsName: Kruthiventi M R S Sai Charan Approaches:1. Matplotlib(Python):
2. Bokeh(Python):
3. Plotly(Python):
4. Fourish(Web-Based Open Source):
5. Altair(Python): |
Name: Gokulakrishnan B Idea behind visualizationThe first thought that came to mind is that heavy weight vehicles consume more fuel to move, thus the mileage will be low. My intuition is that mpg is inversely proportional to weight. Higher the mpg, lighter the vehicle. So I am going to plot mpg vs weight scatter plot to observe the pattern and validate my assumption. G-sheetsLibreoffice CalcMatplotlibPlotlyAltair |
Name - Aditi Krishana Dataset Description Dataset Used - Automobile Dataset Given dataset had 9 columns:
Purpose: The goal is to explore the relationship between horsepower and mileage (mpg) based on the number of cylinders in the car. We want to understand how these factors are interconnected and whether the number of cylinders affects the fuel efficiency of the vehicle. Visualization type : Scatterplot Preprocessing Steps:
Data Summary:
This scatter plot shows the relationship between horsepower and mileage (mpg). |
Name: Mukesh K The dataset contains automotive fuel economy (in miles per gallon or mpg) and associated vehicle characteristics such as cylinders, displacement, horsepower, weight, acceleration, model year, origin, and car name. Features used: The following tools were used to plot the data containing engine displacement vs mpg (miles per gallon) |
Name : Abhishek Gupta For this task, I utilized the auto-mpg.csv dataset to generate a bubble chart across five distinct platforms. A bubble chart, akin to a scatter plot, employs circles to convey additional data. Specifically, I selected the variables of mpg (miles per gallon) and displacement (engine size) from the dataset. The size of the bubbles corresponded to the mpg values of the car engines. I have used these variables because they have high correlation. Flourish Data Wrapper: DataStudio Excel: Flourish: Matplotlib : |
Name : ADITYA DHAR DWIVEDI For this assignment, I used a dataset auto-mpg.csv to make a bubble chart with five different tools. I picked weight and displacement variables from the dataset, and the size of the bubbles was determined by the cylinders of the car engines. The 5 tools that I have used are following:
|
Name: HARSH BARDHANRoll No- 21F1004807In this comparative evaluation, we explore the relationship between 1. Excel Scatter Plot:Description: Excel offers a user-friendly interface for creating scatter plots. While it lacks advanced features, it's suitable for quick visualizations. 2. Flourish Scatter Plot:Description: Flourish is an online tool for creating interactive visualizations. It allows for more interactivity compared to other tools but requires uploading data. 3. Datawrapper Scatter Plot:Description: Datawrapper provides a simple yet effective way to create scatter plots. Its intuitive interface makes it easy to customize visualizations. 4. Seaborn Scatter Plot:Description: Seaborn, a Python library, offers a high-level interface for drawing attractive statistical graphics. It's particularly useful for exploring datasets with many variables. 5. Matplotlib Scatter Plot:Description: Matplotlib is a versatile plotting library for Python. It allows for fine-grained control over plot elements, making it suitable for creating publication-quality visualizations. These scatter plots provide insights into the relationship between the number of cylinders , acceleration, weight and displacement of automobiles, showcasing the strengths and features of each visualization tool. |
Name : Purva Sharma 1. About the Data: 2. Features Utilized: For this analysis, the following features were used: Weight: Represented on the x-axis. 3. Tools Used: 5 visualization tools were employed to analyze and visualize the data: |
Title: Miles per gallon vs Horsepower w.r.t. No. of cylindersX-axis : Miles per gallon The creation of visualizations involved preprocessing of the data which included removal of rows with null value in horsepower. The chart chosen is scatterplot on the following tools:
Here are the visualizations: 1. Microsoft Excel2. Flourish3. Tableau Public4. Datawrapper5. Power BI |
Variables Used:
Chart Type: Scatterplot
Purpose:To identify relationship between the Weight of the car and its corresponding MPG while considering the no. of cylinders in the car. Tools/Libraries used:
Visualizations:1) Flourish2) Datawrapper3) Tableau4) Matplotlib5) Seaborn |
With a plethora of both commercial & free visualization tools & libraries available, it can often be confusing to pick the right tool for your requirement. Also from the learning point of view, one doesn't know which tool or set of tools should invest time & effort in learning.
In her 2016 article "What I Learned Recreating One Chart Using 24 Tools", Lisa Charlotte Rost tried out 12 data vis applications and 12 data vis libraries and programming languages and reported a comparative evaluation.
In this assignment, you will recreate the exercise with at least 5 charting tools or libraries (total 5 not 5 each) for the given dataset (auto-mpg.csv). You may create any chart type, but using at least 2 variables from the dataset. Having decided on chart type & variables, repeat the same chart using the 5 chart tools or libraries. Paste your charts as a comment to this issue. Add text to each chart identifying the tool/library you used for the chart.
Note: You can only use one from Matplotlib, seaborn, and Excel.
The text was updated successfully, but these errors were encountered: