-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Graded Assignment -4 (May Term 2024):- Redesigning The Hindu Data Point Stories #31
Comments
---WIP--- Original Article: Share of Women across Employment Sectors (link)Summary: Key Insights:
The provided chart is a scatter plot with circles representing states, differentiated by regions through colors. It categorizes women workers into unpaid family members, informal/formal hired workers, and working owners in various sectors of unincorporated enterprises. The southern states are positioned towards the right, indicating a higher share of women in the workforce. Visual Critique:
Redesign IterationsPS: All charts below are interactive. Tooltips provide further information about the dataset Iteration-1In this attempt, I tested out Strip Plot on a mock, comparatively smaller dataset.
Iteration-2Here, I created Treemaps, one for each employment type, sorted by Percentage share of women.
Iteration-3Furthermore, I visualized the story in the form of a Heatmap.
Final IterationFinally, I improved upon the last iteration and presented the story using a Split Bar Chart
|
Name: Ashutosh Kumar Barmwal Documentation Redesign DocumentationStory the Author is Trying to TellThe author highlights a recent decline in economic confidence among urban households in India. The focus is on four key areas:
The narrative emphasizes that after a steady recovery post-COVID, confidence in the economy has recently dipped. Data Used to Tell the StoryData Details
Essential vs. Irrelevant Data
Visual Encoding and ProblemsCurrent Encoding
Improvements Attempted
Redesign Process
Redesigning Charts: Trying to get the data. Thank You, |
Name: Neeraj Yadav Main Story: Data Used: Type of Data: Quantitative data on student scores from the NEET UG 2024 exams. Current Visual Encoding: Chart 1: A scatter chart displaying the percentage of students scoring above 650 marks across different cities. Table 2: A table listing the top centers with the highest share of candidates scoring above 650 marks. Problems with Current Encoding: Scatter Chart: Redesigning the Visualization Improvement Plan: Simplify and Clarify: Create clearer, more intuitive charts that highlight key insights without overwhelming the viewer. Redesigned Visualizations: Bar Chart: Displaying the top cities with the highest share of students scoring above 650 marks. Redesigned Heat Map: Documentation Original Story: Link to the original story: NEET UG 2024: Data reveals top cities for high-scoring candidates Redesign Documentation: Bar Chart: The bar chart simplifies the data by focusing on the top cities, making it easier to compare their performance. These redesigned visualizations aim to improve the clarity and storytelling of the data, making it more accessible and easier to interpret for the audience. |
About MeName: Bhumika Taneja Original Article : Diseases with higher burden in Asia and Africa lack research fundingWhat is the author trying to convey with this story?The author highlights the significant disparity in research funding and attention between neglected tropical diseases (NTDs) and more prominent diseases like COVID-19, HIV/AIDS, tuberculosis, and malaria. Despite the massive burden these diseases place on impoverished populations in tropical and subtropical regions, they receive substantially less funding and resources. This underfunding perpetuates a cycle of poverty and disease, causing long-term disabilities, social stigma, and economic burdens that hinder development and deter investment in treatments. The article underscores the urgent need for increased funding and attention to NTDs to break this cycle and alleviate the suffering of millions. Key Points:
What data is he/she using to tell the story?The following charts are included in the original Hindu article. Plot 1: Research Funding by Disease in 2022 Type of Data: Extent of the Data: Dimensions of the Data Gaps in the Data: Essential Data: Plot 2: Research funding for different health technologies from 2007 to 2022. Type of Data: Extent of the Data: Dimensions of the Data: Gaps in the Data: Essential Data: I will be redesigning the second plot for the purpose of this assignment. Data Encoding and Potential ImprovementsCurrent Encoding
Problems with the Current Encoding
Suggested Improvements
Redesigned PlotPlot Details
Encoding Improvements
|
Name: Suraj ARS Original Article : On unemployment in Indian States Story of article in view of author The article provides an analysis of unemployment in major Indian states, excluding Union Territories, using data from the Periodic Labour Force Survey (PLFS) of 2022-23. It focuses on individuals aged 15 and above and highlights the disparities in unemployment rates across different states. Goa has the highest unemployment rate at almost 10%, followed by other relatively wealthy states like Kerala, Haryana, and Punjab. The analysis reveals that states with a higher proportion of self-employment have lower unemployment rates, and more urbanized states tend to have higher unemployment rates due to fewer informal job opportunities. The link between education and unemployment is also explored, showing that states with a higher percentage of educated individuals, such as graduates, tend to have higher unemployment rates, possibly due to a mismatch between skills and job requirements or because graduates aspire to high-wage jobs that are not available in sufficient numbers. Key Findings
Charts present in hindu article Chart 1: Umemployment across Indian States Type of Data: Quantitative data on Unemployment rates across Indian States 2022-23 Chart 2: Self Employment Vs Unemployment Type of Data: A scatter plot data of self employment vs unemployment Current Encoding Horizontal Bar Graph : Each State represented with bar about unemployment rate Problems with the Current Encoding Horizontal Representation : The horizontal representation of Indian States, making it challenging to recognize them. Suggested Improvements Vertical Representation : The vertical representation of Indian States, making it easier to recognize them. Redesigned Plot Plot Details Pie Chart: This reprsentation conveys top five states unemployment rates and rest of states fall under others classification. |
Assignment 4 Name - Tripti Arya Link to the Original Article: Nepal’s treacherous skies : With 741 plane crash deaths, country ranks 11 of 207 nations The Story Behind the chosen article Key Findings
Provided chart for better understanding Chart 2: Nepal and other countries in terms of Total fatalities due to Plane crashes. Description for Chart 1 and 2:
Chart 3: Fatalities in crashes against departures by air carriers in different countries Description for the above chart:
Chart 4: Number of air crashes and fatalities of each airline in Nepal Description for the above chart: 1.The y-axis lists different Nepalese airlines. Problems and Improvements
Improvements:
Redesign for given charts 1. Replacing Treemap(showing Fatalities data due to plane crashes) with Ordered Bar chart 2. Bar graph showing Fatalities over plane crashes Conclusion: |
Name: Kirupa Krishan GRoll_No: 21f1006352Story Overview:The article discusses the significant increase in the cost of a home-cooked vegetarian meal (thali) in Maharashtra over the last five years compared to the relatively modest salary rise between salaried and daily wage labourers. The key point is the growing disparity between food costs and income, highlighting the strain on households, especially those with daily wages. Source : Link Data Used:Type of Data:
Extent and Dimensions:
Gaps in the Data:
Data Details:
Data Encoding:
Problems and Improvements:Problems:
Improvements:
Chart 1:Chart 2:**Chart
|
Name: Prashant Sharma Title: Which topics are India’s researchers publishing papers on?Data Source: https://www.thehindu.com/data/which-topics-are-indias-researchers-publishing-papers-on/article68410121.ece 1: Story the Author is Trying to Tell:
2: Data Used to Tell the Story:
3: Encoding and Problems:
Existing Graphs: Chart 2 | The chart ranks the five topics under which the highest number of papers were published (2004- 2023) in select nations. Graph Additions: To make the intended story more impactful and easy to understand from comparision point of view: (1) Visualize the comparisons of overall research output of countries in the last 5 years and last 2 decades per category.
(2) Visualize country-wise contributions in research output per research category of top 5 research areas in last 5 years and last 2 decades.
(3) Visualize the comparisons of overall research output per top 5 research area in last 5 years and last 2 decades.
(4) Visualize the qualitative comparisons of overall research publication per category per country in the last 5 years and last 2 decades as heatmap.
(5) Visualize the publication contributions as % contribution in the research categories of top 5 research areas in the last 2 decades and in the last 5 years.
|
Name : Irshad Sareshwala Original Story :2024 polls: How people in high and low income areas voted in Chennai’s Mylapore, T Nagar and other areas(https://www.thehindu.com/data/2024-polls-how-people-in-high-and-low-income-areas-voted-in-chennais-mylapore-t-nagar-and-other-areas/article68427083.ece)
The original story aims to analyze the voting patterns in Chennai's Mylapore, T Nagar, and other areas based on income levels. It shows that the DMK has a stronghold among urban poor voters, while the BJP has better support among wealthier voters.
The data used in the story includes: Polling station data listing areas and polling stations. Type of data: Quantitative (votes, guideline values)
Original Encoding: The original visualization uses scatter plots with red and blue dots representing DMK and BJP vote shares, respectively, across different streets/areas. The scatter plot may not clearly show the relationship between income and voting patterns. Use a more intuitive visualization method, such as a bar chart or heatmap, to show the correlation between income levels and vote shares. Redesigning the visualization: Conclusion: |
Name : Rajesh Saha Subject : A green wealth tax in Budget 2024 Article Link :A green wealth tax in Budget 2024 Author's story:The author is proposing a green wealth tax for Indian Elite (top 10% of Indian Population in terms of CO2 emission). The author has also shown that this tax would decline over time, still the target would be achieved. To support this proposal, the author has shown 3 cases - 1) CO2 emission by Indian Elites comparing with developed countries and Indian average population, 2) How much money India needs to tackle IGD (Indian Green Deal), 3) How this can be achieved. Data used by author's :The author has used numerical data (in USD), categorical data (Elite Indians, Average Indians), Ratio (CO2 emission). The source of the data was not mentioned in the article. Data Collection for redesign:Data Collected from the above links and visualization. Data has been collected manually by hovering mouse at different data points. Redesign attemptFor redesigning purpose, all the above visualizations would be visited and then modifications would be done with the same data. 1. CO2 emission of Indian Elites comparing with developed countries and Indian average populationThe author has divided this into 2 parts - A) Comparison of CO2 emission among developed country, Indian Elite and Average Indian Elite, B) The sector-wise comparison of CO2 emission between Indian elite and average Indian population. 1A. Comparison of CO2 emission among developed country, Indian Elite and Average Indian Elite In the above, the story is clearly coming that how Indian elite is catching with the emissions in developed countries and how that is higher than average Indian. So, I have redesigned this in the same line. 1B. The sector-wise comparison of CO2 emission between Indian elite and average Indian population There are 2 problems here - 1) For ratio, the author has chosen different denominator for different sectors, 2) the relative comparison is not clearly shown. As a result, the sector-wise comparison is not coming out clearly (eg, it looks like Housing has the maximum contribution) and no clear comparison between Indian average and Elite is coming out. I have redesigned this as a column chart after normalizing the base at 1, ie, all the ratio has been shown as 1:x, where 1 is the emission by average Indian and x is the emission by elite Indian. This redesigned chart shows that "Health and Education" sector is the highest contributor by elite Indians in terms of ratio for the same by average Indian. 2. How much money India needs for IGDThe author has presented this in 2 different donut charts showing the expected investment money and the employment created. The problems with this design are that - 1) one has to refer 2 different charts for corelating, that is, there is no implicit co-relation is appearing in the chart, 2) color chosen could have been better. To redesign this, I have chosen bubble chart, where the investment money is being shown in X axis, job/employment created is shown on Y-axis. Along with that, more relevant color has been chosen, the size of the circle will give idea about employment created. 3. How can this be achievedThe author has projected the expenditure by Indian elites between the year 2023 and 2032 and shown that in line chart. In another line chart, the author has shown a declining rate of proposed wealth tax for the same period. To redesign this, I have created bubble chart with varying size and color of dots. This single chart shows 1) The declining tax rates for year 2023 to 2032, 2) The increasing bubble sizes shows the increasing expenditure by Indian elites, 3) The color of dot (Red to Green) shown that the money collected is increasing. The popup at each data point will show the year, tax-rate, expenditure, money collection. So, in this proposed concluding chart we can show that the wealth tax is justified and that can be reduced over years. Also, despite the reduced tax rate, the money collection will increase as the expenditure by Indian elite will also increase. Note:The visualizations were created using flourish. https://public.flourish.studio/visualisation/18879865/ |
Name: Arvind Gunasekaran Assignment 4 - REDESIGNING DATA STORY ARTICLE: India no longer has more losses than wins in Test cricketI. Story the Author is Trying to Tell
II. Data Used to Tell the Story
Conclusion: These in-depth observations, analyses and improvements contribute to a comprehensive redesign of the data story on the performance of The Indian Cricket Team in Test Cricket through history. Thanks, |
Name: S R Srinivasan By-polls: an indication of a new anti-incumbencyOriginal article: https://www.thehindu.com/data/by-polls-an-indication-of-a-new-anti-incumbency/article68413953.ece What is the story the author is trying to tell?• Soon after the results of the Lok Sabha (LS) elections were announced, bypolls to 13 Assembly Seats (AS) were held across seven states What is the need for the analysis?• The diversity of India poses a challenge in conducting quantitative and qualitative analysis on the people’s perception of the policies of the government Understanding the data that powers the storyWhat data he/she is using to tell the story? Describe its details -- type of data, extent of the data, dimensions of the data, gaps in the data, what data is essential and what is irrelevant.Type of Data:Constituency-wise data published by ECI Extent of Data:The data spans a few months in 2024, covering LS elections and AS bye-elections. The geograhpical extent is poor since only 13 AS seats are being looked. Dimensions of the Data:Each basic row represents the performance of an alliance in the constituency. Most of the features are categorical, with the vote share features being numeric Encoding of the Data:Due to the limited number of features, the data is presented in the tabular form, without any visual encoding. Essential Data:The essential data includes the State (for easier reading), Name of AS constituency, Party (to distinguish between alliance constituents), Vote share in the two elections being considered. The derived data is the change in vote share percentage between the two elections. With the table itself being small, no data is irrelevant. How is it encoded, what problems are with it, and how have you attempted to improve it?Gaps in the Data:This would be discussed more in later sections. The main gap is the low extent of the data. An extension to the previous LS and AS elections would provide a more representative analysis. Table 1 | The table shows the NDA parties’ vote share in the 2024 Lok Sabha elections and the Assembly elections.Source of image: https://www.thehindu.com/data/68414003-Chart-1-bypolls.svg Table 2 | The table shows INDIA parties’s vote share in the 2024 Lok Sabha elections and the Assembly elections.Source of image: https://www.thehindu.com/data/68414006-Chart-2-bypolls.svg Conclusions by the AuthorBased on the tables, the author concludes that “the overall trend that emerges from the bypoll results in 13 Assembly seats indicates a sharp decline in the BJP/NDA’s vote shares from the Lok Sabha polls held less than two months ago, as well as improvements for the INDIA parties.” He further provides the following subjective conclusions (as quoted).
Potential improvements using external data sourceIt is well known that the Indian electorate makes different choices between LS and AS elections. As an example, the state of Odisha holds simultaneous elections and the vote share of the parties has been different. To put this in a graphic way, the same elector pushes one button with her left hand (for the LS election) and a different button with her right hand (for the AS election). It is beyond the scope of this assignment to list the reasons for this. It suffices to say that the author himself acknowledges this fact in a previous article analysing the AS elections in 2023. For data analysis, it is trivial to see that extending the series to more LS and AS elections would provide a more complete picture. I did this by adding a feature on the vote share in the previous assembly elections – this would have been over different years. Due to lack of time, I could not add AS wise results from the 2019 LS elections – this data is often found in individual state Election Commission pages. I encoded the data as a grouped bar chart, with AS constituencies in the Y-axis. It Is easy to see that the trend is quite unclear over the 3 elections. The data is smal enough to be encoded as a simple table; I chose a more visual encoding. 3-election vote share data for NDA Blochttps://public.flourish.studio/visualisation/18884689/ 3-election vote share data for I.N.D.I.A Blochttps://public.flourish.studio/visualisation/18884884/ Partial conclusions from extended data
No clear conclusion is posisble even from the extended data. Even if the improvements in the next section are addressed, the limited geographical extents makes it impossible to extrapolate a country wide, or even a a state wide trend from the data. In parrticular, there is no data to suppor a 'new anti-incumbency' as claimed in the title of data point analysis. Further improvements in the analysis
|
Name : Souvik Bhattacharjee Main story: Data used: Current Visual Representation: Critics: Redesign: |
Name - Harsh Y Mehta Assignment-04 Title - Personal loans disbursed via digital apps have the highest share of overdue accounts Source Link Overview As a result, banks reduced loans to industries and recovered more bad loans, reaching a healthy state in 2024 with a decadal-low Gross Non-Performing Assets (GNPA). However, they shifted focus to retail loans, such as personal loans and credit cards, which grew significantly. Despite regulatory measures, the GNPA ratio for personal loans fell to 1.2% in March 2024. The RBI's Financial Stability Report highlights concerns about rising slippages and high delinquency levels among small borrowers with personal loans below Rs. 50,000, especially from NBFC-Fintech lenders. These issues indicate potential future problems, with the RBI now worried about individuals rather than industries. Glossary
Full Forms
Key Takeaways
Charts Chart 2 | The chart shows the GNPA (in %) across sectors. Chart 3 | The chart shows the bank-type wise split of the share of slippages from retail loans in the overall new additions of NPAs. The chart excludes slippages in home loans. Slippages are fresh additions of bad loans in a year. Chart 4 | The chart shows the bank type-wise delinquency levels for personal loans below Rs. 50,000. Redesign ChartSlippage& Delinquency across Banks type-wise The combined charts allow for easier comparison of related data in a single view. They save space and make the information clearer and more readable. This improves efficiency, making data analysis quicker and more straightforward. |
Name: N K Vamsi KrishnaRoll_No: 21f1003596Story Overview:The article discusses the significant shifts in voter support during the 2024 Assembly bypolls across 13 constituencies, highlighting a potential trend of anti-incumbency. The data indicates a notable decline in vote shares for the BJP-led NDA coalition and gains for the opposition INDIA bloc, suggesting a shift in voter sentiment that could influence future elections. Source: The HinduData Used:Type of Data:
Extent and Dimensions:
Gaps in the Data:
Data Details:
Data Encoding:
Problems and Improvements:Problems:
Improvements:
Original Visualizations:Visualization 1:Visualization 2:Redesigned Visualizations:Interactive Visualizations - Flourish Visualization 1: Vote Share Gains and Losses in 2024 Assembly BypollsExplanation:
The chart clearly illustrates the trend of declining support for the NDA and rising support for the INDIA alliance across most constituencies, highlighting significant shifts in voter sentiment. Visualization 2: Seat Distribution in the 2024 Assembly BypollsExplanation:
The chart effectively highlights the competitive nature of the bypolls, with seats distributed across multiple parties, indicating a diverse political landscape. ConclusionThe redesigned visualizations provide a clearer and more engaging way to understand the data story of the 2024 Assembly bypolls. The vote share comparison bar chart highlights the significant shifts in voter support, while the seat distribution donut chart gives a quick overview of the outcomes. These visualizations help to communicate the broader narrative of potential anti-incumbency sentiment and its implications for future elections. |
Name: Abir Subroto Chakraborty Title: By-polls: an indication of a new anti-incumbency (Link)Author: Prasenjit BoseBased on the data provided in the images and the brief overview from the article, here’s the analysis: 1. What is the story the author is trying to tell?The author is highlighting a significant shift in voter preferences, indicating a decline in the vote share for the BJP-led NDA in recent bypolls, while the opposition INDIA bloc has gained considerable ground. This shift is seen as a possible indication of growing anti-incumbency sentiment against the BJP. 2. What data is used to tell the story?The data consists of vote share percentages for both the NDA and INDIA bloc parties in various constituencies during the 2024 Lok Sabha elections and the subsequent 2024 Assembly bypolls. The key elements include:
Details of the Data Tables Provided:Table 1: NDA PerformanceThis table details the vote share of the NDA (BJP and JDU) in various Assembly Constituencies (ACs) across multiple states in the 2024 Lok Sabha (LS) elections and the subsequent 2024 Assembly (AS) bypolls.
Table 2: INDIA PerformanceThis table presents the vote share of the INDIA bloc (including AITC, INC, AAP, VCK/DMK, and RJD) in the same constituencies and elections as Table 1.
The tables illustrate a notable shift in voter preference from the BJP-led NDA to the opposition INDIA bloc in recent bypolls compared to the Lok Sabha elections. The data indicates significant gains for the INDIA bloc across multiple states and constituencies, suggesting a growing anti-incumbency sentiment against the BJP. 3. How is it encoded, what problems are with it, and how have you attempted to improve it?
In summary, the data underscores a notable shift in voter sentiment against the BJP, with the INDIA bloc gaining traction, pointing to possible national implications for future elections. Conclusion by the AuthorThe recent bypoll outcomes signify a notable decline in support for the BJP-led NDA, with significant vote share losses across multiple states, contrasting the gains made by the opposition INDIA bloc. This trend suggests a potential shift in the national political landscape, possibly reflecting growing dissatisfaction with the BJP. The author emphasizes that while local factors may play a role, the overall decline in the BJP’s vote share across various constituencies points to a broader anti-incumbency sentiment. The opposition's gains indicate a possible change in voter mood, favoring the INDIA bloc in upcoming elections. |
Name: Ashrey Article: A Green Wealth Tax in Budget 2024Story the Author is trying to tellThe author is presenting the idea of a wealth tax-financed Indian Green Deal (IGD) that aims to address climate change, inequality, and unemployment. The story argues that the wealth tax on the Indian elite can fund a comprehensive green energy, infrastructure, and care economy program, ultimately generating millions of jobs and reducing carbon emissions. Data Analysis1. Per Capita Carbon Footprint (Chart 1):
2. Expenses and Carbon Intensity of Commodities (Chart 2):
3. Expenditure and Employment Generation (Chart 3a & 3b):
4. Projected Wealth and Tax Rate (Chart 4a & 4b):
Gaps in the Data
Essential vs. Irrelevant Data
Redesigned Visualizations1. Per Capita Carbon Footprint
2. Expenses and Carbon Intensity
3. Projected Expenditure and Wealth Tax Rate
4. Expenditure and Employment Generation
Final Redesign |
Redesigning the NEET-UG 2024 Data Story from The Hindu Data PointName: Muskan Sindhu Original Article: Select “coaching hubs” are host to many high scoring NEET-UG-2024 candidatesOriginal StoryThe original article highlights the cities and centers with the highest share of students scoring 650 or more in the NEET-UG 2024 exam. The main focus is on the exceptional performance in specific cities, particularly Sikar in Rajasthan, and the implications of these high scores for securing admissions in government medical colleges. Story AnalysisThe author aims to highlight the top-performing cities and centers in the NEET-UG 2024 exam and discuss the implications of these scores for medical college admissions. The data used includes quantitative data on NEET-UG scores, segmented by city and center, with percentages of students scoring above specific thresholds (650 and 700 marks). However, the article lacks historical comparison data and does not delve deeply into the reasons behind the high scores in specific centers. Essential data include scores by city and center, percentages of high scorers, and absolute numbers of high scorers.
Visual Encoding and ImprovementsThe current visual encoding includes a table listing the top centers with the highest share of students scoring above 650 and a scatter plot showing the percentage share of students who scored over 650 marks by state. The scatter plot may be overwhelming due to the large number of data points, and the table lacks visual appeal and could be enhanced with better design elements. Additionally, the context and significance of the data points could be explained better. To improve this, I propose simplifying the scatter plot to focus on the top states and adding annotations for clarity. The table should be enhanced with visual elements like bar graphs to show comparisons more clearly. Providing historical comparison data would add context to the current year's results.
Redesign ProcessTo enhance the visualizations, I have focused on improving the scatter plot and the table visualization. For the scatter plot, I focused only on the top-performing states, using color coding to differentiate them and adding annotations to highlight significant data points. A clear legend was used to explain the color coding. For the table visualization, I converted it into a bar chart where each bar represents the percentage of students scoring above 650, with a secondary axis showing the absolute number of students scoring above 650. Color coding was used to differentiate between cities and centers. Final NotesThe redesigned visualizations provide clearer insights into the distribution of high NEET-UG scores across various centers and states. By using annotations, clear legends, and contextual data, the visualizations aim to make the story more compelling and informative. This assignment helped me understand the importance of clear and effective data visualization, aiming to make the data more accessible and engaging for the audience, providing them with a better understanding of the NEET-UG 2024 results. |
Name: Indumathi Kalla What data is used to tell the story? Quantitative data representing the budget allocations in monetary terms. The entire budget for the fiscal year 2024-2025. Sector names. Specific details about sub-sector allocations might be missing. Essential: Sector names, allocation amounts, percentage shares, historical data for comparison. The original article used heat maps to visually represent the data. Pie Chart for Industry Percentage: Clarity: Redesigned the budget allocation visualization using a pie chart to show the percentage share of each industry in the total budget. This provides a clear and immediate understanding of the distribution. Trend Analysis: Used stacked growth bars to represent the changes in budget allocations over time. This helps in understanding both the individual and cumulative growth of different sectors. Scope Expansion: Pie Chart |
Name: Syed Afrin Gowhar Assignment: 2024 polls: How people in high and low income areas voted in Chennai’s Mylapore, T Nagar and other areas 1. Story and Data Understanding Story Objective: Highlight how voting patterns in Chennai's Lok Sabha elections vary by income levels, with a specific focus on the DMK and BJP's vote shares. Data Details: The dataset includes vote shares for the DMK and BJP across various areas in Chennai, categorized by income levels. This data is critical for analyzing the correlation between income levels and voting preferences. 2. Analysis and Visualization Plan A. Overall Trends and State-Wise Breakdown Visualization 1: Visualization 2: B. Detailed Area-wise Analysis Visualization 1: Visualization 2: Summary of the Redesign Process:
|
Name: Harsehraab Singh Sarao Original Article : On unemployment in Indian States Story by the author:
What data he/she is using to tell the story?
Type of data:
Extent of the data:
Gaps in the data:
What data is essential:
Key Findings
Original Encoding:
Problems:
Redesigning the Visualisation
|
Name: SOORYAKIRAN B Unemployment remains a concern in India post-pandemicStory the Author is Trying to TellThe author aims to highlight the persistent issue of unemployment in India, especially in the aftermath of the COVID-19 pandemic, by illustrating how various individuals are struggling with joblessness and how the Labour Force Participation Rate (LFPR) and Unemployment Rate (UR) have changed over time. Data utilisedTable 1: Labour Force Participation Rate (LFPR)The LFPR in India has shown a significant decline post-pandemic, indicating that fewer people are either working or seeking employment.
Encoding and Problems:
Redesign of Table 1InsightsTable 1: Labour Force Participation Rate (LFPR)Insights:
Table 2: Unemployment Rate (UR)The unemployment rate in India has increased post-pandemic, with a noticeable difference between urban and rural areas and between genders.
**Encoding and Problems: **
Redesign of Table 2InsightsTable 2: Unemployment Rate (UR)Insights:
Table 3: LFPR and UR by QuarterBoth LFPR and UR show quarterly trends over the years, with noticeable fluctuations around the pandemic period.
Encoding and Problems:
Redesign of Table 3InsightsTable 3: LFPR and UR for Quarters Ending in SeptemberInsights:
Table 4: LFPR and UR by MonthMonthly trends in LFPR and UR, showing how these metrics change within a year.
Encoding and Problems:
Redesign of Table 4Insights from the storyTable 4: LFPR and UR for November MonthsInsights:
Insights from StoryThe data reveals a concerning trend of decreasing labour force participation and persistently high unemployment rates, especially among females and in urban areas. The impact of the COVID-19 pandemic worsen these issues, and the recovery appears to be slow, indicating a need for targeted employment initiatives. |
Maoist Setbacks in Chhattisgarh, 2024Story the Author is Trying to Tell Data Analysis
Gaps in the Data
Essential vs. Irrelevant Data 1. Year-wise Deaths of Maoists in Chhattisgarh 2. Deaths of Civilians, Security Forces, and Maoists Over the Years 3. District-wise Average of Maoist Deaths 4. District-wise Development and Welfare Indicators Summary of the Article: Final Redesign Regards, |
Name: Nikita Sharma 2024 polls: How people in high and low income areas voted in Chennai’s Mylapore, T Nagar and other areasObjective:The article explores the voting patterns in Chennai, focusing on how wealth/income levels affect party preferences, specifically for DMK and BJP. It analyzes voting data across different streets/areas with varying income levels. Data Used
Visual Encoding in Original Visualization
Design Process and Improvement
Redesign Proposal
Iterations:Step 1: Creating the base line chart with the vote share percentage.Step 2: Adding gridlines and helpful insightsStep 3: Including hover tooltips and improving the layoutConclusionThe redesign aims to provide a clearer and more accessible visualization of voting patterns based on income levels, enhancing the reader’s ability to understand the correlation and key insights. |
Name: Kruttika Milind Soni In 2024, Maoists suffer severe setbacks in ChhattisgarhThe aim of this article https://www.thehindu.com/data/in-2024-maoists-suffer-severe-setbacks-in-chhattisgarh/article68395649.ece Chart 1Type of data: Number of Maoist deaths EncodingNumber of deaths are represented clearly with a line with points signifying the values. Continuity is met. Chart 2Type of data: Number of deaths of civilians, security forces, insurgents EncodingRed graph for maoist deaths is good semantic encoding. 3 different graphs show different magnitudes. Table 3Type of data: average number of Maoist deaths EncodingRed for important districts pulled attention to high deaths in some districts. The year wise data showed change in numbers over the years. Table 4Type of data: percentage of people of a certain group falling in the particular developmental parameters. EncodingNot much encoding done here except red coloured cells showing worst affected districts. Red as a colour calls attention quickly on light backgrounds. RedesignIteration 1Decided Chart 1 was a good introduction to the article. Planned to convert Table 4 into a choropleth map visualisation with different maps for each parameter. Iteration 2Developed the heatmap for Table 3. Annotated and changed colour of chart 1. Iteration 3Scatter plot for Chart 4 with Districts represented by dots and sanitation and women's education factors used as axes. Size of point determined by child nutrition(stunting). The final redesigned visualisations for the tables are: Table 3Table 4
Other data that can be used in this article includes:
Software used: Flourish, Excel, matplotlib, folium |
Name - Jigyasa
Key points :
The Data : Type of data : Extent of the data: Dimensions of the data: Gaps in the data: What data is essential? Analyzing and Improving the Visual Encodings : Original Visualizations Problems and Improvements Map: |
Name: Sakiley Pranay Deep Article: Which topics are India's researchers publishing papers on? Story the author is trying to tell: The author is trying to convey the story of the prevailing research trends in scientific and technological fields based on publications in the Web of Science database, and comparing India's most researched topics with those from other developed nations like USA and China through visualizations. The article showcases the global scientific community's focus on topics such as coronavirus, artificial intelligence, clean energy, and nanotechnology, and how different countries prioritize and contribute to research in these areas. The author also seems to be aiming to show how research trends can guide policy decisions and resource allocation, highlighting the importance of these researches in addressing global challenges and advancing technological progress. Key insights from the data: Research focus shift in India: There has been some shift in research focus in India in recent years (2019-2023) compared to the long-term (2004-2023). In recent years, there has been a focus on coronavirus research and nanotechnology (nano fluids and silver nanoparticles), whereas the long-term focus has been on nanotechnology again and wireless sensor networks. Focus on Corona virus research: Corona virus research has been a global focus in recent years, as it is the most published topic in USA and India according to the data for 2019-2023. It is worth noting that Corona virus has not been in China's five most researched topics. Deep Learning prominence: Deep learning is a rising area of research prominence in all three countries, with China having the highest number of publications in this field. China's strength in material science: China has a consistent focus on material science, as evidenced by their high publication rates in photocatalysis and supercapacitors in both recent and long-term datasets. USA's focus shift in recent years: The USA has also shown a shift in research focus in recent years, moving away from long-term focus areas like HIV, parenting, and galaxies to focus on coronavirus research and deep learning. Data used to tell the story: Type of data: This is quantitative data (ratio data), focusing on the volume of research paper outputs. It consists of the number of published research papers categorized by topics in India, the USA, and China. Extent of the data: Dimensions of the data: Gaps in the data: Essential data: Irrelevant data: The data used in this story is concise and specific. Hence there is no irrelevant data. Analysis of the original encoding: In each visualization, three bar charts have been placed next to each other for comparison (one representing each country) that show the count of research papers published in the top five research fields. Problems with this encoding: The major problem with this visualization is inconsistent scaling across bar charts of different countries. For example, India's Corona virus that has 12629 publications has a bar of smaller length than USA's Gut microbiota that has 12435 publications, which can be misleading. Improvement proposal for the redesign: This encoding can be redesigned in a better way by using a single bar graph with precise scaling for all three countries for each timeline, with different countries represented in different colours. Thank you! |
Name: Saranya Nayak Which topics are India’s researchers publishing papers on?Source: https://www.thehindu.com/data/which-topics-are-indias-researchers-publishing-papers-on/article68410121.eceWhat is the story the author is trying to tell?The author highlights the research focus trends in India and globally over the last two decades, with a specific emphasis on the last five years. The story reveals that while coronavirus remains a predominant research topic worldwide, India's researchers are also significantly contributing to deep learning, photocatalysis, and nanotechnology. The article contrasts India's concentrated efforts in nanotechnology, partly driven by the Nano Mission, with China's focus on high-impact technological fields and the U.S.'s diverse research interests, particularly in health and social well-being What data he/she is using to tell the story?The author uses data from the Web of Science, a scholarly publication database, to analyze research trends over the last 20 years and the last five years. This data includes the number of published papers on various topics by researchers from different countries, allowing for a comparative study of the most researched topics globally and within specific nations such as India, the U.S., and China. The article also references specific research outputs and projects, like India's Nano Mission, to illustrate the focus areas and the volume of research in these fields. ###What data he/she is using to tell the story? Describe its details -- type of data, extent of the data, dimensions of the data, gaps in the data, what data is essential and what is irrelevant. Type of Data:
Essential vs. Irrelevant Data:Essential Data: How is it encoded, what problems are with it, and how have you attempted to improve it?Encoding: Problems with the Data: For example: in case of Deep learning one has to look at different parts of the graph to come to an conclusion. As deep learning positions are not in the same place in the graph. Solution: Group similar topics and countries data and visualize using proper chart. Design Iterations :Iterations 1: Used Treemap to group the data and first by countries and then by topic. Iterations 2 : Stacked bar chart does help in visualizing the and comparing the total number of research paper and total number of research don by topic Iterations 3: Clustered Bar Chart helps in grouping the data in terms of country and topic. One can easily comapire the amount of effort given to different research areas in different countries. Final: |
Name: Varun Balaji Title: "2024 polls: How people in high and low income areas voted in Chennai’s Mylapore, T Nagar and other areas" Objective: The article examines voting patterns in Chennai's 2024 Lok Sabha elections, comparing high and low-income areas to understand if income levels influenced voting behaviour. Main Points:
Analyzing the Data: Types of Data:
Extent of Data:
Data Dimensions:
Gaps and Relevance:
Visual Encoding: Current Encoding: Scatter plots showing vote shares (DMK in red, BJP in blue) across streets with varying guideline values. Problems Identified:
Proposed Improvements: Redesign Strategy:
Design Process Documentation: - Step 1: Initial Analysis
- Step 2: Simplifying Visualization
- Step 3: Enhancing Accessibility
- Step 4: Adding Interactivity
- Step 5: Including Additional Data
By following these steps, the redesigned visualization will maintain the original story's integrity while enhancing clarity and accessibility. Here are the redesigned scatter plots for the voting patterns in Nungambakkam and Kodambakkam, and Adyar and Guindy:
Key Takeaways:
Insights:
Future Improvements: To further enhance the data story, integrating interactive elements and additional datasets, such as demographic information, can provide deeper insights. Interactive visualizations using tools like Plotly or D3.js can offer users the ability to explore data points in more detail, fostering a more engaging and informative experience. Overall, the redesign maintains the original story's intent while significantly improving accessibility and readability, making the data more meaningful and actionable for a wider audience. |
Name: Sujasha S Publisher’s Data Summary Expenditure Data (Chart 2 and 3 above):
|
Name: Natasha Mittal Data source: https://www.thehindu.com/data/which-topics-are-indias-researchers-publishing-papers-on/article68410121.ece Story Intent Data Description
Visual Encoding
2. Improved Visual EncodingSimplified Charts:
Final ThoughtsThe redesign aims to improve user engagement, clarity, and depth of the original story by leveraging interactive visualizations and additional contextual data. This approach ensures that the main intent of showcasing research publication trends in India is maintained while providing a richer and more insightful user experience. |
Name: Pranam Premanand Pagi Original Article: MPs 27 times wealthier than an average urban householdAuthors: Vignesh Radhakrishnan, Sambavi Parthasarathy Story of article in view of authorsThe article highlights the wealth disparity between Members of Parliament (MPs) in India and the average urban household. It points out that MPs are significantly wealthier, with the majority possessing assets far above the typical urban or rural household. This wealth concentration suggests that election candidacies are often limited to affluent individuals. Data Used:
Gaps in the Data:The data focuses primarily on wealth, without contextual information about income sources, liabilities, or the potential impact of these wealth disparities on electoral outcomes. Chart 1 | The chart shows the median assets of winners and runners-up in 2019 and 2024. Chart 2 | The chart shows the median assets of candidates of the major political parties in 2024. Chart 3 | The chart shows the average value of household assets for different decile classes for rural and urban areas in 2019 (in ₹ 1000s). Analysis of the Original Visualization and Design ConsiderationsOriginal Visualization:The original story uses multiple charts to illustrate the wealth of MPs compared to urban households. These include:
Problems Identified:
Improvement Suggestions:
Redesigning the StoryObjectives:
Steps in Redesign:
|
NAME: FASHMINA MOHAMED The story: Data Used: Original Encoding: Problems in the original encoding: Redesign Proposal: The purpose of redesigning includes better accesibility to the data and for a better understanding. |
Name: Dhibin Vikash K P Article Tittle: Sikar, Namakkal, Kota: Select “coaching hubs” are host to many high scoring NEET-UG-2024 candidates Data Used: Type of Data: Quantitative data on student scores from the NEET UG 2024 exams. Current Visual Encoding: Chart 1: A scatter chart displaying the percentage of students scoring above 650 marks across different cities. Table 2: A table listing the top centers with the highest share of candidates scoring above 650 marks. Problems with Current Encoding: Scatter Chart: Improvement Plan: Create more apparent, intuitive charts highlighting key insights without overwhelming the viewer. Improve the existing tabular format with interactive charts that convey the information of the centers conducting the exam and the percentage of candidates scoring above 650. Redesigned Visualizations: Bar Chart: Displaying the top cities with the highest share of students scoring above 650 marks. Data on the NEET scores of candidates was taken from the official websites and the below charts were prepared. Redesigned Heat Map for state-wise distribution of scores >650: Redesigned Bar chart showcasing the centers along with percentage of students scored above 650: Documentation Original Story: Link to the original story: https://www.thehindu.com/data/neet-ug-2024-data-reveals-top-cities-for-high-scoring-candidates-crucial-for-government-medical-college-admissions/article68441411.ece Redesign Documentation: Map Visualization: Bar chart: |
For this assignment, we'll use data stories from The Hindu Data Point. Use what you have learned in Week 4 & Week 5 for doing this assignment.
Select a story that you like, study it carefully, and redesign it. Specifically, we want you to focus on understanding the data that powers the story, and how it is visually encoded to tell the intended story. Document your design process, capturing the following:
You may choose to expand or curtail the scope of the data used in the story or add an additional dataset to tell the story better. But do not deviate from the main intent of the original story. In other words, it is a redesign exercise, and hence I do not want you to tell a different, unrelated story.
While you should provide a link to the original story, it might be useful to capture and display inline, appropriate parts of the original visualization, and your own design iterations to produce coherent documentation.
For reference, take a look at what the previous batches (2019,2020,2021, 2022 )did with this assignment.
The text was updated successfully, but these errors were encountered: