
This repository contains my submission for Task 1 of the Data Science Internship at Prodigy InfoTech.


AvanishVerma1703/PRODIGY_DS_01


Prodigy InfoTech Data Science Internship Task 1


Welcome to my submission for Task 1 of the Data Science Internship at Prodigy InfoTech. In this task, I performed Exploratory Data Analysis (EDA) on the provided dataset, focusing on visualizations that show the distribution of a categorical or continuous variable.

Dataset

The dataset used for this task is world_population_dataset. It contains population records for the years 2001 to 2022.

Tools and Libraries used

  • Jupyter Notebook
  • Pandas
  • NumPy
  • Matplotlib & Seaborn for visualization

Exploratory Data Analysis (EDA)

During the EDA process, I performed the following steps:

  1. Data Cleaning: Checked for missing values, duplicates, and outliers in the dataset and handled them accordingly.

  2. Visualization: Created a bar chart and a stacked bar chart to visualize the distribution of a categorical or continuous variable.
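
The two steps above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the column names (`country`, `year`, `population`), the tiny placeholder table, the helper functions `clean`, `iqr_outlier_mask`, and `plot_stacked`, and the choice of the IQR rule for outliers are all assumptions made for the example. The real dataset's layout may differ.

```python
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt

# Hypothetical stand-in for the dataset: names and values are placeholders,
# not real population figures.
raw = pd.DataFrame({
    "country": ["A", "A", "B", "B", "B", "C"],
    "year": [2001, 2022, 2001, 2022, 2022, 2001],
    "population": [10.0, 12.0, 20.0, 25.0, 25.0, np.nan],
})

def clean(df):
    """Drop exact duplicate rows, then rows with a missing population value."""
    return (
        df.drop_duplicates()
          .dropna(subset=["population"])
          .reset_index(drop=True)
    )

def iqr_outlier_mask(series, k=1.5):
    """Flag values outside [Q1 - k*IQR, Q3 + k*IQR] (assumed outlier rule)."""
    q1, q3 = series.quantile(0.25), series.quantile(0.75)
    iqr = q3 - q1
    return (series < q1 - k * iqr) | (series > q3 + k * iqr)

def plot_stacked(df):
    """Draw one bar per country, stacked by year, and return the Axes."""
    pivot = df.pivot_table(
        index="country", columns="year", values="population", aggfunc="sum"
    )
    ax = pivot.plot(kind="bar", stacked=True)
    ax.set_xlabel("Country")
    ax.set_ylabel("Population")
    ax.set_title("Population by country, stacked by year")
    return pivot, ax

cleaned = clean(raw)                                  # Step 1: data cleaning
outliers = iqr_outlier_mask(cleaned["population"])    # Step 1: outlier check
pivot, ax = plot_stacked(cleaned)                     # Step 2: visualization
```

In a Jupyter notebook the chart renders inline after the last cell line. In a plain script, call `plt.show()` or `ax.figure.savefig(...)` to see it. Passing `stacked=False` to `pivot.plot` gives a grouped bar chart instead.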

Conclusion

This EDA provided insight into the distribution of the selected variable in the dataset and lays the foundation for further exploration and modeling in the data science workflow.

Thank you for reviewing my submission!

📬 Contact

For any inquiries or feedback regarding this project, please contact:
