In this challenge we were provided with two different datasets:
The first one is called PyBank in which we were tasked with creating a Python script to analyze the financial records of our company. We were given a financial dataset called budget_data.csv, which is composed of two columns: "Date" and "Profit/Losses", and from this data we obtained:
- The total number of months included in the dataset
- The net total amount of "Profit/Losses" over the entire period
- The changes in "Profit/Losses" over the entire period, and then the average of those changes
- The greatest increase in profits (date and amount) over the entire period
- The greatest decrease in profits (date and amount) over the entire period.
The second one is called PyPoll in which we were tasked with helping a small, rural town modernize its vote-counting process. We were given a set of poll data called election_data.csv, which is composed of three columns: "Voter ID", "County", and "Candidate", and from this data, we analyzed the votes and calculated each of the following values:
- The total number of votes cast
- A complete list of candidates who received votes
- The percentage of votes each candidate won
- The total number of votes each candidate won
- The winner of the election based on popular vote.
PyPoll has two scripts - ("main.py" and "poll_script_2.py"), both work perfectly. It is just that they have kind of different logic.