GitHub - raquelbaeta/r-masters-datasets: This project focuses on cleaning and merging datasets — includes datasets related to cocaine seizures, government data, state lists, World Governance Indicators (WGI), and military expenditure (MILEX). The final cleaned dataset is saved in both CSV and RDS formats.

Project: Data Cleaning and Merging for Master's Research Report

This project cleans and merges multiple datasets into a single comprehensive dataset for a master's research report. The scripts in this repository clean, transform, and merge the following sources: cocaine seizure data, government data, state list data, World Governance Indicators (WGI) data, and military expenditure (MILEX) data.

To run the scripts in this repository, you need R installed on your system, along with the following R packages:

install.packages(c("readxl", "dplyr", "tidyverse", "countrycode", "fuzzyjoin"))
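
Re-running install.packages() reinstalls packages that are already present. As a minimal sketch (not part of the repository's scripts), the check below reports only the packages that are missing. The helper name missing_packages is hypothetical, and the install step is guarded so that it runs only in an interactive session.

```r
# Packages named in this README.
required <- c("readxl", "dplyr", "tidyverse", "countrycode", "fuzzyjoin")

# Hypothetical helper: return the packages in `pkgs` that are not installed.
missing_packages <- function(pkgs) {
  installed <- rownames(installed.packages())
  setdiff(pkgs, installed)
}

to_install <- missing_packages(required)

if (length(to_install) > 0) {
  message("Missing packages: ", paste(to_install, collapse = ", "))
  # Install only when run interactively, so batch runs never trigger downloads.
  if (interactive()) {
    install.packages(to_install)
  }
}
```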

Set the Working Directory

Update the working directory in the script to the location where you have placed your datasets:

setwd("~/path-to-your-directory/cleaning_data")

The script will perform the following operations:

(1) Install and load the necessary libraries

(2) Load and clean the individual datasets

(3) Merge the cleaned datasets

(4) Save the final merged dataset as .csv and .rds files
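
The clean-then-merge steps can be sketched as follows. This is an illustration under stated assumptions, not the repository's actual code: the data frames, column names (country, year, seizures_kg, milex_share), and values are all invented, and base R is used in place of dplyr so the example runs without extra packages.

```r
# Toy stand-ins for two source tables. All names and values are invented.
seizures <- data.frame(
  country     = c(" Country A", "Country B ", "Country C", "Country C"),
  year        = c(2010, 2010, 2010, 2010),
  seizures_kg = c(10, 20, NA, 30),
  stringsAsFactors = FALSE
)
milex <- data.frame(
  country     = c("Country A", "Country B", "Country C"),
  year        = c(2010, 2010, 2010),
  milex_share = c(1.0, 2.0, 3.0),
  stringsAsFactors = FALSE
)

# Hypothetical cleaning step: trim stray whitespace from the key column,
# drop rows with missing values, and remove exact duplicate rows.
clean_table <- function(df) {
  df$country <- trimws(df$country)
  df <- df[stats::complete.cases(df), , drop = FALSE]
  unique(df)
}

seizures_clean <- clean_table(seizures)
milex_clean    <- clean_table(milex)

# Inner join on the shared keys; merge() sorts the result by those keys.
merged <- merge(seizures_clean, milex_clean, by = c("country", "year"))
print(merged)
```

Cleaning the key columns before merging matters here: without trimws(), " Country A" and "Country A" would not match and the row would silently drop out of the inner join.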

Datasets

The following datasets are used in this project:

(a) cocaine_data_long.csv: Contains data on cocaine seizures.

(b) government_data.csv: Contains government-related data.

(c) statelist.csv: Contains data on various states.

(d) wgi_data.csv: Contains World Governance Indicators data.

(e) milex_data.csv: Contains military expenditure data.
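
Before running the scripts, it can help to confirm that all five input files are in the working directory. The sketch below does this with base R; the helper name missing_files is hypothetical and is not defined in the repository.

```r
# Input files listed in this README.
expected_files <- c(
  "cocaine_data_long.csv",
  "government_data.csv",
  "statelist.csv",
  "wgi_data.csv",
  "milex_data.csv"
)

# Hypothetical helper: return the expected files that are absent from `dir`.
missing_files <- function(dir, files = expected_files) {
  files[!file.exists(file.path(dir, files))]
}

absent <- missing_files(getwd())
if (length(absent) > 0) {
  message("Not found in ", getwd(), ": ", paste(absent, collapse = ", "))
}
```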

Scripts

data_cleaning_and_merging.R: Main script that performs data cleaning and merging operations.

The cleaned and merged dataset will be saved in the following files:

cleaned_data.csv: CSV format of the final dataset.

cleaned_data.rds: RDS format of the final dataset.
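
The two formats serve different purposes: CSV is portable across tools, while RDS preserves R-specific column types such as factors. A minimal sketch of saving and reloading both, assuming an invented data frame named final and writing to a temporary directory so that no real files are overwritten:

```r
# Invented example data; `final` stands in for the merged dataset.
final <- data.frame(
  country = c("Country A", "Country B"),
  year    = c(2010L, 2011L),
  region  = factor(c("X", "Y")),
  stringsAsFactors = FALSE
)

out_dir  <- tempdir()
csv_path <- file.path(out_dir, "cleaned_data.csv")
rds_path <- file.path(out_dir, "cleaned_data.rds")

# Write both formats.
write.csv(final, csv_path, row.names = FALSE)
saveRDS(final, rds_path)

# Read them back. The factor column survives the RDS round trip,
# but comes back as plain character from the CSV.
from_csv <- read.csv(csv_path, stringsAsFactors = FALSE)
from_rds <- readRDS(rds_path)

str(from_csv)
str(from_rds)
```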

Contributing

If you would like to contribute to this project, please follow these steps:

(1) Fork the repository.

(2) Create a new branch (git checkout -b feature-branch).

(3) Make your changes.

(4) Commit your changes (git commit -m 'Add some feature').

(5) Push to the branch (git push origin feature-branch).

(6) Create a new Pull Request.

If you have any questions or need further assistance, please open an issue or contact the repository maintainer.

Contact

For any inquiries, please contact raquel@aside.co.za.

Repository files (33 commits):

README.md
cleaned_data.csv.rds
cleaning_data.rds
cleaning_milex.R
cleaning_seizures.R
cleaning_statelist.R
cleaning_wb.R
cleaning_wgi.R
cocaine_data_long.csv
data.csv.rds
functions.R
government_data.csv
government_data.csv.rds
merge_data.R
milex_data.csv
milex_data.csv.rds
statelist.csv
statelist.rds
wgi_data.csv
