Mack Campbell
mack.campbell@pitt.edu
04/25/2023
This project will quantify how male and female characters are represented in the movies of the Cornell Movie-Dialogue Corpus, highlighting both similarities and differences as they arise in each interaction. The corpus covers movies from 1927 through 2010, so I will also look at diachronic variation.
The data come from the Cornell Movie-Dialogue Corpus and can be found in this repo here. The corpus has a README to help understand its structure and contents.
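As a rough illustration of working with the corpus, here is a minimal sketch of counting characters by gender label. It assumes the character metadata layout described in the corpus README: one character per line, fields separated by ` +++$+++ `, with gender as the fifth field and `?` marking unlabeled characters. The function names, the field names, and the in-memory sample lines are hypothetical and only show the shape of the data; this is not the code used in the notebooks.

```python
from collections import Counter

# Field separator assumed from the corpus README.
SEP = " +++$+++ "

# Hypothetical field names for one line of character metadata.
FIELDS = ["character_id", "name", "movie_id", "movie_title", "gender", "credit_position"]


def parse_character_line(line):
    """Split one metadata line into a dict keyed by FIELDS."""
    parts = line.rstrip("\n").split(SEP)
    return dict(zip(FIELDS, parts))


def count_genders(lines):
    """Count characters per gender label, lower-casing labels so 'F' and 'f' match."""
    counts = Counter()
    for line in lines:
        record = parse_character_line(line)
        counts[record["gender"].strip().lower()] += 1
    return counts


# Illustrative sample lines in the assumed format (not read from the repo).
sample = [
    "u0 +++$+++ BIANCA +++$+++ m0 +++$+++ 10 things i hate about you +++$+++ f +++$+++ 4",
    "u1 +++$+++ BRUCE +++$+++ m0 +++$+++ 10 things i hate about you +++$+++ ? +++$+++ ?",
    "u2 +++$+++ CAMERON +++$+++ m0 +++$+++ 10 things i hate about you +++$+++ m +++$+++ 3",
]

gender_counts = count_genders(sample)
```

To run the same count over the full corpus, the list of sample lines could be swapped for the lines of the character metadata file under the data directory, opened with whatever encoding the file actually uses.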
- final_report.md is the write-up of my final report
- data/ houses the original Cornell Movie-Dialogue Corpus Data
- new_data/ contains my re-working of the data as .csv files
- data_visualization/ has .png files of all graphs
- dataframe_notebooks/ has .jnb files cleaning and working with the different data sets
- analysis_notebooks/ contains .jnb files compiling and analyzing the data
- LICENSE.md is the license for sharing the data and code in this repo
- project_plan.md is my initial project plan
- progress_report.md contains my progress on the project throughout the semester
You can visit my guestbook here.