This is a collection of scripts for pulling out all the comments from a collection of CLAN files.
$: python clancomments2.py --dirs [list of directories with CLAN files]
$: python clancomments2.py --files [list of CLAN files]
$: python clancomments.py /path/to/folder output_file.csv [--subj-files]
The --subj-files flag is optional. Only use that flag if /path/to/folder is Subject_Files. Otherwise, leave it out when passing in a flat folder full of CLAN files.
This will generate a csv file with all the comments found in each .cha file.
$: python filter.py clancomments_output.csv
This will filter out all the comments except for the personal info comments and get rid of .bak versions of files.
The output file will have the same name/path as the input, except with "pinfo_filtered" tagged on at the end.
$: python pinfo_table.py unfiltered_comments.csv filter_py_output.csv
This will generate a table filled with different values based on the personal info status of every subject/visit file.
You should pass it the unfiltered comments csv and the output of the filter_pinfo.py script (all comments filtered down to just the personal info comments).