Skip to content

Latest commit

 

History

History
9 lines (7 loc) · 429 Bytes

README.md

File metadata and controls

9 lines (7 loc) · 429 Bytes

ACTib-word-frequencies

Calculates word frequencies in ACTib segmented corpus

make download to download the whole corpus. frequencies.py populates output/ with:

  • a folder per collection containing a frequency file per volume in the collection
  • one file per collection that adds up frequencies of all files in a given collection
  • total_freqs.txt which contains the general frequencies for the whole of ACTib.