Skip to content
This repository has been archived by the owner on Jul 1, 2018. It is now read-only.

Latest commit

 

History

History
32 lines (26 loc) · 1.06 KB

README.md

File metadata and controls

32 lines (26 loc) · 1.06 KB

uniQword

A simple program to count all the (unique) words contained in a text file and perform similar operations.

Instructions

  1. Download either uniqword.exe or uniqword.py and place it in the directory of the files you need it for.
  2. Run the script/executable.
  3. Add the file(s) or directory you need through the add_file and add_dir commands.

You can use the help command at any moment to learn about the functionalities of uniQword and how to use them.

Features:

  • Count and list all words.
  • Count and list unique words.
  • Frequency list for words.
  • Process multiple files at once.
  • Process multiple directories at once.
  • Print stats to file on demand.
  • Executable version.

Planned features:

  • Filter out function words, articles etc.
  • Recognise verb conjugations.
  • More statistical data on words.

Known limitations/bugs

  • Currently, words containing apostrophes will be counted as two separate words. Example: C'thulhu = C, thulhu.

Supported formats:

  • Plain text (.txt etc).
  • .pdf (including encrypted).
  • .docx.
  • .odt.