Released for EngageCSEdu publication and relicensed under CC BY 4.0.
README.md
- Rewrote most of the content to better serve instructors and students.
- Updated the specification with additional visuals and examples from CSE 143 Autumn 2020.
scaffold
- Vectorizer.BagOfWords now lower-cases all the words.
- Updated TextClassifier usage examples in the main methods of each class to auto-detect the number of TSV lines. This is useful for supporting student-generated input files.
- Scrubbed Windows Ctrl+M line endings from the spam.tsv file, which can throw off Scanner.nextLine.
- Renamed RandomSplitter to TestSplitter since that's what it's actually meant to do: it exists for testing, not for randomness.
- Removed all TextClassifierTest dependencies beyond JUnit 5, including removal of the solution requirement. Expected integration test outputs are now hard-coded as text files.
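To illustrate why stray carriage returns matter for line-based reading, here is a minimal sketch. It is not the scaffold's code: the class `TsvLineDemo`, both helper methods, and the sample text are hypothetical, and the sample assumes a carriage return sitting in the middle of a record, which may differ from what was actually in spam.tsv. `Scanner.nextLine` treats a lone `\r` as a line terminator, so such a record is split in two and any line count derived from the file comes out too high.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Scanner;

public class TsvLineDemo {
    // Hypothetical helper: counts lines exactly as Scanner.nextLine sees them.
    public static int countScannerLines(String text) {
        int count = 0;
        try (Scanner input = new Scanner(text)) {
            while (input.hasNextLine()) {
                input.nextLine();
                count++;
            }
        }
        return count;
    }

    // Hypothetical helper: removes every carriage return first, then
    // collects the non-empty lines so the caller can use size() as the
    // auto-detected number of TSV records.
    public static List<String> readLines(String text) {
        String scrubbed = text.replace("\r", "");
        List<String> lines = new ArrayList<>();
        try (Scanner input = new Scanner(scrubbed)) {
            while (input.hasNextLine()) {
                String line = input.nextLine();
                if (!line.isEmpty()) {
                    lines.add(line);
                }
            }
        }
        return lines;
    }

    public static void main(String[] args) {
        // Made-up sample: two records, the first with a stray \r inside it.
        String raw = "ham\tsee you\r at noon\r\nspam\tfree prize\r\n";
        System.out.println("Lines seen by Scanner: " + countScannerLines(raw));
        System.out.println("Lines after scrubbing: " + readLines(raw).size());
    }
}
```

Scrubbing the data file once, as this release does, avoids having to carry a workaround like `readLines` in student-facing code.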
Makefile
- Use pandoc to generate an instructor-guide.docx file from the README.md.
- Output to "materials".