A lot of implementations of common algorithms for knowledge discovery from data.
Includes:
- Apriori algorithm for finding frequent itemsets
- Cosine and Okapi BM25 distance calculation to find similarity between documents
- Decision Tree implementation for... making decisions
- Naive Bayes implementation for making classifications as well based on probabilities
- Page Rank algorithm to find the order of "importance" of web pages, research papers, etc.