Skip to content

Tookit-Sihui, a tool of some common algorithm, AI文本混合科学计算器(calculator-sihui), 句子词频-逆文本频率(TF-IDF),搜索BM25, 前缀树搜索关键词(trietree), 模板匹配-递归函数(func_recursive),中文数字转阿拉伯数字(chinese to number),阿拉伯数字转汉语数字, HMM, CRF

License

Notifications You must be signed in to change notification settings

yongzhuo/Tookit-Sihui

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Tookit-Sihui

tookit_sihui(代码主体,未完待续...)

- ml_common
  - BM25
  - TF-IDF
  - Trie-Tree
  - func_recursive
  - chinese_and_number
- task
  - calculate_sihui

run(运行)

- 1. 进入tookit_sihui/ml_common/tf_idf/目录,
        python tf_idf_freq.py

项目说明

  • ml_common
    • BM25(似乎有点问题)
    • TF-IDF(tf-idf, 可设置-保存tf和idf的文件)
    • Trie-Tree(前缀树,可实现人名-影视名等实体快速搜索)
    • func_recursive(递归,规则遍历生成句子)
    • chinese_and_number(中文汉字转阿拉伯数字,或者是阿拉伯数字转汉语数字,支持小数)
    • task
      • calculate_sihui(思慧计算器,AI智能文本计算器,支持从文本到计算结果的混合运算,还有指数运算,对数运算,阶乘等)

感谢|参考

About

Tookit-Sihui, a tool of some common algorithm, AI文本混合科学计算器(calculator-sihui), 句子词频-逆文本频率(TF-IDF),搜索BM25, 前缀树搜索关键词(trietree), 模板匹配-递归函数(func_recursive),中文数字转阿拉伯数字(chinese to number),阿拉伯数字转汉语数字, HMM, CRF

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages