Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

适配其他国家的语言 #8

Open
DorisGM opened this issue Mar 15, 2019 · 2 comments
Open

适配其他国家的语言 #8

DorisGM opened this issue Mar 15, 2019 · 2 comments

Comments

@DorisGM
Copy link

DorisGM commented Mar 15, 2019

请问 如果要适配大多数其他国家的语言,要怎么做?

是要自己找到语料库,然后训练? 不同国家,分词方法还不一样吗?

@DorisGM DorisGM changed the title 适配其他国家的语烟 适配其他国家的语言 Mar 15, 2019
@jsksxs360
Copy link
Owner

你好,Word2Vec 是一个语言模型,词向量只是副产物。对于任意一种语言,词向量都需要通过在大量语料上训练来得到。分词主要针对中文(因为中文词与词之间没有分隔符),英语直接用空格分都分得差不多了。

@DorisGM
Copy link
Author

DorisGM commented Mar 18, 2019

好的 谢谢解答。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants