Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Using a different dataset #12

Open
CatarinaPC opened this issue Jul 29, 2020 · 2 comments
Open

Using a different dataset #12

CatarinaPC opened this issue Jul 29, 2020 · 2 comments

Comments

@CatarinaPC
Copy link

Hello

I am trying to use a different dataset as input to the model.

What are the things I need to do?

I was trying to replicate the files that you have in the NYT_data directory but am confused with vec.bin and the first two columns in train.txt and test.txt.

Thank you in advance for any feedback you can give me

@RuiqingGao
Copy link

RuiqingGao commented Jul 17, 2021

I guess vec.bin is the model of word2vec, you should train the model by yourself.
but I have another question, that my word2vec model"model.save('w2v_50_50.bin') # 存储二进制模型文件" can not work correctly with extract.cpp.
look at the picture, wordTotal and word dimention are zero. so how can I save my bin format word2vec ?Can you help me ?
image

@RuiqingGao
Copy link

ooh, I solved my problem, by using other saving method of word2vec,

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants