meaning of min_freq parameter #4
It's the number of times a feature (e.g. a word, a bigram, a POS tag, a suffix, ...) should occur before it will be included in the model.
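To make the idea concrete, here is a minimal sketch of frequency-based feature filtering. It is not the library's actual implementation: the function name `filter_features`, the toy data, the choice to count over the whole training set, and the `>=` comparison at the threshold are all assumptions made for illustration.

```python
from collections import Counter


def filter_features(sequences, min_freq):
    """Return the features whose total count is at least min_freq.

    Assumption: counts are accumulated over the whole training set,
    and a feature is kept when count >= min_freq. The real library
    may treat the boundary differently.
    """
    counts = Counter(
        feature
        for sequence in sequences          # one labelled sequence (e.g. a sentence)
        for token_features in sequence     # the feature list of one token
        for feature in token_features
    )
    return {feature for feature, count in counts.items() if count >= min_freq}


# Hypothetical toy training data: two sequences of two tokens each.
training_data = [
    [["word=the", "pos=DT"], ["word=cat", "pos=NN"]],
    [["word=the", "pos=DT"], ["word=dog", "pos=NN"]],
]

kept = filter_features(training_data, min_freq=2.0)
print(sorted(kept))
```

With this toy data, `word=cat` and `word=dog` each occur once and are dropped at a threshold of 2.0. Under the `>=` assumption, thresholds of 0 and 1 keep the same set, because every observed feature occurs at least once.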
Thank you very much, awesome answer!
So I understand that the values 0 and 1 for min_freq behave the same, right?
Please do not cross-post.
This question is not really related to your R code, but maybe you know the answer, so let's try :)
I was wondering what the min_freq parameter means. The documentation says it's a float, so I was always convinced it was a number in the range [0, 1] (a percentage), but then I see you use 5.0.
Is it then the absolute frequency of a feature (i.e. the number of times the feature appears in the training data)?
Does the threshold apply to the entire training set or per document?
Thanks!