-
Notifications
You must be signed in to change notification settings - Fork 208
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
meaning of min_freq #107
Comments
According to the doc: The example: A particular feature will be removed if it appears only once. Knowing how CRF works, it only makes sense at the dataset level. Check the behavior by inspecting the model. Run the command to get the model in text format: A feature that appeared only once in |
Thank you very much, awesome answer |
Sorry not sure I understand. You say that they are omitted if the value is no greater than VALUE. If for 0 the ones appearing once are not removed, for 2 the ones appearing twice are also removed? Maybe it's when value is no greater or equal than VALUE? but then 1 and 0 would be the same... |
Hi, was wondering what's the meaning of min_freq param, as in the documentation says it's a float. So I was always convinced it was a number in the range [0,1] (a percentage) but then I've seen other examples with e.g. =5.0
Is that then the absolute frequency of a feature? (e.g. the number of times a feature appears in the training data)
Is it a requirement for the entire training set or per document?
Thanks!
The text was updated successfully, but these errors were encountered: