"get" and "getting" are handled inconsistently #1

Open
rspeer opened this issue Apr 9, 2010 · 0 comments
rspeer commented Apr 9, 2010

(from Launchpad bug list)

We have a number of passive concepts, such as "get fired" and "getting served". They should be treated differently from their active counterparts, "fire" and "serve". "getting served" normalizes to "get serve", which seems right, but "get fired" normalizes to "fire", which collapses it into the active form.

I suspect "get" is a stopword but "getting" isn't, and stopword removal happens before lemmatization, so "get" is dropped while "getting" survives and is then lemmatized to "get". Fixing this bug will require a test that clearly illustrates the desired behavior, lest it break again ;)
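
To make the suspected cause concrete, here is a minimal sketch of how the ordering could produce both results. Everything in it is hypothetical: the stopword set, the lemma table, and the function names are toy stand-ins, not this project's actual normalizer. It also assumes the desired result for "get fired" is "get fire", by analogy with "get serve".

```python
# Toy stand-ins for illustration only; the real stopword list and
# lemmatizer in this project may look quite different.
STOPWORDS = {"get"}
LEMMAS = {"getting": "get", "fired": "fire", "served": "serve"}


def lemmatize(word):
    """Look the word up in the toy lemma table; unknown words pass through."""
    return LEMMAS.get(word, word)


def normalize_stop_first(text):
    """Suspected current order: drop stopwords first, then lemmatize."""
    kept = [w for w in text.lower().split() if w not in STOPWORDS]
    return " ".join(lemmatize(w) for w in kept)


def normalize_keep_get(text):
    """One possible fix (an assumption): lemmatize every word and keep
    'get', so passive phrases stay distinct from their active verbs."""
    return " ".join(lemmatize(w) for w in text.lower().split())


# Reproduces the inconsistency described above.
assert normalize_stop_first("get fired") == "fire"
assert normalize_stop_first("getting served") == "get serve"

# Regression test for the assumed desired behavior.
assert normalize_keep_get("get fired") == "get fire"
assert normalize_keep_get("getting served") == "get serve"
assert normalize_keep_get("get fired") != normalize_keep_get("fired")
```

The last three assertions are the kind of test that would guard against a regression: both inflections of "get" end up in the same form, and the passive phrase never matches the bare verb.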
