I came to this project for spelling correction of Twitter text, but it doesn't work most of the time.
Spell correction seems to work only when annotate is set as in the example.
Take the same example, set annotate={}, and spell correction is gone:
i saw the new john doe movie and it suuuuucks ! ! ! waisted <money> . . . bad movies <annoyed>
If I restore annotate={"hashtag", "...}, it corrects suuuuucks to sucks.
I'm not sure what the connection is between annotations and spell correction.
Spelling correction doesn't work in general. Going back to your pipeline example again, change the first input sentence to inject some spelling errors: CANT WAIT for the neww seaason of #TwinPeaks
Run it and you get: cant wait for the neww seaason of twin peaks
i.e. no spell correction.
The spell_correct_elong option doesn't seem to make a difference.
Yet if I run the corrector directly:
from ekphrasis.classes.spellcorrect import SpellCorrector
sp = SpellCorrector(corpus="english")
print([sp.correct(x) for x in "neww seaason".split()])
It corrects: ['new', 'season']
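Since calling the corrector directly does fix these words, one possible stopgap is to run it over the pipeline's output tokens yourself. The sketch below is only an illustration of that idea, not how ekphrasis works internally: correct_tokens is a hypothetical helper, and demo_correct is a stand-in so the snippet runs without ekphrasis installed. In practice you would presumably pass sp.correct from the snippet above instead.

```python
import re
from typing import Callable, Iterable, List

# Tokens such as <money> or <annoyed> are tags emitted by the pipeline, not words.
TAG = re.compile(r"^<[^<>]+>$")


def correct_tokens(tokens: Iterable[str], correct: Callable[[str], str]) -> List[str]:
    """Apply `correct` to plain alphabetic tokens; leave tags and punctuation alone.

    Hypothetical helper, not part of ekphrasis.
    """
    out = []
    for tok in tokens:
        if TAG.match(tok) or not tok.isalpha():
            out.append(tok)           # tag, punctuation, number: keep as-is
        else:
            out.append(correct(tok))  # ordinary word: run the corrector
    return out


# Stand-in corrector for the demo (assumption: replace with sp.correct).
demo_fixes = {"neww": "new", "seaason": "season"}


def demo_correct(word: str) -> str:
    return demo_fixes.get(word, word)


tokens = "cant wait for the neww seaason of twin peaks".split()
print(" ".join(correct_tokens(tokens, demo_correct)))
# prints: cant wait for the new season of twin peaks
```

This only works around the symptom; it doesn't explain why the corrector isn't applied inside the pipeline.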
stas00 changed the title from "spelling correction mostly not working" to "spelling correction mostly is not working" on Mar 14, 2020