Correctly fill nlp.prop_ner even with punctuation #112

ec-m · 2019-06-27T08:39:48Z

If I insert vanilla and chocolate one each then nlp.prop_ner is filled correctly with (('one', 'CARDINAL'),). However, if I instead write vanilla and chocolate, one each(i.e., simply adding punctuation to the sentence) nlp.prop_ner stays empty.

The text was updated successfully, but these errors were encountered:

josephbirkner · 2019-06-28T15:16:27Z

Thanks for writing this issue - Named Entity Recognition is definitely a big construction zone. It also fails mostly for NAME/LOCATION/ORGANIZATION if the input is not cased correctly. IMO this is also a big blocker for #96 . So we should really fix this asap!

josephbirkner · 2019-06-28T15:18:35Z

Fortunately, spacy provides easy extension mechanisms, especially for named entity recognition. If we use the en_medium NLP model, spacy provides word vectors, which we can match (with some tolerance) to named entities. For Cardinals, we can just detect cardinal words - that one should be easy to implement!

josephbirkner added this to the Inference milestone Jun 28, 2019

josephbirkner added the bug 🪲 Something isn't working label Jun 28, 2019

josephbirkner mentioned this issue Jul 16, 2019

Enhanced Rule-Based Inference State #96

Open

josephbirkner mentioned this issue Sep 9, 2019

prop_ner does not recognize "one" as a cardinal #135

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Correctly fill nlp.prop_ner even with punctuation #112

Correctly fill nlp.prop_ner even with punctuation #112

ec-m commented Jun 27, 2019

josephbirkner commented Jun 28, 2019 •

edited

Loading

josephbirkner commented Jun 28, 2019 •

edited

Loading

Correctly fill nlp.prop_ner even with punctuation #112

Correctly fill nlp.prop_ner even with punctuation #112

Comments

ec-m commented Jun 27, 2019

josephbirkner commented Jun 28, 2019 • edited Loading

josephbirkner commented Jun 28, 2019 • edited Loading

josephbirkner commented Jun 28, 2019 •

edited

Loading

josephbirkner commented Jun 28, 2019 •

edited

Loading