Hello, I'm facing an issue with the CAMEL tools analyzer, specifically with the default character map.
For example, when analyzing the word "والتين" (and the fig), the disambiguator/analyzer reads the word as وآلتين (and two instruments), which is incorrect in my case.
Is there a different character map that I can pass as the norm_map argument of the analyzer? If CAMEL Tools does not provide a different CharMapper, how can I define one so that the letters are processed exactly as they are, without any normalization or conversion?
I really hope you could help me solve this issue which I've been facing for a while now.
Thank you so much in advance!
You can change the default CharMapper with the norm_map argument, but that will not fix this particular issue. Furthermore, norm_map specifies the normalization expected by the morphological database, so it shouldn't be changed for any of the databases we provide.
This is most likely a limitation of the disambiguation model (i.e., the model has seen very few instances of the word in that particular context, if any).
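To illustrate why swapping the character map does not help, here is a toy sketch in plain Python. It does not use the CAMEL Tools API: `ALEF_VARIANTS`, `normalize`, `TOY_DB`, and `lookup` are hypothetical names, and the sketch assumes the database keys were built with alef variants collapsed to bare alef.

```python
# Toy illustration only; none of these names are CAMEL Tools APIs.

# Assumed normalization: collapse alef variants (آ أ إ) to bare alef (ا).
ALEF_VARIANTS = {"\u0622": "\u0627", "\u0623": "\u0627", "\u0625": "\u0627"}


def normalize(word, char_map):
    """Map each character through char_map; unmapped characters stay as-is."""
    return "".join(char_map.get(ch, ch) for ch in word)


# A toy database whose keys were built WITH the normalization applied,
# so both readings end up under the same normalized key.
TOY_DB = {
    normalize("وآلتين", ALEF_VARIANTS): ["and two instruments", "and the fig"],
}


def lookup(word, char_map):
    """Normalize the input with char_map, then look it up in the toy database."""
    return TOY_DB.get(normalize(word, char_map), [])


# Default-style map: both spellings reach the same entry.
print(lookup("والتين", ALEF_VARIANTS))
# Identity map ({}): the already-normalized spelling still returns both readings,
# while the spelling with alef madda no longer matches any key.
print(lookup("والتين", {}))
print(lookup("وآلتين", {}))
```

Under these assumptions, an identity map never removes the unwanted reading for "والتين"; it only causes inputs that are not already in normalized form to miss the database.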
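As a rough picture of what such a limitation looks like, here is a toy frequency-based picker in plain Python. It is not the CAMEL Tools disambiguator: `TOY_COUNTS` and `pick_most_frequent` are hypothetical, and the counts are invented purely for illustration.

```python
from collections import Counter

# Invented counts of how often each reading was seen for one surface form.
TOY_COUNTS = {
    "والتين": Counter({"and two instruments": 5, "and the fig": 1}),
}


def pick_most_frequent(word):
    """Return the most frequently seen reading for word, or None if unseen."""
    counts = TOY_COUNTS.get(word)
    if not counts:
        return None
    return counts.most_common(1)[0][0]


print(pick_most_frequent("والتين"))
```

A picker like this always returns the majority reading regardless of the sentence, which is why the model type and the example sentence matter for diagnosing the problem.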
Can you tell us which disambiguation model you are using (MLE/BERT) and can you give us the example sentence this appears in?
We are working on a new option that takes the spelling of a word in the input into account (particularly input diacritics), which should help with such cases.
We don't have an exact timeline for this, but we'll notify you here when it's done.