fix error when handling the extended ASCII codes in response output #15

yiqingzhang · 2016-12-15T05:36:03Z

I find out that when supplying a short name text with extended ASCII code (e.g., "Arès Méroueh") will generate an error in the results's 'originalText' tag. The reason is that requests lib will guess the response's encoding. In the case of "Arès Méroueh", it will give result like ISO8859-1, which is not true. According to the standford coreNLP. The default response encoding is utf-8. Adding this line( r.encoding = 'utf-8') will eliminate error like this.

In order to make the code more robust, one may change the interface and allow user to specify response encoding.

fix error when handling the extended ASCII codes in response output

b8d697a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix error when handling the extended ASCII codes in response output #15

fix error when handling the extended ASCII codes in response output #15

yiqingzhang commented Dec 15, 2016 •

edited

Loading

fix error when handling the extended ASCII codes in response output #15

Are you sure you want to change the base?

fix error when handling the extended ASCII codes in response output #15

Conversation

yiqingzhang commented Dec 15, 2016 • edited Loading

yiqingzhang commented Dec 15, 2016 •

edited

Loading