fix: StringT: re-support other string encodings #62

srl295 · 2024-04-06T02:22:48Z

Fixes Unsupported string encodings #60
Fixes Newest version of restructure doesn't support all string encodings fontkit#331 - allows fontkit's unit tests to pass
Probably fixes Version 3.0.1 causes react-pdf renderToStream to never return #61
add a test for 'x-mac-roman'
add two utf-16 alias names
if an encoding is otherwise unknown, assume 1-byte length
(this matches prior behavior)

History: #59 enabled support for 2-byte utf encodings, which were previously broken. So #59+#62 presents no regression for any other encodings.

srl295 · 2024-04-06T02:23:18Z

I'm still investigating what is happening here.

src/String.js

srl295 · 2024-04-06T02:24:45Z

src/String.js

@@ -95,9 +95,12 @@ function encodingWidth(encoding) {
  switch(encoding) {
    case 'ascii':
    case 'utf8': // utf8 is a byte-based encoding for zero-term string
+    case 'x-mac-roman':


fontkit uses a few others as well. Need fallback handling? Or is it an API usage issue?

As this issue can cause an infinite loop, I would expect either:

a fallback

an explicit exception if the value is not handled and will possibly cause an infinite loop

I don't think there's an infinite loop here. Perhaps the calling code or its callers have a loop?

There's an explicit exception on line 109

@renchap for now, I have it assume 1-byte, which matches previous behavior

- add a test for 'x-mac-roman' - add two utf-16 alias names - if an encoding is otherwise unknown, assume 1-byte length (this matches prior behavior) Fixes: foliojs#60

srl295 · 2024-04-06T22:54:31Z

FYI @blikblum @devongovett @mcdurdin

blikblum · 2024-04-07T01:11:58Z

This seems to fix the fontkit issue. The ideal would be a reproduction for the react-pdf issue, but i think is good to go

mcdurdin

As this issue can cause an infinite loop, I would expect either:

a fallback

an explicit exception if the value is not handled and will possibly cause an infinite loop

I did a careful analysis of the while loop. I could not find any possibility of an infinite loop. It was, in my understanding, the exception in encodingLength() that was causing the hang, my guess because it was being handled in another layer and causing the stream reader to never emit end-of-stream -- which I think could be considered a separate bug in that layer. However I have not yet had time to analyse at that level. I will have a go when I am online again (travelling all this week with limited online availability).

The changes look good to me, thank you @srl295 for jumping on this while I was incommunicado. Apologies to all for the regression.

mcdurdin · 2024-04-07T20:57:05Z

src/String.js

+      //TODO: assume all other encodings are 1-byters
+      //throw new Error('Unknown encoding ' + encoding);
+      return 1;


It would be nice if the consumer could be warned of the unknown encoding, but not sure that there is facility for that.

Note that the pattern of throwing on an unknown encoding is a pattern copied from existing code in this same module, specifically in StringT.byteLength():

restructure/src/String.js

Lines 150 to 151 in fcf7d64

default:

throw new Error('Unknown encoding ' + encoding);

So is there a latent issue in byteLength() that may bite future users? I would assume that it does not need to be addressed as urgently as this.

^ the throw may have only been on encode() or bytelength() before, which is a more restricted set of encodings.

x-mac-roman is actually a known encoding, as well, it's handled by the underlying encoder. So one thing that could be done is to expand the set of encodings here to be exhaustive.

AFAIK this PR brings back the previous behavior which is good enough. Lets do the minimal changes, unless there's a reproduction with an actual bug

src/String.js

srl295 · 2024-04-08T13:03:09Z

@blikblum thanks. and I assume this will get a patch release also? By the way 3.0.1 is absent from https://github.com/foliojs/restructure/releases

blikblum · 2024-04-08T14:10:36Z

I have write access to github but don't have npm publish permission. @devongovett can publish a new version

IvanUkhov · 2024-04-19T11:00:53Z

Thank you for fixing! Looking forward to a release.

karlhorky · 2024-06-13T16:58:09Z

@devongovett friendly ping, would it be possible to get a new release, eg. restructure@3.0.2?

This is causing some confusing breakage (indefinite hanging with custom fonts, but only some of them) in @react-pdf/renderer and fontkit:

Using custom font causes usePDF instance to indefinitely remain as 'loading' diegomura/react-pdf#2675
Newest version of restructure doesn't support all string encodings fontkit#331

devongovett · 2024-06-14T17:17:24Z

Done!

srl295 mentioned this pull request Apr 6, 2024

Using custom font causes usePDF instance to indefinitely remain as 'loading' diegomura/react-pdf#2675

Open

srl295 commented Apr 6, 2024

View reviewed changes

src/String.js Show resolved Hide resolved

srl295 commented Apr 6, 2024

View reviewed changes

fix: StringT: assume unknown encodings are 1-byte

fcf7d64

- add a test for 'x-mac-roman' - add two utf-16 alias names - if an encoding is otherwise unknown, assume 1-byte length (this matches prior behavior) Fixes: foliojs#60

srl295 force-pushed the encfix branch from cb2f2c3 to fcf7d64 Compare April 6, 2024 22:49

srl295 marked this pull request as ready for review April 6, 2024 22:49

srl295 changed the title ~~fix: work in progress on encodings~~ fix: re-support other string encodings Apr 6, 2024

srl295 changed the title ~~fix: re-support other string encodings~~ fix: StringT: re-support other string encodings Apr 6, 2024

srl295 requested a review from renchap April 6, 2024 22:51

mcdurdin approved these changes Apr 7, 2024

View reviewed changes

blikblum merged commit 17e062c into foliojs:master Apr 8, 2024
3 checks passed

srl295 deleted the encfix branch April 8, 2024 13:01

devongovett mentioned this pull request Jun 14, 2024

Newest version of restructure doesn't support all string encodings foliojs/fontkit#331

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: StringT: re-support other string encodings #62

fix: StringT: re-support other string encodings #62

srl295 commented Apr 6, 2024 •

edited

Loading

srl295 commented Apr 6, 2024

srl295 Apr 6, 2024

renchap Apr 6, 2024

srl295 Apr 6, 2024 •

edited

Loading

srl295 Apr 6, 2024

srl295 commented Apr 6, 2024 •

edited

Loading

blikblum commented Apr 7, 2024

mcdurdin left a comment

mcdurdin Apr 7, 2024

srl295 Apr 8, 2024

srl295 Apr 8, 2024

blikblum Apr 8, 2024

srl295 commented Apr 8, 2024

blikblum commented Apr 8, 2024

IvanUkhov commented Apr 19, 2024

karlhorky commented Jun 13, 2024

devongovett commented Jun 14, 2024

fix: StringT: re-support other string encodings #62

fix: StringT: re-support other string encodings #62

Conversation

srl295 commented Apr 6, 2024 • edited Loading

srl295 commented Apr 6, 2024

srl295 Apr 6, 2024

Choose a reason for hiding this comment

renchap Apr 6, 2024

Choose a reason for hiding this comment

srl295 Apr 6, 2024 • edited Loading

Choose a reason for hiding this comment

srl295 Apr 6, 2024

Choose a reason for hiding this comment

srl295 commented Apr 6, 2024 • edited Loading

blikblum commented Apr 7, 2024

mcdurdin left a comment

Choose a reason for hiding this comment

mcdurdin Apr 7, 2024

Choose a reason for hiding this comment

srl295 Apr 8, 2024

Choose a reason for hiding this comment

srl295 Apr 8, 2024

Choose a reason for hiding this comment

blikblum Apr 8, 2024

Choose a reason for hiding this comment

srl295 commented Apr 8, 2024

blikblum commented Apr 8, 2024

IvanUkhov commented Apr 19, 2024

karlhorky commented Jun 13, 2024

devongovett commented Jun 14, 2024

srl295 commented Apr 6, 2024 •

edited

Loading

srl295 Apr 6, 2024 •

edited

Loading

srl295 commented Apr 6, 2024 •

edited

Loading