GE P-file reader: adaptive character encoding #156

alexcraven · 2024-11-05T13:27:44Z

ge_read_pfile and ge_pfile assumed utf-8 encoding in character strings within the p-file; this does not appear to be standard across systems. Suggested patch attempts a few likely encoding candidates, before falling back on a permissive ascii encoding.

After initial detection, subsequent conversions are more forgiving of an incorrect detection (substituting unknown characters, rather than crashing).

This bugfix allows us to also quantify data acquired from the right (høyre) side, and should also handle local patient names better.

`ge_read_pfile` and `ge_pfile` assumed utf-8 encoding in character strings within the p-file; this does not appear to be standard across systems. Suggested patch attempts a few likely encoding candidates, before falling back on a permissive ascii encoding.

alexcraven · 2024-11-11T13:02:59Z

Test datasets available here; the first item (P30101.7) functions correctly before and after fix, remaining items (P3010[2-4].7) function only after the suggested fix.

GE_character_encoding_test_data.zip

Corresponding test data https://github.com/user-attachments/files/17702724/GE_character_encoding_test_data.zip expected under spec2nii_test_data/ge/pFiles/PRESS/MR30.1

alexcraven added 2 commits November 5, 2024 14:20

Fix lint errors in updated GE reader

97455bc

alexcraven and others added 3 commits November 11, 2024 14:58

Added non-English character tests to test_ge_pfile

d5b5b97

Corresponding test data https://github.com/user-attachments/files/17702724/GE_character_encoding_test_data.zip expected under spec2nii_test_data/ge/pFiles/PRESS/MR30.1

Update submodule for new test data.

391cfc5

Fix directorys tructure.

494169b

wtclarke merged commit 8a9430e into wtclarke:master Nov 11, 2024
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GE P-file reader: adaptive character encoding #156

GE P-file reader: adaptive character encoding #156

alexcraven commented Nov 5, 2024

alexcraven commented Nov 11, 2024

GE P-file reader: adaptive character encoding #156

GE P-file reader: adaptive character encoding #156

Conversation

alexcraven commented Nov 5, 2024

alexcraven commented Nov 11, 2024