gltfpack: Implement end-to-end fuzzing #625

zeux · 2023-10-26T07:06:47Z

While individual components of gltfpack like cgltf or mesh encoders have
their own dedicated fuzzers, gltfpack has a lot of code that can be
difficult to validate. We would expect that gltfpack is resistant to
invalid inputs (with the exception, perhaps, of .obj inputs where
fast_obj, the library we use, is currently not fuzz safe).

This change implements an in-memory GLB loading flow (parseGlb) and an
in-memory fuzzer; we disable all file I/O by using NULL paths, and
otherwise share all the same processing code with the exception of final
buffer writing.
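
For readers unfamiliar with the container, here is a minimal sketch of the bounds checks an in-memory GLB reader has to perform before trusting any length field. The magic, version, and chunk-type constants come from the glTF 2.0 specification; `GlbChunks`, `readU32`, and `splitGlb` are hypothetical names for illustration and are not the PR's parseGlb, which presumably leaves the actual parsing to cgltf.

```cpp
#include <cstddef>
#include <cstdint>

// Illustrative only: names below are hypothetical, not gltfpack's API.
struct GlbChunks
{
    const uint8_t* json = nullptr;
    size_t jsonSize = 0;
    const uint8_t* bin = nullptr;
    size_t binSize = 0;
};

// GLB fields are little-endian; assembling bytes avoids unaligned reads.
static uint32_t readU32(const uint8_t* p)
{
    return uint32_t(p[0]) | (uint32_t(p[1]) << 8) | (uint32_t(p[2]) << 16) | (uint32_t(p[3]) << 24);
}

// Returns true and fills `out` if `data` is a structurally valid GLB container.
bool splitGlb(const uint8_t* data, size_t size, GlbChunks& out)
{
    const uint32_t kMagic = 0x46546C67;     // "glTF"
    const uint32_t kChunkJson = 0x4E4F534A; // "JSON"
    const uint32_t kChunkBin = 0x004E4942;  // "BIN\0"

    // 12-byte header: magic, version, total length
    if (size < 12 || readU32(data) != kMagic || readU32(data + 4) != 2)
        return false;

    size_t total = readU32(data + 8);
    if (total < 12 || total > size)
        return false;

    out = GlbChunks();
    size_t offset = 12;

    // Each chunk: 4-byte length, 4-byte type, then `length` bytes of data
    while (total - offset >= 8)
    {
        size_t length = readU32(data + offset);
        uint32_t type = readU32(data + offset + 4);
        offset += 8;

        if (length > total - offset)
            return false; // chunk claims more data than the file contains

        if (type == kChunkJson && !out.json)
        {
            out.json = data + offset;
            out.jsonSize = length;
        }
        else if (type == kChunkBin && !out.bin)
        {
            out.bin = data + offset;
            out.binSize = length;
        }

        offset += length;
    }

    return out.json != nullptr;
}
```

Every comparison is written as `length > total - offset` rather than `offset + length > total`, so a hostile length cannot overflow the addition.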

This is work in progress as fuzzing issues are being discovered and fixed.

Note that the dictionary doesn't contain many glTF extensions, but it's not clear that we need them: LLVM libFuzzer's mutator uses strings from the binary in addition to the supplied dictionary data. The GLB seed is taken from glTF-Sample-Assets BoxTextured; it is reasonably small and covers all the basic features, which makes it a good seed candidate.

To make sure that we don't spend too much time fuzzing the cgltf parser, we only add the files to the corpus if the parsing succeeded. As the comment explains, this is a tradeoff and it's not fully clear what is optimal here, but for now this seems to work okay.
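
One way to express this policy with libFuzzer is to return -1 from the entry point, which recent libFuzzer versions treat as "do not add this input to the corpus". The sketch below assumes that mechanism; whether the PR uses exactly this is not stated here. `PipelineResult` and `runPipeline` are hypothetical stand-ins for the real gltfpack processing, and the placeholder "parser" only checks for the GLB magic.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>

// Hypothetical result type; the real pipeline's return values are not shown in this PR.
enum class PipelineResult
{
    ParseFailed,
    Processed
};

// Placeholder for the real in-memory pipeline: treats any input that
// starts with the GLB magic "glTF" as successfully parsed.
static PipelineResult runPipeline(const uint8_t* data, size_t size)
{
    if (size < 4 || memcmp(data, "glTF", 4) != 0)
        return PipelineResult::ParseFailed;

    // ... processing with file I/O disabled would happen here ...
    return PipelineResult::Processed;
}

extern "C" int LLVMFuzzerTestOneInput(const uint8_t* data, size_t size)
{
    // -1 asks libFuzzer to reject the input so it is not kept in the corpus;
    // 0 means the input was handled normally and may be kept.
    if (runPipeline(data, size) == PipelineResult::ParseFailed)
        return -1;

    return 0;
}
```

The tradeoff is that rejected inputs no longer steer mutation toward parser edge cases, which is acceptable when the parser has its own fuzzer.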

zeux · 2023-10-26T23:18:58Z

This PR originally contained a few fixes to cgltf validation, but I've extracted them into upstream PRs: jkuhlmann/cgltf#233, jkuhlmann/cgltf#234, and jkuhlmann/cgltf#235. Once they are merged, this PR should result in a fairly clean fuzzing run. A couple of minor problems remain, and some missing alignment validation results in UBSAN failures (UBSAN is not enabled for the fuzzing config by this change), but things generally seem to be in good shape, at least with the default settings.

zeux added 6 commits October 26, 2023 16:13

gltfpack: Add early index validation to parseMeshesGltf

596baab

We technically do this validation during mesh processing, but it's better to flag any out-of-bounds indices earlier.
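
The check itself is simple; a minimal sketch, assuming indices are already decoded into a flat array and the vertex count is known. `indicesInRange` is a hypothetical helper name, not the code added to parseMeshesGltf.

```cpp
#include <cstddef>
#include <vector>

// Returns true if every index refers to an existing vertex.
// Hypothetical helper for illustration; not gltfpack's actual function.
bool indicesInRange(const std::vector<unsigned int>& indices, size_t vertexCount)
{
    for (unsigned int index : indices)
        if (index >= vertexCount)
            return false;

    return true;
}
```

Rejecting a mesh at load time means later stages can index vertex streams without re-checking bounds.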

gltfpack: Limit the number of non-target streams to 16

b84e83b

We currently have a limit of 16 streams in some stream-based processing algorithms, and custom attributes make it possible to exceed that limit, so for now we just drop the extra streams.
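
A minimal sketch of that policy, assuming each stream carries a flag saying whether it belongs to a morph target. The `Stream` struct, `kMaxStreams`, and `limitStreams` are hypothetical; how gltfpack actually represents and filters streams is not shown in this PR.

```cpp
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stream record for illustration only.
struct Stream
{
    std::string name;
    bool target; // true if the stream belongs to a morph target
};

const size_t kMaxStreams = 16;

// Keeps every target stream and the first kMaxStreams non-target streams,
// preserving order. Returns the number of streams that were dropped.
size_t limitStreams(std::vector<Stream>& streams)
{
    size_t kept = 0;
    size_t write = 0;
    size_t dropped = 0;

    for (size_t i = 0; i < streams.size(); ++i)
    {
        if (!streams[i].target && kept >= kMaxStreams)
        {
            dropped++;
            continue;
        }

        if (!streams[i].target)
            kept++;

        if (write != i)
            streams[write] = std::move(streams[i]);

        write++;
    }

    streams.resize(write);
    return dropped;
}
```

Dropping the extras silently loses data, so a caller would likely want to report the returned count as a warning.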

Add gltffuzz target for convenience

1f7db0f

The new target doesn't touch the top-level gltfpack executable, which makes it easier to work on gltfpack while the fuzzer is running, and it launches the fuzzer automatically.

zeux force-pushed the gltffuzz branch from f63993c to 1f7db0f Compare October 26, 2023 23:16

zeux merged commit 0415662 into master Oct 26, 2023
12 checks passed

zeux deleted the gltffuzz branch October 26, 2023 23:20
