Use `ConjectureData.for_choices` #4219

tybug · 2024-12-29T19:37:46Z

Depends on #4217 (so test_draw_to_overrun finishes in a sane amount of time).

Here are three proposals for a replacement for ConjectureData.for_buffer on the typed choice sequence:

(1) pass the value of each choice:

    @classmethod
    def for_choices(
        cls,
        choices: Sequence[NodeTemplate | IRType],
        ...
    ):
        ...

(2) pass the value of each choice and its complexity index:

    @classmethod
    def for_choices(
        cls,
        choices: Sequence[NodeTemplate | tuple[IRType, int]],
        ...
    ):
        ...

(3) pass the value of each choice and its kwargs, noting that the complexity of a choice can be derived from its value + kwargs:

    @classmethod
    def for_nodes(
        cls,
        nodes: Sequence[NodeTemplate | IRNode],
        ...
    ):
        ...

I am proposing we use (1) in this pull. The main benefit is a simplified and unified API. It also makes it possible to construct a ConjectureData from a database entry, where choice complexity is not available. However, I am leaning towards encoding complexity alongside db values anyway, for more precise sorting, so this may be a nonissue.
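To make the shape of (1) concrete, here is a minimal sketch of replaying a sequence of plain choice values. `ToyData`, `Overrun`, and the `draw_*` methods are hypothetical stand-ins written for illustration. They are not Hypothesis's ConjectureData and skip everything except the replay itself.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Sequence, Union

# Toy stand-in for the value types in a typed choice sequence.
Choice = Union[int, bool]


class Overrun(Exception):
    """Raised when a draw is requested after the choices run out."""


@dataclass
class ToyData:
    """Hypothetical miniature; not Hypothesis's ConjectureData."""

    choices: Sequence[Choice]
    index: int = 0

    @classmethod
    def for_choices(cls, choices: Sequence[Choice]) -> ToyData:
        # Only the values are passed, as in proposal (1).
        return cls(choices=tuple(choices))

    def _next(self) -> Choice:
        if self.index >= len(self.choices):
            raise Overrun
        value = self.choices[self.index]
        self.index += 1
        return value

    def draw_integer(self) -> int:
        value = self._next()
        # bool is a subclass of int, so exclude it explicitly.
        assert isinstance(value, int) and not isinstance(value, bool)
        return value

    def draw_boolean(self) -> bool:
        value = self._next()
        assert isinstance(value, bool)
        return value


data = ToyData.for_choices([3, True])
first = data.draw_integer()
second = data.draw_boolean()
```

Because the caller supplies only values, the same constructor would work for any source that stores bare values, which is the database case mentioned above.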

The downside of (1) is that we cannot use choice complexity during drawing, in particular when correcting misalignments. This leads to 129da4b, which fills misalignments with the zero-index choice instead of using complexity as an intermediary. I think using NodeTemplate to opt in to controlling this misalignment behavior in the shrinker could be a cleaner path forward. In the meantime, the default and only behavior for correcting misalignments would be filling with the zero-index choice.
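As a rough illustration of filling a misalignment with the zero-index choice, the sketch below replays one integer draw. The function names and the rule that the simplest integer is 0 clamped into the allowed range are assumptions made for this toy. The real logic lives in the commits referenced above.

```python
from typing import Sequence, Union

# Toy stand-in for choice values; not the real set of choice types.
Choice = Union[int, bool, bytes]


def simplest_integer(min_value: int, max_value: int) -> int:
    # Assumption for this sketch: the "zero-index" integer is 0 clamped
    # into [min_value, max_value].
    return min(max(0, min_value), max_value)


def replay_integer(
    choices: Sequence[Choice], index: int, min_value: int, max_value: int
) -> int:
    """Return the recorded choice if it fits this draw, else the simplest one."""
    value = choices[index]
    is_int = isinstance(value, int) and not isinstance(value, bool)
    if is_int and min_value <= value <= max_value:
        return value
    # Misaligned: wrong type or out of range, so fall back to the
    # simplest permitted value instead of consulting any complexity index.
    return simplest_integer(min_value, max_value)


aligned = replay_integer([7], 0, 0, 10)
wrong_type = replay_integer([b"x"], 0, -3, 3)
out_of_range = replay_integer([50], 0, 5, 10)
```

Here `aligned` keeps the recorded value, while the other two draws are replaced by the simplest value their constraints allow.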

The shrinking benchmark is neutral.

@DRMacIver if you have any thoughts on this from a shrinking perspective I'd definitely be curious to hear them.

Zac-HD

LGTM overall, feel free to merge once the comments below are handled to your satisfaction.

Zac-HD · 2025-01-06T08:09:03Z

hypothesis-python/src/hypothesis/internal/conjecture/data.py

-            return value
-
-        value = node.value
+            return value  # type: ignore # mypy misses eliminating the NodeTemplate type


I'd be inclined to assert isinstance(...) rather than ignoring, since this is unlikely to be a noticeable perf hit.

assert isinstance(ChoiceT) is impossible and assert not isinstance(NodeTemplate) feels bad - I went for a slightly more roundabout type checking solution.
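The exact change is not shown in this thread. For readers unfamiliar with the trade-off, one common way to avoid both a `# type: ignore` and a negative isinstance assert is to handle the template case first and let the type checker narrow the rest. `Template` and `resolve` below are hypothetical names for illustration only, and this is not necessarily what was merged.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Union

# Toy alias standing in for the union of choice value types.
ChoiceT = Union[int, bool, float, str, bytes]


@dataclass
class Template:
    """Hypothetical stand-in for NodeTemplate."""

    kind: str


def resolve(choice: Union[Template, ChoiceT]) -> ChoiceT:
    if isinstance(choice, Template):
        # Template branch: this toy always fills in 0.
        return 0
    # After the early return, type checkers can narrow `choice` to ChoiceT
    # without an assert against the union alias.
    return choice


from_value = resolve(5)
from_template = resolve(Template("simplest"))
```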

tybug added 4 commits December 28, 2024 14:35

use choice_from_index(0) for misalignments

b895ba3

add and use ConjectureData.for_choices

0cfbd1b

add release notes

f34ff54

bump up a test's find_any budget

2199aa7

tybug requested a review from Zac-HD as a code owner December 29, 2024 19:37

tybug mentioned this pull request Dec 29, 2024

Use the typed choice sequence for @reproduce_failure #4220

Merged

Zac-HD approved these changes Jan 6, 2025

Zac-HD mentioned this pull request Jan 6, 2025

Add short-circuit for trivial collections #4217

Merged

tybug added 2 commits January 6, 2025 16:45

adjust typing, add trivial shrink test

1c6725b

consider zero+min in short_circuit, then just zero in run_step

fc95c0c

tybug requested a review from DRMacIver as a code owner January 6, 2025 21:49

Merge remote-tracking branch 'upstream/master' into for-choices

9698ca4

tybug merged commit f24f108 into HypothesisWorks:master Jan 7, 2025
47 checks passed

tybug deleted the for-choices branch January 7, 2025 00:40

tybug mentioned this pull request Jan 9, 2025

Migrate our core representation to the typed choice sequence #3921

Open

1 task
