Repeated line breaks when using JSON Logits Processor (vLLM) #715
LilianJim
started this conversation in
Weird model output
Replies: 1 comment 1 reply
-
The "workaround" is the solution to this; the default behavior of Outlines is to let the model choose the whitespace pattern but not all models seem to fare fine with that. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Describe the issue as clearly as possible:
When using a
JSONLogitsProcessor
to constrain the output, a bug that often occurs is the model generated line breaks (\n
) indefinitely.I don't think this is the expected behavior, and for now I've only witnessed it with Mixtral 7x8B
Steps/code to reproduce the bug:
reproduce.py
Expected result:
Error message:
Output of
python3 reproduce.py --model casperhansen/mixtral-instruct-awq --quantization awq
Outlines/Python version information:
Version information
Context for the issue:
Found a workaround by passing
whitespace_pattern=r"[\n ]?"
toJSONLogitsProcessor
, which seems to mitigate the issue. Despite this, the described default behavior is probably not wanted.Beta Was this translation helpful? Give feedback.
All reactions