feat(autofix): Reproduction in root causes #1067

jennmueng · 2024-08-21T19:47:22Z

Introduces a reproduction step in the root cause output.
Uses GPT-4o with structured output for the root cause, for increased reliability.
(Tacked on) Adds the Claude agent back into the plan+code agent.

Eval results:

(Before is above, after is below)

roaga

Just some thoughts on code organization/cleanliness

src/seer/automation/autofix/components/root_cause/component.py

roaga · 2024-09-10T22:35:05Z

src/seer/automation/agent/client.py

+    def clean_tool_call_assistant_messages(self, messages: list[Message]) -> list[Message]:
+        new_messages = []
+        for message in messages:
+            if message.role == "assistant" and message.tool_calls:
+                new_messages.append(
+                    Message(role="assistant", content=message.content, tool_calls=[])
+                )
+            elif message.role == "tool":
+                new_messages.append(Message(role="user", content=message.content, tool_calls=[]))
+            else:
+                new_messages.append(message)
+        return new_messages
+


On this, I meant not just a helper function, but having any messages passed into the client for any completion cleaned behind the scenes to prevent the errors. That way external code doesn't have to choose whether or not to clean.

I don't think that's a good idea, given that OpenAI can expect a tool call and its response to be used in a specific way in the LLM input. We should use this workaround sparingly; in this case, only for the formatter.
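
For illustration, here is a minimal standalone sketch of what the helper in the diff above does to a message history before it reaches the formatter. The `Message` dataclass is a hypothetical stand-in for the project's own model (the real class may have more fields), the function is written as a free function rather than a client method, and the sample history is invented for the example.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Message:
    # Hypothetical stand-in for the project's Message model.
    role: str
    content: Optional[str] = None
    tool_calls: list = field(default_factory=list)


def clean_tool_call_assistant_messages(messages: list[Message]) -> list[Message]:
    """Return a new history with tool-call structure flattened away."""
    new_messages = []
    for message in messages:
        if message.role == "assistant" and message.tool_calls:
            # Keep the assistant text, drop the tool call metadata.
            new_messages.append(
                Message(role="assistant", content=message.content, tool_calls=[])
            )
        elif message.role == "tool":
            # Re-label the tool result as a plain user message.
            new_messages.append(Message(role="user", content=message.content, tool_calls=[]))
        else:
            new_messages.append(message)
    return new_messages


# Invented sample history: one tool call and its result.
history = [
    Message(role="user", content="Why does checkout fail?"),
    Message(role="assistant", content="Looking at the code.", tool_calls=[{"name": "read_file"}]),
    Message(role="tool", content="def checkout(): ..."),
    Message(role="assistant", content="The root cause is ..."),
]

# Only the formatter step gets the cleaned copy; the original history is untouched.
formatter_input = clean_tool_call_assistant_messages(history)
print([m.role for m in formatter_input])
```

Because the function builds a new list and new `Message` objects for the rewritten entries, the original history keeps its tool calls and can still be sent unmodified to completions that rely on the tool call/response pairing.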

jennmueng added 2 commits September 9, 2024 14:09

save

57ff931

save

8e49488

jennmueng force-pushed the jenn/autofix/root-cause-repro branch from e23ac2f to 8e49488 Compare September 10, 2024 16:54

jennmueng added 3 commits September 10, 2024 10:08

clean + update tests

c461e4e

add reproduction to test outputs

4718cb4

remove unused type ignore

45d678a

jennmueng marked this pull request as ready for review September 10, 2024 18:07

jennmueng requested a review from roaga September 10, 2024 18:07

roaga reviewed Sep 10, 2024


src/seer/automation/autofix/components/root_cause/component.py (outdated, resolved)

src/seer/automation/autofix/components/root_cause/component.py (outdated, resolved)

pr review response

fa69c68

jennmueng requested a review from roaga September 10, 2024 22:29

roaga approved these changes Sep 10, 2024


fix test

6ba5fa9

jennmueng merged commit c20a95e into main Sep 10, 2024
11 checks passed

jennmueng deleted the jenn/autofix/root-cause-repro branch September 10, 2024 23:14
