
[Feature Request] Prompt caching #1353

Open
gius opened this issue Nov 11, 2024 · 3 comments

gius commented Nov 11, 2024

Do you plan to make use of Prompt caching? It should make multiple commands for a single MR faster and cheaper.

https://openai.com/index/api-prompt-caching/
https://www.anthropic.com/news/prompt-caching


mrT23 commented Nov 14, 2024

According to the documentation, caching happens automatically.

[screenshot of the provider documentation stating that caching is applied automatically]


gius commented Nov 19, 2024

My use case is when you run multiple commands (/describe, /review, /improve) on a single MR. In that case, the part you want to cache is the MR diff.

As I understand the documentation, in the case of OpenAI you'd need all three commands to start with the same text (probably a shared intro followed by the diff) and move the command-specific instructions to the end of the prompt.
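
Roughly something like this with the OpenAI Python SDK (a sketch only; `mr_diff` stands for the shared diff, and the model name is just an example). Per OpenAI's docs, prefix caching kicks in automatically for prompts of 1024+ tokens, so the shared prefix has to be byte-identical across the calls:

```python
from openai import OpenAI

client = OpenAI()

# Shared prefix first: intro + MR diff. Command-specific instructions go last,
# so the long common prefix can be reused by automatic prompt caching across
# /describe, /review and /improve calls.
shared_prefix = (
    "You are a code review assistant. Here is the merge request diff:\n"
    + mr_diff  # hypothetical variable holding the MR diff
)

for instruction in [
    "Describe the changes in this merge request.",
    "Review the changes and point out potential issues.",
    "Suggest concrete code improvements.",
]:
    response = client.chat.completions.create(
        model="gpt-4o",
        messages=[
            {"role": "system", "content": shared_prefix},
            {"role": "user", "content": instruction},
        ],
    )
```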
For Claude, you need to put the diff into a separate block marked with "cache_control": {"type": "ephemeral"}.
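
For Claude it could look roughly like this with the anthropic SDK (again a sketch; `mr_diff` and the model name are illustrative, and depending on the API version a prompt-caching beta header may also be required):

```python
import anthropic

client = anthropic.Anthropic()

response = client.messages.create(
    model="claude-3-5-sonnet-20241022",
    max_tokens=1024,
    system=[
        {"type": "text", "text": "You are a code review assistant."},
        {
            # The large shared MR diff is its own block, marked ephemeral so
            # subsequent commands can reuse the cached prefix.
            "type": "text",
            "text": mr_diff,  # hypothetical variable holding the MR diff
            "cache_control": {"type": "ephemeral"},
        },
    ],
    messages=[
        {"role": "user", "content": "Describe the changes in this merge request."},
    ],
)
```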


mrT23 commented Nov 20, 2024

The prompt needs to be different for the different commands.

Even the code diff is presented differently (with/without line numbers, token cap, metadata, ...).

So to your question: we probably don't support automatic caching.

mrT23 added the answered label Nov 20, 2024