What is summarization / is it working? #2183
-
We are using an adaptation of the "ConversationSummaryBufferMemory" strategy to summarize messages. To learn more, see this article: https://www.pinecone.io/learn/series/langchain/langchain-conversational-memory/. To summarize (lol), summarization is triggered when the following conditions are met:
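The general shape of a summary-buffer strategy can be sketched as follows. This is a minimal illustration, not LibreChat's actual implementation: the names (`Message`, `maybe_summarize`) are hypothetical, the word-count tokenizer is a crude stand-in for a real one, and the real strategy condenses dropped messages with an LLM call rather than concatenating them.

```python
# Hedged sketch of a ConversationSummaryBufferMemory-style trigger.
# All names here are illustrative, not LibreChat's real code.

from dataclasses import dataclass

@dataclass
class Message:
    role: str
    text: str

def count_tokens(text: str) -> int:
    # Crude stand-in for a real tokenizer (e.g. tiktoken): ~1 token per word.
    return len(text.split())

def maybe_summarize(messages, summary, max_tokens):
    """If the running token count exceeds max_tokens, fold the oldest
    messages into the summary and keep only the most recent ones."""
    total = count_tokens(summary) + sum(count_tokens(m.text) for m in messages)
    if total <= max_tokens:
        return summary, messages  # under budget: nothing to summarize

    kept = list(messages)
    dropped = []
    # Drop oldest messages into the summary until back under budget.
    while kept and total > max_tokens:
        oldest = kept.pop(0)
        dropped.append(oldest)
        total -= count_tokens(oldest.text)

    # The real strategy would have an LLM condense `summary` + `dropped`;
    # plain concatenation stands in for that call here.
    new_summary = " ".join([summary] + [m.text for m in dropped]).strip()
    return new_summary, kept
```

The key point for this discussion is the `max_tokens` threshold: it is what scales with the model's context window, which is why large-context models delay the trigger so long.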
This worked well in the age of 4-8k-context models, when it was first implemented, operating within the "efficient" realm described in the article. However, it needs to be revisited now that we are in the age of ever-increasing context windows (gpt-4-turbo at 128k, Anthropic at 200k+): a conversation now has to reach roughly 60-100k tokens before summarization kicks in. While this still trims the cost of using the full context, it's sub-optimal. I would also like to add an option in the config file letting the user decide what the summary context window should be, first at the endpoint level and then even at the model level.
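As a sketch of what that proposed option could look like in the config file, something along these lines. To be clear, these keys are hypothetical and are not existing librechat.yaml options:

```yaml
# Hypothetical sketch only — these keys do not exist yet.
endpoints:
  azureOpenAI:
    summarize: true
    summaryTokenThreshold: 8000    # endpoint-level: trigger well below the full window
    models:
      gpt-4-turbo:
        summaryTokenThreshold: 16000   # optional per-model override
```

An endpoint-level default with a per-model override would cover both the "cheap endpoint, summarize early" and "long-context model, summarize late" cases.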
-
I am using the Azure OpenAI models and have enabled summarization at the endpoint level. I am not sure, however, where exactly something is being summarized now that I have enabled it. At first I thought this referred to the automatic generation of chat titles, but those are all called "New Chat" for me. Where can I find the summarization feature, and how can I tell whether it works?