The graphrag-toolkit provides a GraphRAGConfig object that allows you to configure the LLMs and embedding models used by the indexing and retrieval processes, as well as the parallel and batch processing behaviours of the indexing pipelines. (The graphrag-toolkit doesn't use the LlamaIndex Settings object: attributes configured in Settings will have no impact on the graphrag-toolkit.)
The graphrag-toolkit also allows you to set the logging level and apply logging filters from within your application.
GraphRAGConfig allows you to configure LLMs, embedding models, and the extract and build processes. The configuration includes the following parameters:
| Parameter | Description | Default Value | Environment Variable |
|---|---|---|---|
| extraction_llm | LLM used to perform graph extraction (see LLM configuration) | anthropic.claude-3-sonnet-20240229-v1:0 | EXTRACTION_MODEL |
| response_llm | LLM used to generate responses (see LLM configuration) | anthropic.claude-3-sonnet-20240229-v1:0 | RESPONSE_MODEL |
| embed_model | Embedding model used to generate embeddings for indexed data and queries (see Embedding model configuration) | cohere.embed-english-v3 | EMBEDDINGS_MODEL |
| embed_dimensions | Number of dimensions in each vector | 1024 | EMBEDDINGS_DIMENSIONS |
| extraction_num_workers | The number of parallel processes to use when running the extract stage | 2 | EXTRACTION_NUM_WORKERS |
| extraction_num_threads_per_worker | The number of threads used by each process in the extract stage | 4 | EXTRACTION_NUM_THREADS_PER_WORKER |
| extraction_batch_size | The number of input nodes to be processed in parallel by all workers in the extract stage | 4 | EXTRACTION_BATCH_SIZE |
| build_num_workers | The number of parallel processes to use when running the build stage | 2 | BUILD_NUM_WORKERS |
| build_batch_size | The number of input nodes to be processed in parallel by each worker in the build stage | 25 | BUILD_BATCH_SIZE |
| batch_writes_enabled | Determines whether, on a per-worker basis, all elements (nodes and edges, or vectors) emitted by a batch of input nodes are written to the graph and vector stores as a bulk operation, or singly (see Batch writes) | True | BATCH_WRITES_ENABLED |
| enable_cache | Determines whether the results of LLM calls to models on Amazon Bedrock are cached to the local filesystem (see Caching Amazon Bedrock LLM responses) | False | ENABLE_CACHE |
To set a configuration parameter in your application code:
```python
from graphrag_toolkit import GraphRAGConfig

GraphRAGConfig.response_llm = 'anthropic.claude-3-haiku-20240307-v1:0'
GraphRAGConfig.extraction_num_workers = 4
```
You can also set configuration parameters via environment variables, as per the variable names in the table above.
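For example, assuming the toolkit reads these variables when GraphRAGConfig is first used (worth verifying for your version of the toolkit), you could set them from Python before using the rest of the library:

```python
import os

# Variable names come from the table above; they must be set before the toolkit
# reads its configuration for them to take effect.
os.environ['RESPONSE_MODEL'] = 'anthropic.claude-3-haiku-20240307-v1:0'
os.environ['EXTRACTION_NUM_WORKERS'] = '4'
```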
The extraction_llm and response_llm configuration parameters accept three different types of value:

- You can pass an instance of a LlamaIndex LLM object. This allows you to configure the graphrag-toolkit for LLM backends other than Amazon Bedrock.
- You can pass the model id of an Amazon Bedrock model. For example: anthropic.claude-3-haiku-20240307-v1:0.
- You can pass a JSON string representation of a LlamaIndex Bedrock instance. For example:

```json
{
  "model": "anthropic.claude-3-sonnet-20240229-v1:0",
  "temperature": 0.0,
  "max_tokens": 4096,
  "streaming": true
}
```
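As an illustration, here is a minimal sketch of setting these parameters in each of the three ways. The LlamaIndex import path in the commented lines is an assumption and may differ across LlamaIndex versions:

```python
from graphrag_toolkit import GraphRAGConfig

# 1. Model id of an Amazon Bedrock model
GraphRAGConfig.extraction_llm = 'anthropic.claude-3-haiku-20240307-v1:0'

# 2. JSON string representation of a LlamaIndex Bedrock instance
GraphRAGConfig.response_llm = '''{
    "model": "anthropic.claude-3-sonnet-20240229-v1:0",
    "temperature": 0.0,
    "max_tokens": 4096,
    "streaming": true
}'''

# 3. A LlamaIndex LLM instance (import path assumed; adjust to your installation)
# from llama_index.llms.bedrock import Bedrock
# GraphRAGConfig.response_llm = Bedrock(model='anthropic.claude-3-sonnet-20240229-v1:0', temperature=0.0)
```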
The embed_model configuration parameter accepts three different types of value:

- You can pass an instance of a LlamaIndex BaseEmbedding object. This allows you to configure the graphrag-toolkit for embedding backends other than Amazon Bedrock.
- You can pass the model name of an Amazon Bedrock model. For example: amazon.titan-embed-text-v1.
- You can pass a JSON string representation of a LlamaIndex BedrockEmbedding instance. For example:

```json
{
  "model_name": "amazon.titan-embed-text-v2:0",
  "dimensions": 512
}
```

When configuring an embedding model, you must also set the embed_dimensions configuration parameter to match the number of dimensions produced by that model.
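For example, a sketch that pairs the JSON configuration shown above with a matching embed_dimensions value:

```python
from graphrag_toolkit import GraphRAGConfig

# The dimensions declared in the JSON and embed_dimensions should agree
GraphRAGConfig.embed_model = '''{
    "model_name": "amazon.titan-embed-text-v2:0",
    "dimensions": 512
}'''
GraphRAGConfig.embed_dimensions = 512
```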
The graphrag-toolkit uses micro-batching to progress source data through the extract and build stages.
- In the extract stage a batch of source nodes is processed in parallel by one or more workers, with each worker performing chunking, proposition extraction and topic/statement/fact/entity extraction over its allocated source nodes. For a given batch of source nodes, the extract stage emits a batch of chunks derived from those source nodes.
- In the build stage, chunks from the extract stage are broken down into smaller indexable nodes representing sources, chunks, topics, statements and facts. These indexable nodes are then processed by the graph construction and vector indexing handlers.
The batch_writes_enabled configuration parameter determines whether all of the indexable nodes derived from a batch of incoming chunks are written to the graph and vector stores singly, or as a bulk operation. Bulk/batch operations tend to improve the throughput of the build stage, at the expense of some additional latency before this data becomes available to query.
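For example, the batch-related parameters from the table above can be tuned together; the values below are simply the defaults, shown here as a starting point rather than a recommendation:

```python
from graphrag_toolkit import GraphRAGConfig

GraphRAGConfig.batch_writes_enabled = True   # bulk writes for higher build throughput
GraphRAGConfig.extraction_batch_size = 4     # input nodes processed in parallel by all extract workers
GraphRAGConfig.build_batch_size = 25         # input nodes processed in parallel by each build worker
```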
If you're using Amazon Bedrock, you can use the local filesystem to cache and reuse LLM responses. Set GraphRAGConfig.enable_cache to True. LLM responses will then be saved in clear text to a cache directory. Subsequent invocations of the same model with the exact same prompt will return the cached response.
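For example:

```python
from graphrag_toolkit import GraphRAGConfig

# Responses from Bedrock models will be cached (in clear text) in a local cache directory
GraphRAGConfig.enable_cache = True
```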
The cache directory can grow very large, particularly if you are caching extraction responses for a very large ingest. The graphrag-toolkit will not manage the size of this directory or delete old entries. If you enable the cache, ensure you clear or prune the cache directory regularly.
The graphrag_toolkit's set_logging_config function allows you to set the logging level and apply filters to DEBUG log lines. Besides the logging level, you can supply a list of logger-name prefixes to include when outputting debug information, and a list of prefixes to exclude.
The following example sets the logging level to DEBUG, but also applies a filter that specifies that only messages from the storage module are to be emitted:

```python
from graphrag_toolkit import set_logging_config

set_logging_config(
    'DEBUG',
    ['graphrag_toolkit.storage']
)
```
The following example sets the logging level to DEBUG, together with a filter that specifies that only messages from the storage module are to be emitted, and another filter that excludes messages from the graph store factory within the storage module:

```python
from graphrag_toolkit import set_logging_config

set_logging_config(
    'DEBUG',
    ['graphrag_toolkit.storage'],
    ['graphrag_toolkit.storage.graph_store_factory']
)
```