AI App Template

A template project to run ingestion and querying with AWS services.

Features

Fully Rust
Serverless (AWS Lambda)
Deploy with cargo lambda
File-based vector graph stored in AWS S3
Ingestion queue system with DynamoDB Streams
Collection-based search (a collection is a group of documents)
DynamoDB as main database
AWS Cognito for authentication
Slack integration
User teams

Architecture

Project structure

common: Common functions, e.g. JWT decoding and reading environment variables
composer: Composes the LLM input prompt
database: Database module for interacting with the database (DynamoDB)
document: Document module that parses documents, builds document nodes, and splits documents into overlapping chunks
helpers: Helper functions for AWS services
indexer: Indexer module that builds the vector graph with embedding models
lambdas: AWS Lambda functions for ingestion with SQS, querying, and the Slack API
resources: PDFium resources, which must be mounted in the AWS index Lambda to parse PDFs
slack: Slack module that handles the Slack integration

How it works

Ingestion

When a user uploads a document, the system saves it in S3
An indexing task is created
The document analyzer analyzes the document layout and builds a document graph
A document vector graph is created from the document graph with an embedding model and stored in S3
An overlapped chunking method is applied to reduce the chance of incomplete context
The user can associate the document with a collection so that multiple documents can be queried together
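
The overlapped chunking step above can be illustrated with a minimal sketch. It assumes character-based chunk sizes and a fixed overlap; the function name `chunk_with_overlap` and its parameters are hypothetical and are not taken from the template's document module, which may chunk by document nodes or tokens instead.

```rust
/// Split `text` into chunks of at most `chunk_size` characters, where each
/// chunk starts with the last `overlap` characters of the previous chunk.
/// Hypothetical helper for illustration; not the template's actual API.
pub fn chunk_with_overlap(text: &str, chunk_size: usize, overlap: usize) -> Vec<String> {
    assert!(chunk_size > 0, "chunk_size must be positive");
    assert!(overlap < chunk_size, "overlap must be smaller than chunk_size");

    // Work on characters rather than bytes so multi-byte UTF-8 is never split.
    let chars: Vec<char> = text.chars().collect();
    // Each new chunk starts `step` characters after the previous one.
    let step = chunk_size - overlap;
    let mut chunks: Vec<String> = Vec::new();
    let mut start = 0;

    while start < chars.len() {
        let end = usize::min(start + chunk_size, chars.len());
        chunks.push(chars[start..end].iter().collect::<String>());
        if end == chars.len() {
            break; // the final chunk reached the end of the text
        }
        start += step;
    }
    chunks
}
```

With `chunk_size = 4` and `overlap = 2`, the text `abcdefghij` is split into `abcd`, `cdef`, `efgh`, and `ghij`. A sentence cut at one chunk boundary therefore still appears whole in a neighbouring chunk, provided it is no longer than the overlap.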

Querying

When the system receives a user query:
The query is embedded with the embedding model
The system scans all documents in the target collection and filters them by cosine similarity
The system picks the top K document graph nodes
The system constructs the GPT prompt with the selected nodes as context
The system sends the enriched query to the external GPT service
When the system receives the response from the external GPT service, a callback request is triggered
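
The retrieval and prompt construction steps above could be sketched roughly as follows. This is an illustration under stated assumptions, not the template's implementation: the `Node` type, the `min_score` threshold, the function names, and the prompt wording are all hypothetical, and embeddings are assumed to be plain `f32` vectors already loaded from S3.

```rust
/// A document graph node with its embedding. Hypothetical type for illustration.
pub struct Node {
    pub text: String,
    pub embedding: Vec<f32>,
}

/// Cosine similarity of two vectors. Returns 0.0 when the lengths differ
/// or when either vector has zero length (norm).
pub fn cosine_similarity(a: &[f32], b: &[f32]) -> f32 {
    if a.len() != b.len() {
        return 0.0;
    }
    let dot: f32 = a.iter().zip(b).map(|(x, y)| x * y).sum();
    let norm_a = a.iter().map(|x| x * x).sum::<f32>().sqrt();
    let norm_b = b.iter().map(|x| x * x).sum::<f32>().sqrt();
    if norm_a == 0.0 || norm_b == 0.0 {
        return 0.0;
    }
    dot / (norm_a * norm_b)
}

/// Score every node against the query embedding, drop nodes below
/// `min_score` (an assumed threshold), and keep the `k` best matches.
pub fn top_k<'a>(
    query: &[f32],
    nodes: &'a [Node],
    k: usize,
    min_score: f32,
) -> Vec<(&'a Node, f32)> {
    let mut scored: Vec<(&'a Node, f32)> = nodes
        .iter()
        .map(|node| (node, cosine_similarity(query, &node.embedding)))
        .filter(|(_, score)| *score >= min_score)
        .collect();
    // Sort by score, highest first.
    scored.sort_by(|a, b| b.1.total_cmp(&a.1));
    scored.truncate(k);
    scored
}

/// Build a prompt that lists the selected nodes as numbered context,
/// followed by the user's question. The wording is an assumption.
pub fn build_prompt(question: &str, context: &[(&Node, f32)]) -> String {
    let mut prompt =
        String::from("Answer the question using only the context below.\n\nContext:\n");
    for (i, (node, _)) in context.iter().enumerate() {
        prompt.push_str(&format!("[{}] {}\n", i + 1, node.text));
    }
    prompt.push_str(&format!("\nQuestion: {}\n", question));
    prompt
}
```

A caller would embed the query, pass the embedding and the collection's nodes to `top_k`, and hand the result to `build_prompt` before sending the prompt to the external GPT service. A linear scan like this matches the "scan all documents" step; a large collection might need an index instead.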

Setup

Set up DynamoDB with the stream filter, which can be found in the readme file.
Mount the PDFium resources to every lambda that needs to parse PDFs, e.g. the document-indexer lambda
Mount the embedding model resources to every lambda that needs to run embedding, e.g. the document-indexer and search-api lambdas
Map the API lambdas to API Gateway and set up authentication

Deployment

Every lambda function in lambdas has two deployment commands.

Replace {{IAM_ROLE}} with the AWS IAM role for your project
cargo make stage: deploys the lambda function with the suffix -stage
cargo make production: deploys the lambda function with an optimized build


russellwmy/ai-app-template

Folders and files

Latest commit

History

Repository files navigation

AI App Template

Features

Architecture

Project structure

How it works

Ingestion

Querying

Setup

Deployment

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages