Skip to content

Commit

Permalink
Update SPEAR architecture in /doc/design folder (#3)
Browse files Browse the repository at this point in the history
  • Loading branch information
liborui authored Dec 18, 2024
1 parent 157327b commit d5727b9
Show file tree
Hide file tree
Showing 2 changed files with 30 additions and 0 deletions.
Binary file added docs/design/img/spear-arch.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
30 changes: 30 additions & 0 deletions docs/design/spear-architecture.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,30 @@
# SPEAR high-level design

## Design principles
SPEAR's first design principle is to help developers and end-users an easier approach to develop and deploy an AI Agent in Cloud-Edge integrated manner.

### Scalable
SPEAR aim to provide scalable agent service with the help of auto-scaling.

### Performant
SPEAR aim to serve Agent requests performant, with the help of prudently designed APIs and online scheduling system.

## Workflow

<div align="center">
<img width="500px" src="./img/spear-arch.png"></img>
</div>

### Offline Phase
- Input: Code of existing agent framework (e.g., LangChain)
- Output: Deployment instance and configuration
- Key technology
- Multi-instance selection
- Automatic code generation

### Online Phase
- Input: Generated instance of offline phase
- Output: Scalable and performant user request serving
- Key technology
- Cloud-edge-end integrated offloading
- Agent-oriented auto-scaling

0 comments on commit d5727b9

Please sign in to comment.