Skip to content

Latest commit

 

History

History
312 lines (245 loc) · 12.4 KB

File metadata and controls

312 lines (245 loc) · 12.4 KB
# Requirements: python=3.12.4
# conda create -n jarvis python=3.12.4
# conda activate jarvis

# For Installing...
|> python setup.py

# For Running...
|> python jarvis_assistant/JARVIS_assistant.py
Running on local URL:  http://127.0.0.1:7860

To create a public link, set `share=True` in `launch()`.
_

# For dark mode ...
# http://localhost:7860/?__theme=dark

JARVIS Chatbot: a local simple RAG assistant with PDF files

  • Author: Mr. Jack Tung
  • Create: 03 July 2024 - 08 PM
  • Discuss: https://zalo.me/g/mtffzi945
  • If you like what I do, give me a star ^^ ~> ⭐

Why JARVIS?

  • All Free ~> 100% Free
  • Local Run ~> 100% Privacy
  • Open Source ~> 100% DIY Custom
  • 30 multi-languages support
  • RAG with PDF, DOCX, TXT files support
  • Multi-Function calling
  • Agent calling

Features

  • support Qwen2.5 is the latest series of Qwen large language models.
  • support 30 multi-languages, including Chinese, English, French, Spanish, Portuguese, German, Italian, Russian, Japanese, Korean, Vietnamese, Thai, Arabic, and more.
  • supports long contexts of up to 128K tokens and can generate up to 8K tokens.
  • upload in any language PDF files and response with user language ^^
  • unlimit upload files to vector database
  • support PDF, DOCX, TXT, MD, MDX files
  • multi-files upload
  • custom Inferece settings
  • unlimit & auto save chat history to vector database
  • support custom System Prompt
  • Ollama model auto-loader
  • (v0.1.0) only 360 lines of python code in only 1 file ^^
  • custom Retrieval settings
  • support documents reference response
  • support Groq API, OpenAI API, Gemini API
  • support LiteLLM locally
  • workspaces management
  • on/off Retrieval
  • Multi-Function calling
  • ReACT agent
  • ReWOO agent
  • Retrieval grader
  • OpenAI Strawberry(o1) likely system prompt for Multi-Step Reasoning chains
  • support Light Mode | Dark Mode

JARVIS assistant (22 ⭐) .vs

Update next (v0.x.x)

alt-text

Update next (v0.1.x)

(Multilingual, Multimodal, MultiAgent, MultiDevice, ...)

Multimodal support

Tools and Multi-Agents

Mobile

Synthetic Data

Fine tune embbeding model for RAG pipeline ^^

Dynamically Semantic Router

All RAG techniques

GraphRAG

Continuous Pre-training & Fine-tuning

Document AI

Knowledge Editing

Update September 24, 2024 (v0.1.5)

  • update: Auto control installing dependencies with Poetry ^^

alt-text

Update September 21, 2024 (v0.1.5)

  • update: system prompt choosing or editing
  • udpate: Light Mode | Dark Mode

alt-text

Update September 18, 2024 (v0.1.5)

  • update: OpenAI Strawberry(o1) likely system prompt for Multi-Step Reasoning chains
  • update: support Qwen2.5 is the latest series of Qwen large language models.

alt-text

Update September 02, 2024 (v0.1.5)

alt-text

+	-------------------- workflow ---------------------------------
|	v0.1.5
|	JARVIS_assistant.py
|		|
|		~> gui.py ~> custom_ui_style.py
|			|
|			~> gui_action.py ~> model_settings.py , tools.py , prompts.py , utils.py , get_model_list.py
|				|
|				~> db_helper.py  ~> file_readers.py
|					|
|					~> datasource_router.py , grader.py: retrieval_grader()
+ -----------------------------------------------------------------

Update Jul 31, 2024 (v0.1.4)

  • add: Multi-Function calling
  • add: ReACT agent
  • add: API Keys management
  • add: 3 rounds chat-history memory
  • update: support both MacOS and Windows

alt-text

alt-text

Update Jul 28, 2024 (v0.1.3)

  • add: Workspaces management
  • add: On/Off Retrieval
  • add: support DOCX files

alt-text

alt-text

Update Jul 15, 2024 (v0.1.2)

Installation

alt-text

Notes:

chroma-core/chroma#189 (comment)

if you have an issue on Windows OS, while doing pip install chromadb. To Resolve this issue,

You need to download https://visualstudio.microsoft.com/visual-cpp-build-tools/ first.

Next, navigate to "Individual components", find these two

MSVC v143 - VS2002 C++ x64/86 build tools (lates) and Windows 10 SDK

then: pip install -U chromadb

Update Jul 09, 2024 (v0.1.1)

alt-text

Screenshot Jul 03, 2024 (v0.1.0)

alt-text

BONUS: How To Create Custom Ollama Models From HuggingFace ( GGUF ) file

URL: https://ollama.com/mrjacktung/mradermacher-llamax3-8b-alpaca-gguf

step 1: Go to

step 2: Download GGUF file

  • Choice: Q4_K_M | 5.0GB | fast | recommended

step 3: Create Modelfile (from Terminal)

  • $ echo ‘FROM “./LLaMAX3-8B-Alpaca.Q4_K_M.gguf”\nTEMPLATE “{{ .System }}\n### Input:\n{{ .Prompt }}\n### Response:”’ >> Modelfile

step 4: Login your Ollama account (eg. mrjacktung)

  • My models ~> New ~> create new space with name: mradermacher-llamax3-8b-alpaca-gguf

step 5: Create repository

  • ollama create -f Modelfile mrjacktung/mradermacher-llamax3-8b-alpaca-gguf
  • ollama push mrjacktung/mradermacher-llamax3-8b-alpaca-gguf

step 6: Testing

  • ollama run mrjacktung/mradermacher-llamax3-8b-alpaca-gguf

Thanks to:

Star History

Star History Chart