From b93f8ef2846be1696139a227d339e57dc2d4d2d3 Mon Sep 17 00:00:00 2001 From: Eugene Cheah Date: Sat, 9 Dec 2023 15:55:26 -0800 Subject: [PATCH] tweak --- docs/README.md | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/docs/README.md b/docs/README.md index 3e72f4a..f864f8a 100644 --- a/docs/README.md +++ b/docs/README.md @@ -19,11 +19,11 @@ So it's combining the best of RNN and transformer - great performance, fast infe | Version | v4 Raven | v4 World | v5 World | v6 World | |---|---|---|---|---| -| Paper | 🎓[Paper Accepted @ EMNLP 2023](https://arxiv.org/abs/2305.13048) | (no architecture change) | wip | wip | +| Paper | 🎓[Paper Accepted @ EMNLP 2023](https://arxiv.org/abs/2305.13048) | (no architecture change) | 🔧 wip | 🧪wip | | Overall Status | 🌚 EOL - Recommended to use v5 world instead | ✅ GA - Recommended to switch to v5 world when possible | 🔧 Training | 🧪 Prototyping | -| 0.4B model | [Fully Trained : rwkv-pile-430m](https://huggingface.co/RWKV/rwkv-4-430m-pile) | ✅ [Fully Trained](https://huggingface.co/RWKV/rwkv-4-world-430m) | ✅ [Fully Trained](https://huggingface.co/BlinkDL/rwkv-5-world/blob/main/RWKV-5-World-0.4B-v2-20231113-ctx4096.pth) | 🧪Prototyping | -| 1.5B model | [Fully Trained : rwkv-raven-1b5](https://huggingface.co/RWKV/rwkv-raven-1b5) | ✅ [Fully Trained](https://huggingface.co/RWKV/rwkv-4-world-1b5) | ✅ [Fully Trained](https://huggingface.co/BlinkDL/rwkv-5-world/blob/main/RWKV-5-World-1B5-v2-20231025-ctx4096.pth) | 🧪Prototyping | -| 3B model | [Fully Trained : rwkv-raven-3b](https://huggingface.co/RWKV/rwkv-raven-3b) | ✅ [Fully Trained](https://huggingface.co/RWKV/rwkv-4-world-3b) | 🔧 [Finalizing ...](https://huggingface.co/BlinkDL/rwkv-5-world/blob/main/RWKV-5-World-3B-v2-20231118-ctx16k.pth) | 🧪Prototyping | +| 0.4B model | [Fully Trained : rwkv-pile-430m](https://huggingface.co/RWKV/rwkv-4-430m-pile) | ✅ [Fully Trained](https://huggingface.co/RWKV/rwkv-4-world-430m) | ✅ [Fully Trained](https://huggingface.co/BlinkDL/rwkv-5-world/blob/main/RWKV-5-World-0.4B-v2-20231113-ctx4096.pth) | 🧪 Prototyping | +| 1.5B model | [Fully Trained : rwkv-raven-1b5](https://huggingface.co/RWKV/rwkv-raven-1b5) | ✅ [Fully Trained](https://huggingface.co/RWKV/rwkv-4-world-1b5) | ✅ [Fully Trained](https://huggingface.co/BlinkDL/rwkv-5-world/blob/main/RWKV-5-World-1B5-v2-20231025-ctx4096.pth) | 🧪 Prototyping | +| 3B model | [Fully Trained : rwkv-raven-3b](https://huggingface.co/RWKV/rwkv-raven-3b) | ✅ [Fully Trained](https://huggingface.co/RWKV/rwkv-4-world-3b) | 🔧 [Finalizing ...](https://huggingface.co/BlinkDL/rwkv-5-world/blob/main/RWKV-5-World-3B-v2-20231118-ctx16k.pth) | 🧪 Prototyping | | 7B model | [Fully Trained : rwkv-raven-7b](https://huggingface.co/RWKV/rwkv-raven-7b) | ✅ [Fully Trained](https://huggingface.co/RWKV/rwkv-4-world-7b) | 🔧 [Training ...](https://huggingface.co/BlinkDL/temp/blob/main/rwkv-x052-7b-world-v2-79%25trained-20231208-ctx4k.pth) | | | 14B model | [Fully Trained : rwkv-raven-14b](https://huggingface.co/RWKV/rwkv-raven-14b) | not-planned | scheduled | |