From b308cf0664b118bd0e6048fab9f9500cb5da1a12 Mon Sep 17 00:00:00 2001 From: Eugene Cheah Date: Thu, 5 Oct 2023 17:19:22 +0800 Subject: [PATCH] Update architecture.md --- docs/advance/architecture.md | 1 + 1 file changed, 1 insertion(+) diff --git a/docs/advance/architecture.md b/docs/advance/architecture.md index 69730ef..665507f 100644 --- a/docs/advance/architecture.md +++ b/docs/advance/architecture.md @@ -67,6 +67,7 @@ If you would like a breakdown on the math involved, you can find it covered in a Finally once you gotten a grasp on the fundemantals, you can start studying the training and cuda code respectively in the [main repo](https://github.com/BlinkDL/RWKV-LM) **Additional links/resources** +- Notes on RWKV architecture: [https://www.ivanleo.com/blog/a_guide_to_rwkv](https://www.ivanleo.com/blog/a_guide_to_rwkv) - RWKV model math: [https://ben.bolte.cc/rwkv-model](https://ben.bolte.cc/rwkv-model) - RWKV explained: [https://fullstackdeeplearning.com/blog/posts/rwkv-explainer/](https://fullstackdeeplearning.com/blog/posts/rwkv-explainer/) - How RWKV works: [https://johanwind.github.io/2023/03/23/rwkv_details.html](https://johanwind.github.io/2023/03/23/rwkv_details.html)