self.attention = RWKV6Attention(hidden_size=config.dim, num_heads=config.n_head, layer_idx=layer_id)
Hi, can you provide some runnable code for reproduction? It works normally for me when running:
python benchmark_training_throughput.py --name rwkv6
Hi, thanks for reporting it. What is your GPU model?
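To answer the question above, a small helper can collect the GPU model along with the library versions. This is a hedged sketch (the helper name `env_report` is hypothetical); the `torch.cuda` calls are standard PyTorch API and are skipped if torch is not installed:

```python
import platform

def env_report():
    """Gather the environment details requested above (hypothetical helper)."""
    info = {"python": platform.python_version()}
    try:
        import torch
        info["torch"] = torch.__version__
        if torch.cuda.is_available():
            # GPU model and compute capability, e.g. ('NVIDIA A100', (8, 0))
            info["gpu"] = torch.cuda.get_device_name(0)
            info["compute_capability"] = torch.cuda.get_device_capability(0)
    except ImportError:
        pass
    return info

print(env_report())
```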
Describe the bug
Error message:
python: /project/lib/Analysis/Allocation.cpp:47: std::pair<llvm::SmallVector, llvm::SmallVector > mlir::triton::getCvtOrder(mlir::Attribute, mlir::Attribute): Assertion `!(srcMmaLayout && dstMmaLayout && !srcMmaLayout.isAmpere()) && "mma -> mma layout conversion is only supported on Ampere"' failed.
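The assertion originates in Triton's layout-conversion logic: converting between two MMA layouts is only supported when the source layout is an Ampere layout, so this failure usually points at running the kernel on a pre-Ampere GPU (e.g. V100 or T4). A rough host-side proxy check, sketched in plain Python (with PyTorch the capability tuple would come from `torch.cuda.get_device_capability()`; the helper name is hypothetical and this is not Triton's internal check):

```python
def mma_to_mma_supported(capability):
    """Rough proxy for the assertion above: Ampere GPUs report compute
    capability 8.x (A100 is (8, 0), RTX 30-series is (8, 6)), while
    pre-Ampere cards such as V100 ((7, 0)) or T4 ((7, 5)) do not."""
    major, _minor = capability
    return major == 8

# A pre-Ampere capability here matches the failing assertion's condition.
print(mma_to_mma_supported((8, 0)))  # Ampere: supported
print(mma_to_mma_supported((7, 0)))  # V100: not supported
```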
Steps to reproduce the bug
Calling process:
from fla.layers.rwkv6 import RWKV6Attention
self.attention = RWKV6Attention(hidden_size=config.dim, num_heads=config.n_head)
o, _, past_key_values = self.attention(self.attention_norm(x), attention_mask=mask, past_key_values=past_key_values)
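The snippet above is not self-contained (it references `self`, `config`, and an external norm layer). A minimal standalone version, assuming hypothetical sizes and the constructor/forward signature shown above; it requires `fla`, `triton`, and a CUDA GPU, so it is wrapped in a function rather than executed at import time:

```python
def repro(hidden_size=512, num_heads=4, seq_len=128):
    """Standalone repro sketch; hidden_size/num_heads/seq_len are
    hypothetical illustration values, not the reporter's config."""
    import torch
    from fla.layers.rwkv6 import RWKV6Attention

    attn = RWKV6Attention(hidden_size=hidden_size, num_heads=num_heads)
    attn = attn.to("cuda", torch.bfloat16)
    x = torch.randn(1, seq_len, hidden_size, device="cuda", dtype=torch.bfloat16)
    # Same call pattern as in the report: returns (output, _, past_key_values).
    o, _, past_key_values = attn(x, attention_mask=None, past_key_values=None)
    return o.shape
```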
Expected behavior
None
Environment info
Environment:
torch 2.4.1
triton 3.0.0
einops 0.8.0