Pull requests: intel/xFasterTransformer
Add env param KV_CACHE_LOCATION to control kv cache memory numanode location
#462
opened Jun 28, 2024 by
a3213105
[Layers] Increased the threshold for enabling flashAttn
Label: performance (performance related)
#428
opened Jun 3, 2024 by
abenmao
[Kernel] Add dynamic onednn matmul.
Label: performance (performance related)
#425
opened May 28, 2024 by
changqi1
[Eval] Add eval test with opencompass.
Labels: benchmark (performance or accuracy benchmark), enhancement (new feature or request)