Skip to content

Latest commit

 

History

History
7 lines (4 loc) · 212 Bytes

220831 Efficient Sparsely Activated Transformers.md

File metadata and controls

7 lines (4 loc) · 212 Bytes

https://arxiv.org/abs/2208.14580

Efficient Sparsely Activated Transformers (Salar Latifi, Saurav Muralidharan, Michael Garland)

moe 레이어를 포함해서 latency를 목표한 transformer search.

#nas #moe