https://arxiv.org/abs/2302.10866
Hyena Hierarchy: Towards Larger Convolutional Language Models (Michael Poli, Stefano Massaroli, Eric Nguyen, Daniel Y. Fu, Tri Dao, Stephen Baccus, Yoshua Bengio, Stefano Ermon, Christopher Ré)
It's interesting that the push toward SSMs ends up leading back to CNNs.
#state_space_model #convolution