Transformers Reading Group @ Mila
362 subscribers
42:06
#4 - Hungry Hungry Hippos: Towards Language Modeling with State Space Models
Transformers Reading Group @ Mila
2.8K views • 1 year ago
43:01
#3 - Attending to graph transformers
Transformers Reading Group @ Mila
742 views • 1 year ago
34:30
#2 - Transformers from an optimization perspective
Transformers Reading Group @ Mila
138 views • 1 year ago
34:35
#1 - Mega: Moving Average Equipped Gated Attention
Transformers Reading Group @ Mila
505 views • 1 year ago
End of Videos