Posts

Showing posts with the label neural network architectures

The Transformer Family Version 2.0 January 27, 2023 · 45 min · Lilian Weng

https://lilianweng.github.io/posts/2023-01-27-the-transformer-family-v2/