Novel Architectures
Emerging architectural innovations beyond transformer-based models.
Overview
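Research into new model architectures that could overcome transformer limitations, chiefly the quadratic cost of self-attention in sequence length and the key-value cache that grows with context length during inference.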
Key Areas
State Space Models
- Mamba and variants
- Linear attention mechanisms (closely related: they can also be computed as a recurrence over a fixed-size state)
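
The approaches listed above share one idea: a linear recurrence over a fixed-size hidden state. The following is a minimal sketch of that idea, assuming a scalar state and fixed parameters. The names `ssm_scan`, `ssm_conv`, `a`, `b`, and `c` are illustrative and not taken from any library. Mamba additionally makes some of these parameters depend on the input, which this sketch does not attempt.

```python
def ssm_scan(xs, a, b, c):
    """Run a scalar linear state-space recurrence over a sequence.

    h_t = a * h_{t-1} + b * x_t
    y_t = c * h_t
    Cost is linear in sequence length and the state has constant size.
    """
    h = 0.0
    ys = []
    for x in xs:
        h = a * h + b * x   # update the hidden state
        ys.append(c * h)    # read out the current output
    return ys


def ssm_conv(xs, a, b, c):
    """Compute the same outputs as a causal convolution.

    Unrolling the recurrence gives y_t = sum_j (c * a**j * b) * x_{t-j}.
    This naive loop is quadratic; it is here only to show the equivalence.
    """
    ys = []
    for t in range(len(xs)):
        ys.append(sum(c * (a ** j) * b * xs[t - j] for j in range(t + 1)))
    return ys


if __name__ == "__main__":
    impulse = [1.0, 0.0, 0.0]
    print(ssm_scan(impulse, a=0.5, b=1.0, c=2.0))
    print(ssm_conv(impulse, a=0.5, b=1.0, c=2.0))
```

The two functions give the same result. This is why a model of this kind can in principle be trained in a parallel (convolution-like) form and run step by step at inference time.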
Hybrid Architectures
- Combinations of different attention patterns (for example, local sliding-window layers interleaved with global attention layers)
- Mixture of experts innovations
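
To make the mixture-of-experts item concrete, here is a minimal sketch of top-k routing for a single token. It assumes the router scores are passed in directly; in a real model they would typically come from a learned layer applied to the token. Renormalizing the gate weights over the selected experts is one common choice, not the only one. All names (`top_k_route`, `moe_forward`, the toy experts) are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def top_k_route(router_logits, k):
    """Pick the k highest-scoring experts for one token.

    Returns (expert_index, weight) pairs. Weights are renormalized
    over the selected experts so they sum to 1 (an assumption of this sketch).
    """
    probs = softmax(router_logits)
    chosen = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in chosen)
    return [(i, probs[i] / norm) for i in chosen]


def moe_forward(x, router_logits, experts, k=2):
    """Evaluate only the selected experts and mix their outputs by gate weight."""
    return sum(w * experts[i](x) for i, w in top_k_route(router_logits, k))


if __name__ == "__main__":
    # Toy scalar "experts" standing in for feed-forward sub-networks.
    experts = [lambda x: x + 1.0, lambda x: x * 2.0, lambda x: 0.0]
    print(moe_forward(3.0, [0.0, 0.0, -10.0], experts, k=2))
```

Only `k` of the experts run per token, so total parameter count can grow without a proportional increase in per-token compute.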
Recurrent Approaches
- RWKV
- RetNet
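
As an illustration of the recurrent approach, the sketch below shows a simplified single-head retention step in the style of RetNet: a state matrix is decayed by a factor `gamma` and updated with the outer product of the current key and value. It deliberately omits the position rotation, scaling, normalization, and multi-head structure of the published model. The function names are hypothetical. A quadratic "parallel" form is included only to check that both computations agree.

```python
def retention_recurrent(qs, ks, vs, gamma):
    """Recurrent form: S_n = gamma * S_{n-1} + k_n^T v_n, o_n = q_n S_n.

    qs, ks: lists of length-d_k vectors; vs: list of length-d_v vectors.
    The state S has fixed size d_k x d_v regardless of sequence length.
    """
    d_k, d_v = len(ks[0]), len(vs[0])
    state = [[0.0] * d_v for _ in range(d_k)]
    outputs = []
    for q, k, v in zip(qs, ks, vs):
        for i in range(d_k):
            for j in range(d_v):
                state[i][j] = gamma * state[i][j] + k[i] * v[j]
        outputs.append([sum(q[i] * state[i][j] for i in range(d_k)) for j in range(d_v)])
    return outputs


def retention_parallel(qs, ks, vs, gamma):
    """Parallel form: o_n = sum over m <= n of gamma**(n - m) * (q_n . k_m) * v_m."""
    outputs = []
    for n, q in enumerate(qs):
        out = [0.0] * len(vs[0])
        for m in range(n + 1):
            score = sum(a * b for a, b in zip(q, ks[m])) * gamma ** (n - m)
            for j, v_j in enumerate(vs[m]):
                out[j] += score * v_j
        outputs.append(out)
    return outputs


if __name__ == "__main__":
    qs = [[1.0, 0.0], [0.0, 1.0]]
    ks = [[1.0, 0.0], [1.0, 1.0]]
    vs = [[2.0], [4.0]]
    print(retention_recurrent(qs, ks, vs, gamma=0.5))
    print(retention_parallel(qs, ks, vs, gamma=0.5))
```

The recurrent form needs constant memory per step, which is the main appeal of these models at inference time.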