Mistral AI
Jump to navigation
Jump to search
- Products:
Mistral 7B uses Grouped-query attention (GQA) intended for faster inference and Sliding Window Attention (SWA) intended to handle longer sequences
Related
See also
Advertising: