Mistral AI
Jump to navigation
Jump to search
- Products: Mistral 7B, Mixtral 8x7B, Mistral Medium
Mistral 7B uses Grouped-query attention (GQA) intended for faster inference and Sliding Window Attention (SWA) intended to handle longer sequences
Related
See also
Advertising: