Magistral Small builds upon Mistral Small 3.2 with added reasoning capabilities through SFT from Magistral Medium traces and RL training. It's a small, efficient reasoning model with 24B parameters that can be deployed locally on a single RTX 4090 or 32GB RAM MacBook once quantized.
This model updates Magistral Small 1.1 with improved benchmark performance, better tone and persona, and fewer infinite generations.
Parameters
Custom configuration options included with this model