> Benchmarks suggest this model loses to Deepseek-R1 in every one-shot compariso...

		littlestymaar 3 months ago \| parent \| context \| favorite \| on: Magistral — the first reasoning model by Mistral A... > Benchmarks suggest this model loses to Deepseek-R1 in every one-shot comparison. That's not particularly surprising though as the Medium variant is likely close to ten times smaller than DeepSeek-R1 (granted it's a dense model and not an MoE, but still).