Item 44238079

diggan • 2 days ago

The only mention of tools I could find is this:

> it significantly improves project planning, backend architecture, frontend design, and data engineering through sequenced, multi-step actions involving external tools or API.

I'm guessing this means it was trained with tool calling? And if so, does that mean it does tool calling within the thinking/reasoning, or within the main text? Seems unclear

simonw • 2 days ago

Tool calling isn't enabled in the official Magistral Small GGUF (or the Ollama one) which is sad. Hope they (or someone else) fix that soon.

2 replies

NitpickLawyer • 2 days ago

They have already released Devstral, which is a tool-specific finetune of the same base model. That works pretty well with cline (even though it was specifically tuned for open-hands).

This would likely be a good model for the "plan" mode in various agentic tools (cline, aider, cursor/windsurf/void, etc). So you'd have a chat in plan mode, then use devstral to actually implement that plan.

1 reply

diggan • 2 days ago

Devstral is targeting tool use+coding I think, so something like Magistral but also tool calling (during thinking) would be handy too, just for other use cases. But also beneficial in the context of creating plans for Devstral.

jmorgan • 1 day ago

Working on adding tool calling support to Magistral in Ollama. It requires a tokenizer change and also uses a new tool calling format. Excited to see the results of combining thinking + tool calling!