It won’t be a fair comparison against opus-4.6 but it will run quite well on your machine. I’ve tested qwen3.5 27B, Gemma4, minimax2.5 and Glm4.7 before on my m3 ultra. And i’d say this is the first model that I’m able to use for full agentic sessions. here is a pi session i just did and it worked quite well surprisingly: https://pi.dev/session/#c3d003becb1bfcc7ffbca04e89e1adf8
What seems very promising is that thinking blocks look coherent for the lack of a better word, and not that far away from thinking blocks (or rather, summaries) that I see from Claude models.
I think this could actually work for targeted worker agents that get explicit, detailed task instructions from better models.