still no minimal reasoning in G3.1P :( (this is why Opus 4.6 is worth the price ...

iCarrot · 2026-02-20T01:51:50 1771552310

You can turn off thinking in Gemini pro models by using completion mode.

Essentially, append a message with role=model and minimal text part, such as a simple "A", at the end of the "contents" array. The model will try to complete the message without using any thought tokens.

You can also set the model message to start with "think" or something along that line and watch it thinks out loud (or melts down with over-thinking and stop due to reaching maximum output token)

``` [ { "parts": [{"text": "hello"}], "role": "user" }, { "parts": [{"text": "*think"}], "role": "model" } ] ```

jbellis · 2026-02-26T15:29:07 1772119747

TIL gemini still supports completion mode, that's super useful!

sunaookami · 2026-02-19T20:04:17 1771531457

Thinking is just tacked on for Anthropic's models and always has been so leaving it off actually produces better results everytime.

girvo · 2026-02-19T21:35:40 1771536940

What about for analysis/planning? Honestly I've been using thinking, but if I don't have to with Opus 4.6 I'm totally keen to turn it off. Faster is better.

sunaookami · 2026-02-20T17:08:12 1771607292

I've always just used the "Plan mode" in Claude Code, I don't know if it uses thinking? I have "MAX_THINKING_TOKENS" in my settings.json set to "0", too. Didn't notice a drop in performance, I find it better because it doesn't overthink ("wait, let me try..."). Likely depends on a case-by-case basis (as so often with AI). For me, it's better without thinking.