undefined

upvote

points

by TurdF3rguson20 hours ago |

upvote

by KronisLV14 hours ago|

[-]

It's odd that the model doesn't support it directly, but they at least have https://docs.z.ai/devpack/mcp/vision-mcp-server

reply

upvote

by maxk4218 hours ago|

[-]

Openrouter definitely supports vision models. Why would you have to give up vision?

reply

upvote

by Mashimo11 hours ago|

[-]

> Why would you have to give up vision?

Because you would have to switch model.

You can't just say "Oh, button X looks weird see [screenshot]" while coding with GLM. You would need to switch to another model and then maybe back.

reply

upvote

by TurdF3rguson17 hours ago|

[-]

For example if I want to paste a screenshot of what I mean, I can't.

reply

upvote

by cmrdporcupine20 hours ago|

[-]

If you using opencode or similar you can just temporarily switch models -- in the same session -- to something that has vision and have it look at your image. And then switch back.

reply

upvote

by gazpachotron19 hours ago|

[-]

Or create an agent or subagent that just looks at images, and specify a vision model for that agent.

reply

upvote

by TurdF3rguson15 hours ago|

[-]

I don't see how that helps, I would still need to somehow get the image into the coding model's context.

reply

upvote

by gmerc18 hours ago|

[-]

vision runs just fine locally for most usecases, so it's really just a skill to call that Ollama instance

reply

upvote

by nozzlegear20 hours ago|

[-]

Why's that?

reply