undefined

points

[-]

Slightly related, I used Opus 4.6 to help me make marketing copy and ideas for my app. It understood the vibe I was going for on my baby-naming app (elation at discovery, curiosity, shared experiences), while 4.7 instantly wanted to pit the couples against each other (really highlighting the he said/she said) and the marketing copy went from "find a name easier" to "Our new feature is great. You're welcome." I can't get it to drop the snarky sass no matter how much I change CLAUDE.md, brand voice, etc.

All I did was upgrade claude code and use the new model. It most definitely exhibits misaligned behavior (compared to 4.6)

by philipkglass12 hours ago|

parent|

[-]

I tried Opus 4.7 for two days before I started beginning every session with "claude --model claude-opus-4-6".

I assume that 4.6 will become unavailable at some point, but I hope not any time soon. 4.7 hit usage limits faster, didn't do anything obviously better, and had more annoying behaviors in other aspects. I don't know if this is strictly a model issue or if there are also problems with how it's harnessed through Claude Code. I'm not willing to spend more time digging into it until I'm forced to.

by scottyah9 hours ago|

parent|

[-]

Join me in a petition for them to opensource 4.6 as their first model! It'll be like gemma4 but good enough for all the coding we do.

by tpurves8 hours ago|

prev|

[-]

Nobody is using LLMs to make lending decisions. They are using LLMs to read, extract and audit the supporting documents that go into normal well-tested, compliant and rules-based underwriting systems. And firms A/B test against humans doing the same work. The outcomes your are looking for are metrics like delivering faster results back to customers, with fewer mistakes and less fraud, more compliant, than a comparable human-only process.

by anonfunction7 hours ago|

parent|

[-]

[dead]