> I read the V1 code this time instead of guessing

Does the LLM even keep a (self-accessible) record of previous internal actions to make this assertion believable, or is this yet another confabulation?

reply
Yes, the LLM is able to see the entire prior chat history, including tool use. This type of interaction occurs when the LLM fails to read the file but acts as though it had.
reply
This seems like the experience I've had with every model I've tried over the last several years. It seems like an inherent limitation of the technology, despite the hyperbolic claims of those financially invested in all of this paying off.
reply
Opus 4.6 pre-nerf was incredible, almost magical. It changed my understanding of how good models could be. But that's the only model that ever made me feel that way.
reply
That was better, but still not to the point that I'd just let it loose on my repo.
reply
Yes! I genuinely got a LOT of shit done with Opus 4.6 "pre nerf" with regular old out-of-the-box config, no crazy skills or hacks or memory tweaks or anything. The downfall is palpable. Textbook rugpull.
reply
If it isn’t working for you, why don’t you choose an older model, like 4.6?
reply
Matches what I am experiencing. It makes incredibly stupid mistakes.

The weird thing is, yesterday I asked it to test and report back on a 30+ commit branch for a PR, and it did that flawlessly.

reply
The docs suggest not using max effort in most cases to avoid overthinking :shrug:
reply
They've jumped the shark. I truly can't comprehend why all of these changes were necessary. They had a literal money-printing machine that actually got real shit done, really well. Now it's a gamble every time, and I am pulling back hard from the Anthropic ecosystem.
reply
It seems clear that it was a money-spending machine, not a money-printing machine.
reply