upvote
You're not really solving problems, you're retrieving the best match of solved problems from compressed corpus. And that corpus is available to many companies, meaning "hard" problems stop having "hard problem" value the moment they enter the weights of any model via the internet ... or distill from one model to another. Anthropics business model is commoditising knowledge, but as we see with the Fable model card, they only want it done to the knowledge of other businesses, in their own field, they totally hate it.
reply
I don’t think that’s an accurate or useful characterization of modern AI like Claude at all. It is not simply regurgitating knowledge. It applies its knowledge to create bespoke solutions to the problem you pose to it, and is able to self evaluate its progress towards the completion criteria. If you don’t think that counts as “problem solving”, your definition would exclude nearly all knowledge work and engineering.
reply
People underestimate the vastness of training data (internet) and overestimate their ability to recognize if something is really bespoke. Not to say the no problem solving is happening, because there are many problems that we inefficiently solve again and again and the LLMs are making the solutions more accessible to everyone with a subscription.
reply
> It applies its knowledge to create bespoke solutions to the problem you pose to it, and is able to self evaluate its progress towards the completion criteria.

It imitates applying knowledge. The imitation may be uncanny, but assigning LLMs intentionality and ToM is a category error.

reply
Does "applying knowledge" necessitate human-like intentionality and theory of mind? If you insist it does, and this is a category error, then we need a new category.

By analogy, consider that many have referred to classical, deterministic computing as some kind of "thinking" for the last half century+. Does this stop being kosher when the computer has an uncanny propensity for human language? Perhaps, but the computer is still clearly chewing through problems that would have required a lot of human thinking (e.g., arithmetic) in ages past.

I haven't seen any genuine proposals for words to replace the human mind analogues, let alone proposals that the anglosphere would plausibly adopt en masse.

reply
Indubitably, computably.
reply
It’s like saying you can’t make a unique sentence unless you first make unique words
reply
> You're not really solving problems, you're retrieving the best match of solved problems from compressed corpus.

This is not correct. LLMs interpolate in a high dimensional space, so you're actually composing the best matches in a compressed corpus to find novel points/paths in that space. That is problem solving.

reply
> Back-end improvements (if done right), should improve platform speed, stability, scalability etc. which should have revenue implication

Depends entirely on the domain. If you're selling entreprise software, this kind of stuff barely matters for sales.

It can reduce operational costs which is good but there's a limit to how much that's worth.

reply
Yep, there are many, many, non-niche domains in which this doesn’t mean much at all.
reply
The thing about AI-generated “solutions” is that they often go down bad rabbit holes and need to be re-run, or since they are so “cheap” to create they are often just thrown away and rebuilt when requirements evolve. Plus, just more stuff is created and needs to be maintained. So in the end, your efficiency gains go out the window.
reply
In my experience, the challenge in software development is not to solve a problem, but to define the outcome, the scope, the acceptance criteria etc.
reply
Exactly, this is the hardest part and the reason why many projects fail
reply
20x the cost means you need to have fable to be 20x better than the alternative, which is a tall order. And there's more options out there too, perhaps the 4x cost is enough.

This means if the deepseek / under 1k alternative is at least x1.2 improvement, fable needs to be x24, which I think is very2 unreasonable. It is possible for it to worth if it can x2 a $20k SWE, though I doubt it can do that.

reply
“Ability to solve hard problems in days vs weeks as immense value”. Citation needed.

LlMs are incredible don’t get me wrong, but they are good on tiny contexts (writing a script). Not on software engineering (adding features to Chrome).

reply
Honestly, LLMs been OK at adding features to software since around Opus 4.5. From what I've tried of Fable, it's a decent step up from the Opus models and I can only see things getting better.
reply
>pushback on a few points

Claude keeps telling me this when I argue with it. LMAO.

reply
“gently push back”
reply