You're not really solving problems; you're retrieving the best match among solved problems from a compressed corpus. And that corpus is available to many companies, meaning "hard" problems stop having "hard problem" value the moment they enter the weights of any model via the internet ... or are distilled from one model to another. Anthropic's business model is commoditising knowledge, but as we see with the Fable model card, they only want that done to the knowledge of other businesses; in their own field, they totally hate it.

by aroman 4 days ago

I don’t think that’s an accurate or useful characterization of modern AI like Claude at all. It is not simply regurgitating knowledge. It applies its knowledge to create bespoke solutions to the problem you pose to it, and is able to self-evaluate its progress towards the completion criteria. If you don’t think that counts as “problem solving”, your definition would exclude nearly all knowledge work and engineering.

by geraneum 4 days ago

People underestimate the vastness of the training data (the internet) and overestimate their ability to recognize whether something is really bespoke. That's not to say no problem solving is happening: there are many problems that we inefficiently solve again and again, and LLMs are making the solutions more accessible to everyone with a subscription.

by computably 4 days ago

> It applies its knowledge to create bespoke solutions to the problem you pose to it, and is able to self evaluate its progress towards the completion criteria.

It imitates applying knowledge. The imitation may be uncanny, but assigning LLMs intentionality and ToM is a category error.

by igregoryca 4 days ago

Does "applying knowledge" necessitate human-like intentionality and theory of mind? If you insist it does, and this is a category error, then we need a new category.

By analogy, consider that many have referred to classical, deterministic computing as some kind of "thinking" for the last half century+. Does this stop being kosher when the computer has an uncanny propensity for human language? Perhaps, but the computer is still clearly chewing through problems that would have required a lot of human thinking (e.g., arithmetic) in ages past.

I haven't seen any genuine proposals for words to replace the human mind analogues, let alone proposals that the anglosphere would plausibly adopt en masse.

by GiffertonThe3rd 4 days ago

Indubitably, computably.

by squeegmeister 4 days ago

It’s like saying you can’t make a unique sentence unless you first make unique words.

by naasking 4 days ago

> You're not really solving problems, you're retrieving the best match of solved problems from compressed corpus.

This is not correct. LLMs interpolate in a high dimensional space, so you're actually composing the best matches in a compressed corpus to find novel points/paths in that space. That is problem solving.

by ahtihn 5 days ago

> Back-end improvements (if done right), should improve platform speed, stability, scalability etc. which should have revenue implication

Depends entirely on the domain. If you're selling enterprise software, this kind of stuff barely matters for sales.

It can reduce operational costs which is good but there's a limit to how much that's worth.

by UqWBcuFx6NV4r 5 days ago

Yep, there are many, many non-niche domains in which this doesn’t mean much at all.

by skywhopper 4 days ago

The thing about AI-generated “solutions” is that they often go down bad rabbit holes and need to be re-run, or since they are so “cheap” to create they are often just thrown away and rebuilt when requirements evolve. Plus, just more stuff is created and needs to be maintained. So in the end, your efficiency gains go out the window.

by ponector 4 days ago

In my experience, the challenge in software development is not to solve a problem, but to define the outcome, the scope, the acceptance criteria etc.

by majkinetor 4 days ago

Exactly, this is the hardest part and the reason why many projects fail.

by fendy3002 4 days ago

20x the cost means Fable needs to be 20x better than the alternative, which is a tall order. And there are more options out there too; perhaps the 4x-cost option is enough.

This means that if the DeepSeek / under-1k alternative delivers at least a 1.2x improvement, Fable needs to deliver 24x, which I think is very, very unreasonable. It could be worth it if it can 2x a $20k SWE, though I doubt it can do that.
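
To make the arithmetic above explicit, here is a minimal Python sketch. It assumes a simple break-even model in which value per unit of cost must at least match the cheaper alternative; the function name `break_even_multiplier` is made up for illustration, and the 20x and 1.2x figures are the ones from the comment, not measurements.

```python
def break_even_multiplier(cost_ratio: float, alternative_gain: float) -> float:
    """Productivity gain the expensive option needs to match the cheap one.

    Assumes value scales linearly with the gain and that the only thing
    compared is gain per unit of cost (a simplification, not a real model).
    """
    if cost_ratio <= 0 or alternative_gain <= 0:
        raise ValueError("cost_ratio and alternative_gain must be positive")
    return cost_ratio * alternative_gain


# Figures from the comment: 20x the cost, alternative gives a 1.2x improvement.
needed = break_even_multiplier(cost_ratio=20, alternative_gain=1.2)
print(f"Required gain: about {needed:.0f}x")
```

Under the same assumption, the 4x-cost option mentioned above would need roughly a 4.8x gain against that 1.2x baseline.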

by henry2023 5 days ago

“Ability to solve hard problems in days vs weeks has immense value”. Citation needed.

LLMs are incredible, don’t get me wrong, but they are good at tiny contexts (writing a script), not at software engineering (adding features to Chrome).

by AussieWog93 4 days ago

Honestly, LLMs have been OK at adding features to software since around Opus 4.5. From what I've tried of Fable, it's a decent step up from the Opus models and I can only see things getting better.

by system2 5 days ago

>pushback on a few points

Claude keeps telling me this when I argue with it. LMAO.

by UqWBcuFx6NV4r 5 days ago

“gently push back”