undefined

points

[-]

Not these. I wonder if the well is poisoned there. The models know that these are "unpossible", so it might not solve them just because… Maybe some day.

I am just testing it on stuff I know intimately myself. I would probably not understand a proof of Collatz if it was dansing in front of me!

by komali25 days ago|

parent|

[-]

So, what kind of problems are you having it try to solve?

Sorry to belabor this but it's basically pointless saying you have nuts it can't crack without showing us the nuts.

by black_knight5 days ago|

parent|

[-]

I don’t care to share my exact problems. Mostly because gpt -5.5 hallucinates false solutions, and I would rather not have people reply with "Oh but ChatGPT solves it!", because it takes expert knowledge to debunk them. To their credit ChatGPT will admit their, very fundamental mistakes when pointed out to them. But also because no-one would really care.

I gave a high level description of the problems in a sibling thread. They are the kind of small problems which I suppose every researcher has lying around, waiting for them to think about some day. But not the big problem everyone is waiting for to be solved.

My comment was not meant to be a tease – sorry! I assumed there would be other people in a similar situation, who might relate.

by neonstatic5 days ago|

parent|

prev|

[-]

Bro, you are being left behind bro, it's amazing bro...

by Lerc5 days ago|

parent|

prev|

[-]

That's a bit of a tricky point. I have had quite a lot of problems with models informing me what I am attempting is impossible. If no-one has done it, or at least it doesn't know about it being done it tends to fall back on people voicing their baseless speculations, and for just about anything you propose, you can find a person who will loudly proclaim it is impossible.

The curse of the 'use case' comes in here too. When people think that everything should have a use case, that's a lot of training data suggesting to a model that things should only be used for what someone has already thought of.

A couple of times I have had to manually code proof of concept pieces so that the model breaks out of that "unpossible" mode and actually helps me.

I can't remember if it was chatGPT or Claude, but when I showed it how to get a MessagePort in its JavaScript executor through to the artifact/canvas, it quickly went from "That can't be done" to positively enthusiastic about the possibilities. I suspect those shenanigans will be well off the table for Fable though.

by unnouinceput5 days ago|

parent|

prev|

[-]

Stop dancing and share the prompt, we're dying to see it

by black_knight5 days ago|

parent|

[-]

Hey, stop asking to see my nuts! My nuts are private – okay?

(Joking aside, see sibling threads.)

by andriy_koval4 days ago|

prev|

[-]

> The Riemann hypothesis, PvNP, and the Collatz conjecture.

Did you add "make no mistake" to your prompt?

by mastermage5 days ago|

prev|

[-]

is this a joke? Seriously? These are some of hardest problems in Math period. 100 if not thousands of the greates minds in history have attempted to solve these problems. And you think that the current level of AI can blow through them? It is also a possibility that for example the Riemann Hypothesis is just not provable. (Goedels Theorem).

by black_knight5 days ago|

parent|

[-]

No one is expecting that! I expect _kb was sarcastic/making a point.

Recently (last couple of months?) these models are becoming useful tools for mathematicians, because they can solve easier problems more quickly, meaning that one can tackle bigger challenges (but maybe not RH et al) piece by piece.

But, there are still definite limits, where one could expect an expert human to solve things, given time, but models do not. Thus, more intelligence would be nice!

by mastermage5 days ago|

parent|

[-]

if it was sarcastic then whoosh on me.

by _kb4 days ago|

parent|

[-]

It was a bit of humour. It would be much for feasible to have an LLM generate programs that solve those problems rather than solving directly. I tried to make a start, but I couldn't even vibe a simple tool that would let me reliably validate if generated solvers would halt or loop forever.

by mastermage4 days ago|

parent|

[-]

> if generated solvers would halt or loop forever.

I am pretty sure this time I am catching the sarcasm here. Kudos you had me in the first half.

by moffkalast5 days ago|

prev|

[-]

Ayy lmao