undefined

points

by trostaft4 hours ago |

comments

by vatsachak4 hours ago|

[-]

Yeah, but I wouldn't be surprised if they train the model on verification assisted by Lean.

by trostaft3 hours ago|

parent|

[-]

Arguing similarly to how stockfish, the chess engine, trains I would not be surprised if this is more common in the future. I don't know if they use any proof verification tools during their reinforcement learning procedure right now, as far as I know they've been focusing more on COT based strategies (w/o Lean). But I'm hardly an LLM expert, I don't know.

by ComplexSystems3 hours ago|

prev|

[-]

That may be true for now, but it seems clear enough that letting the model use Lean in its internal reasoning process would be a great idea

by trostaft3 hours ago|

parent|

[-]

That I'd agree with! I really need to get around to learning Lean myself. It might be interesting to try and formalize some missing theoretical pieces from my field (or likely start smaller).