upvote
Yeah, but I wouldn't be surprised if they train the model on verification assisted by Lean.
reply
Arguing similarly to how stockfish, the chess engine, trains I would not be surprised if this is more common in the future. I don't know if they use any proof verification tools during their reinforcement learning procedure right now, as far as I know they've been focusing more on COT based strategies (w/o Lean). But I'm hardly an LLM expert, I don't know.
reply
That may be true for now, but it seems clear enough that letting the model use Lean in its internal reasoning process would be a great idea
reply
That I'd agree with! I really need to get around to learning Lean myself. It might be interesting to try and formalize some missing theoretical pieces from my field (or likely start smaller).
reply