upvote
RL is more than facts. Synthetic feedback is an obvious approach. Does the model suggest code that compiles and performs well?
reply
Lot of the things aren't facts that could be stated. No one can just see the dictionary or translation of words and start talking in that language.

There isn't a clear definition of what is knowledge and what is intelligence. Is being able to write in C knowledge? Is knowing undefined behaviour in that knowledge?

reply