undefined

upvote

points

by nee1r2 days ago |

upvote

by ilaksh3 hours ago|

[-]

How do I access this? Any HF or API coming?

Any benchmark comparisons to Fara-7B or Sonnet 4.6, Qwen 3.5 etc.?

reply

upvote

by dangoodmanUT4 hours ago|

[-]

11 million hours of data is a lot, did you have to synthesize it at all, or was it purely collected?

reply

upvote

by nee1r2 hours ago|

[-]

collected! no synthetic

reply

upvote

by arkmm3 hours ago|

[-]

Get ready for the acquisition offers.

reply

upvote

by AndrewKemendo4 hours ago|

[-]

This looks like a really promising approach

In particular the Forward rollout module is very important. It aligns your (effectively) world model with what it expects from the world, and keeping those in sync I think gives this the power it needs to be able to generate the state action pairs to continuously train semi supervised

reply