by achrono 4 hours ago

by postalrat 25 minutes ago

If A goes to B who then goes to C does C know A?

by erdevs 4 hours ago

I am a human and I don't know how to interpret this prompt.

by rapatel0 3 hours ago

I ran the same query and there is a ton of output, but it looks like the model is reasoning through the ambiguity of the sentence. It still gets the right answer. Moreover, if we consider the FLOPs expended to get to the answer, and compare that to Opus, I think it's still a net win.

My hunch is that Opus-scale models probably have shortcuts encoded into the model that handle these ambiguous cases, whereas this model has learned a program to reason through the edge case (crystallized vs. fluid intelligence). It's remembering the probability (frontier) vs. calculating it on the fly (vibethink).
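
To put rough numbers on the FLOPs comparison, here is a back-of-the-envelope sketch. It assumes the common approximation of about 2 FLOPs per active parameter per generated token for a decoder forward pass, and it ignores the extra attention cost of long contexts. Every parameter count and token count below is a hypothetical placeholder, not a measurement; Opus's size is not public.

```python
def decode_flops(active_params: float, generated_tokens: int) -> float:
    """Approximate forward-pass FLOPs: ~2 per active parameter per generated token."""
    return 2.0 * active_params * generated_tokens


def breakeven_token_ratio(small_params: float, large_params: float) -> float:
    """How many times more tokens the small model can emit before it costs more."""
    return large_params / small_params


# Hypothetical numbers, chosen only for illustration.
SMALL_PARAMS = 1.5e9   # assumed small reasoning model
LARGE_PARAMS = 1.0e12  # assumed frontier-scale model (placeholder)

small_cost = decode_flops(SMALL_PARAMS, 20_000)  # long chain of thought
large_cost = decode_flops(LARGE_PARAMS, 500)     # short direct answer

print(f"small model: {small_cost:.2e} FLOPs")
print(f"large model: {large_cost:.2e} FLOPs")
print(f"large / small cost: {large_cost / small_cost:.1f}x")
print(f"break-even token ratio: {breakeven_token_ratio(SMALL_PARAMS, LARGE_PARAMS):.0f}x")
```

Under these assumed numbers the small model can spend 40x more tokens and still come out well ahead; the conclusion flips only if its token count exceeds the large model's by more than the parameter ratio.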

by nolist_policy4 hours ago|

prev|

[-]

> Multi-level Quality Control.

> [...]

> LLM-based Query Quality Filtering. We utilize capable LLMs to assess query quality, filtering out samples with incomplete descriptions, unreasonable conditions, invalid logic, or an inability to effectively assess target knowledge points.
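
A minimal sketch of what a filtering step like the quoted one could look like, not the paper's actual pipeline. The `Verdict` type, `filter_queries`, and `toy_judge` are hypothetical names; `toy_judge` is a rule-based stand-in for the LLM call so the example runs offline, and the rejection reasons simply mirror the four criteria in the quote.

```python
from dataclasses import dataclass
from typing import Callable, Iterable

# Rejection reasons mirroring the criteria listed in the quote (names are assumptions).
REJECT_REASONS = (
    "incomplete_description",
    "unreasonable_conditions",
    "invalid_logic",
    "does_not_assess_target",
)


@dataclass
class Verdict:
    keep: bool
    reason: str = ""


# A judge maps a query to a verdict; in practice this would wrap an LLM call.
Judge = Callable[[str], Verdict]


def filter_queries(queries: Iterable[str], judge: Judge):
    """Split queries into kept ones and (query, reason) pairs for dropped ones."""
    kept, dropped = [], []
    for query in queries:
        verdict = judge(query)
        if verdict.keep:
            kept.append(query)
        else:
            dropped.append((query, verdict.reason))
    return kept, dropped


def toy_judge(query: str) -> Verdict:
    """Hypothetical stand-in for an LLM judge: flags very short queries as incomplete."""
    if len(query.split()) < 4:
        return Verdict(keep=False, reason=REJECT_REASONS[0])
    return Verdict(keep=True)


kept, dropped = filter_queries(
    ["Solve x^2 - 4 = 0 for real x.", "Find x."],
    toy_judge,
)
print(kept)
print(dropped)
```

Keeping the judge as a plain callable means the same loop works with any model or with a cheap heuristic, and the recorded reasons make it possible to audit what the filter throws away.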