undefined

upvote

points

by stephc_int133 hours ago |

upvote

by snek_case3 hours ago|

[-]

This would be an expensive benchmark to run on a regular basis, though I guess for the big AI labs it's nothing. Code quality is hard to objectively measure, however.

reply