How is that a direct comparison? The link you gave has a quote that says it’s not:

> Scoped context: Our tests gave models the vulnerable function directly, often with contextual hints (e.g., "consider wraparound behavior"). A real autonomous discovery pipeline starts from a full codebase with no hints

They pointed the models at the known vulnerable functions and gave them a hint. The hint part is what really breaks this comparison because they were basically giving the model the answer.

by cyanydeez 2 hours ago

Does no one defending Mythos understand how nested for loops work?

loop through each repo:
    loop through each file:
        opencode command /find_wraparoundvulnerability
    next file
next repo

I can run this on my local LLM and sure, I gotta wait some time for it to complete, but I see zero distinguishing facts here.
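A minimal Python sketch of the nested loop above, for anyone who wants to try it. Everything named here is hypothetical: `scan_repos`, `iter_source_files`, and the `SOURCE_SUFFIXES` filter are illustrative, and the per-file step is passed in as a `scan_file` callable because the exact way to invoke opencode (or any local LLM) with a custom `/find_wraparoundvulnerability` command is not given in this thread.

```python
import os
from pathlib import Path
from typing import Callable, Dict, Iterable, Iterator

# Assumption: only C/C++ sources are worth scanning for wraparound bugs.
SOURCE_SUFFIXES = {".c", ".h", ".cc", ".cpp"}


def iter_source_files(repo: Path) -> Iterator[Path]:
    """Yield source files under one repo in a stable order, skipping .git."""
    for root, dirs, files in os.walk(repo):
        # Prune .git in place so os.walk does not descend into it.
        dirs[:] = sorted(d for d in dirs if d != ".git")
        for name in sorted(files):
            path = Path(root) / name
            if path.suffix in SOURCE_SUFFIXES:
                yield path


def scan_repos(
    repos: Iterable[Path],
    scan_file: Callable[[Path], str],
) -> Dict[Path, str]:
    """Outer loop over repos, inner loop over files, one scan call per file.

    scan_file is a placeholder for whatever actually asks the model about
    a file; it receives the file path and returns the model's answer.
    """
    results: Dict[Path, str] = {}
    for repo in repos:                          # loop through each repo
        for path in iter_source_files(repo):    # loop through each file
            results[path] = scan_file(path)     # the per-file model call
    return results
```

Usage would be something like `scan_repos([Path("some/repo")], my_scan)`, where `my_scan` shells out to the local model. Note that this sketch sends the same generic prompt for every file, so it only tests the no-custom-hints case.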

by Dylan16807 1 hour ago

The question is how customized those hints were. That changes whether looping over an entire code base is possible or not.

by u_fucking_dork 1 hour ago

Please do so. Looking forward to your write-up.