undefined

points

[-]

Lol, I love that framing. Yeah, the small models have impressed me a lot during this work. The reasoning can be quite good, and definitely sufficient for a lot of cases. Just gotta nudge em back on track Every now and then and they'll figure it out.

by cornholio6 hours ago|

prev|

[-]

If I understood correctly, the model will get it right because it knows when it isn't right.

by zambelli6 hours ago|

parent|

[-]

Essentially, yes that's right! There's some subtlety in how to let it know it was wrong (returning things as tool errors because it trained on that), but that's the gist of it - sort of a self-correcting architecture.

by tomjakubowski3 hours ago|

parent|

prev|

[-]

https://en.wikipedia.org/wiki/Apophatic_theology

by jon_richards2 hours ago|

parent|

[-]

I was expecting this https://knowyourmeme.com/memes/the-missile-knows-where-it-is

by 3 hours ago|

prev|

[-]

deleted

by koolba5 hours ago|

prev|

[-]

A thousand monkeys on a thousand typewriters…

by zambelli5 hours ago|

parent|

[-]

That is the whole challenge, actually! A new metric I'm going to dogfood into forge is ETTWS - estimated time to working solution.

A simple retry loop around your whole workflow could, in some cases, be all you need. But it could mean many blind attempts to get through a workflow successfully. And hopefully there isn't a payment step partway through!

The fewer hard errors nix the whole workflow, the lower your ETTWS.

by DiogenesKynikos2 hours ago|

parent|

prev|

[-]

This is a thousand unusually smart monkeys who speak every major human language fluently and are proficient in every major programming language, but sometimes still make bizarre mistakes and need to be put back on track.

by jplusequalt1 hours ago|

parent|

[-]

This is fun for you?

by 5 hours ago|

parent|

prev|

[-]

deleted