undefined

points

by ai_slop_hater12 hours ago |

comments

by otabdeveloper412 hours ago|

[-]

MoE and such are basically performance enhancements, they don't make the model smarter.

by yababa_y12 hours ago|

parent|

[-]

separately trained experts can surpass performance in their activated regime and DOES result in a smarter model, the Claude system cards talk about this and eg there is https://openreview.net/forum?id=iydmH9boLb to read...

by jmalicki6 hours ago|

parent|

prev|

[-]

Performance enhancements are huge though.

If you can make the existing model faster, you can then save your inference budget to then make your model bigger, which then makes it smarter.

A lot of how smart the models can be comes down to budget. If you can make your existing thing cheaper, you can instead make it bigger for the same price.

by otabdeveloper45 hours ago|

parent|

[-]

> to then make your model bigger, which then makes it smarter

There's diminishing returns and at some point making a model bigger makes it dumber.

by TheHalfDeafChef5 hours ago|

parent|

prev|

[-]

Not really “smarter” though? It’s just a big probability engine.

(Not trying to flame bait or anything. I just wouldn’t call LLM as exhibiting intelligence. It is great at making connections based on probability but doesn’t have a semantic understanding of what it is doing)