upvote
I didn't know model merging like that was possible. (Obviously possible from a pure software standpoint but I'm surprised it's effective)
reply
As another poster above linked, it’s been shown to be effective since 2022: https://arxiv.org/abs/2203.05482
reply
So the problem isn’t in the missing attribution to Qwen, but with the fact that they didn’t mention Nex-N2 Pro right?
reply
The problem is that they claimed to have made a big achievement with their home grown post training, and they expected to receive a lot of praise for it.

Then researchers looked at the weights and there is no post training at all.

They are now attributing both models they merged, but their excuse for the lack of post training is to claim they accidentally uploaded the wrong files.

reply
I’d believe they accidentally uploaded the wrong files if they uploaded the correct ones. To state that they accidentally uploaded something else and then not upload the correct version means they probably do not have anything and either hope people forget about this or they are scrambling to have something that is at least close to their original claim.
reply
deleted
reply
[dead]
reply