So any tests done with models that have not been updated during the last days are no longer relevant and they must be repeated after updating the models and regenerating any other file formats, like GGUF files.
Not sure why (too amateur sorry).
Though I think qwen was natively trained on toolcalling.