And even if you had the same data, there's no guarantee the random perturbations during training are driven by a PRNG and done in a way that is reproducible.
Reproducibility does not make something open source. Reproducibility doesn't even necessarily make something free software (under the GNU interpretation). I mean hell, most docker containers aren't even hash-reproducible.
Deepseek published a lot of their work in this area earlier this year and as a result the barrier isn’t as high as it used to be.
Their publications about producing Gemma is not accurate enough that even with data you would get the same results.
Also, even if it were for fine tuning, that would require an implementation of the model’s forward pass (which is all that’s necessary to run it).