by _bin_ 284 days ago
by dkga 284 days ago
So, on an M4 I sometimes get faster training with plain vanilla JAX than with the same model in PyTorch or TensorFlow. And jax-metal often breaks :/
by _bin_ 284 days ago
No kidding? Might switch to CPU then. And yeah, jax-metal is so utterly unreliable. I ran across an issue that turns out to reduce to a two-line repro, and it has been open on GitHub for the better part of a year without updates.
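For anyone wanting to try the CPU route, here is a minimal sketch of pinning JAX to the CPU backend and timing a jitted training step. The tiny linear model, the shapes, the learning rate, and the names `loss_fn`, `train_step`, and `time_steps` are all made up for illustration; nothing here comes from the models discussed above, and the timings will say little about a real workload.

```python
import os

# Assumption: setting JAX_PLATFORMS before importing jax selects the CPU
# backend, even when an accelerator plugin such as jax-metal is installed.
os.environ["JAX_PLATFORMS"] = "cpu"

import time

import jax
import jax.numpy as jnp

LEARNING_RATE = 1e-2  # illustrative value only


def loss_fn(params, x, y):
    """Mean squared error of a toy linear model."""
    w, b = params
    return jnp.mean((x @ w + b - y) ** 2)


@jax.jit
def train_step(params, x, y):
    """One plain gradient-descent update."""
    grads = jax.grad(loss_fn)(params, x, y)
    return jax.tree_util.tree_map(
        lambda p, g: p - LEARNING_RATE * g, params, grads
    )


def time_steps(n_steps=100):
    """Return (seconds for n_steps updates, final params)."""
    kx, kw = jax.random.split(jax.random.PRNGKey(0))
    x = jax.random.normal(kx, (256, 32))
    y = x @ jax.random.normal(kw, (32, 1))
    params = (jnp.zeros((32, 1)), jnp.zeros((1,)))

    # Warm-up call so compilation time is not counted.
    params = jax.block_until_ready(train_step(params, x, y))

    start = time.perf_counter()
    for _ in range(n_steps):
        params = train_step(params, x, y)
    # JAX dispatches asynchronously, so wait for the result before stopping the clock.
    jax.block_until_ready(params)
    return time.perf_counter() - start, params


if __name__ == "__main__":
    elapsed, _ = time_steps()
    print(f"backend={jax.default_backend()} elapsed={elapsed:.4f}s")
```

The warm-up call and the final `block_until_ready` matter for a fair comparison against PyTorch or TensorFlow: without them the measurement mixes in compile time or stops before the work has actually finished.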