upvote
Storing the partial derivatives into the weights structure is quite the hack, to be honest. But everybody seems to do it like that.
reply
Great work! Might do it too in some other language...
reply
I got a convertion to Java. It worked (at least I think...) in the first try.

Then I want to convert this to my own programming language (which traspiles to C). I like those tiny projects very much!

reply
Zig, here.

Anything but Python

reply
At least python can do this exercise without pulling 3rd party dependencies :)
reply
What's missing from Zig and its std lib for this?
reply
Zig version [0] doesn't need any external dependencies.

0. https://tangled.org/m17e.co/microgpt

reply