upvote
NVMEs are much, much slower than RAM. Especially unified/soldered RAM.
reply
To be fair, llama.cpp had this feature for over a year now. It just applies to GGUF.
reply
I got an m3, I will test it on metal and check how it goes
reply