upvote
Try running CPU-only inference to troubleshoot that. GPU layers will likely just ignore mmap.
reply