Hacker News
new
past
comments
ask
show
jobs
Quantization-Aware Distillation for NVFP4 Inference Accuracy Recovery [pdf]
(research.nvidia.com)
2 points
by
gmays
1 hours ago
|
0 comments