by thesz 16 hours ago

by amelius 12 hours ago

I'm looking forward to the model.toVHDL() method in PyTorch.

by sowbug 6 hours ago

Ugh, quick, everyone start panic-buying FPGAs now.

by throwup238 4 hours ago

The largest FPGAs have on the order of tens of millions of logic cells/elements. They're not even remotely big enough to emulate these designs, except to validate small parts of them at a time, and unlike memory chips or GPUs, companies don't need millions of them to scale infrastructure.

(The chips also cost tens of thousands of dollars each.)

by 8note 4 hours ago

They also aren't power-friendly.

by Simboo 9 hours ago

Deep Differentiable Logic Gate Networks

by thesz 1 hour ago

I see you and I raise approximate logic synthesis [1] [2].

[1] https://www.sciencedirect.com/science/article/pii/S138376212...

[2] https://arxiv.org/abs/2506.22772

You can synthesize a logic circuit that is only as complex as it needs to be to reach a given accuracy.

Deep differentiable logic networks, in my experience, do not scale well to larger logic elements (those with more inputs). One still has to apply logic optimization and synthesis afterwards. So why not synthesize one's own approximate circuit to the accuracy one desires?
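
For readers who haven't seen the idea, here is a rough sketch of a "soft" two-input logic gate, written from general knowledge rather than taken from the linked papers. The names `SOFT_GATES`, `soft_gate` and `num_boolean_functions` are made up for illustration, and only four of the sixteen two-input gates are included to keep it short. It also shows one plausible (assumed) reason for the scaling problem: the number of candidate Boolean functions a learnable gate must choose among grows as 2^(2^k) with the number of inputs k.

```python
import math

# Real-valued relaxations of a few two-input gates for inputs in [0, 1].
# At the corners (inputs exactly 0 or 1) they match the Boolean truth tables.
SOFT_GATES = {
    "and": lambda a, b: a * b,
    "or": lambda a, b: a + b - a * b,
    "xor": lambda a, b: a + b - 2 * a * b,
    "nand": lambda a, b: 1 - a * b,
}

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def soft_gate(a, b, logits):
    """Softmax-weighted mixture of gate outputs; logits are the trainable part."""
    weights = softmax(logits)
    return sum(w * gate(a, b) for w, gate in zip(weights, SOFT_GATES.values()))

def num_boolean_functions(k):
    """Count of distinct Boolean functions of k inputs: 2 ** (2 ** k)."""
    return 2 ** (2 ** k)
```

After training, each gate would be snapped to its highest-weighted candidate, which leaves an ordinary netlist. With 2 inputs there are 16 candidates per gate, with 4 inputs there are already 65536, so the per-gate choice gets expensive quickly.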

by androiddrew 9 hours ago

Is this a thing?

by mikeurbach 4 hours ago

I gave a short talk about compiling PyTorch to Verilog at Latte '22. Back then we were just looking at a simple dot product operation, but the approach could theoretically scale up to whole models.

https://capra.cs.cornell.edu/latte22/paper/2.pdf

https://www.youtube.com/watch?v=QxwZpYfD60g

by cpldcpu 7 hours ago

They mentioned that they are using strong quantization (IIRC 3-bit) and that the model was degraded by it. Also, they don't have to use transistors to store the bits.
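
To give a feel for how coarse 3 bits is, here is a minimal sketch of symmetric uniform quantization. This is an assumed, generic scheme, not necessarily what they use, and `quantize` and `dequantize` are illustrative names. With 3 bits every weight has to land on one of only 8 integer codes, which is where the accuracy loss comes from.

```python
def quantize(weights, bits=3):
    """Map floats to signed integer codes using one shared scale (assumed scheme)."""
    qmax = 2 ** (bits - 1) - 1        # largest code, 3 when bits=3
    qmin = -(2 ** (bits - 1))         # smallest code, -4 when bits=3
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    # Round to the nearest code and clamp into the representable range.
    codes = [min(qmax, max(qmin, round(w / scale))) for w in weights]
    return codes, scale

def dequantize(codes, scale):
    """Recover approximate float weights from the integer codes."""
    return [c * scale for c in codes]

weights = [0.9, -0.3, 0.05, -0.9]
codes, scale = quantize(weights)
approx = dequantize(codes, scale)
```

In this example the small weight 0.05 rounds to code 0 and is lost entirely, which is the kind of error that adds up across a whole model.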

by mirekrusin 4 hours ago

gpt-oss is FP4. They're saying they'll next try a mid-size one (I'm guessing gpt-oss-20b), then a large one (I'm guessing gpt-oss-120b), as their hardware is FP4-friendly.

by amelius 5 hours ago

I think they are talking about the transistors that apply the weights to the inputs.

by cyanydeez 7 hours ago

What's the theoretical full-wafer-scale model they could produce?