by johnvanommen | 1 hour ago

by layer8 | 1 hour ago

An unbelievably good deal at $4000 plus?

by johnvanommen | 1 hour ago

Possibly the best deal there is.

I really need to shut up, or bite the bullet and buy one.

If you graph the tokens per second on the 5090, your jaw will hit the floor at how cheap it is.

by gruez | 1 hour ago

With only 32 GB of VRAM, you can only run small or quantized models, in which case what's the point? At $4000, that gets you 20 months of 10x Claude or ChatGPT subscriptions, which provide far better models. You'd need a use case where you can tolerate worse models and would use them steadily. That doesn't match most people's usage patterns.
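The "20 months" figure above is a simple break-even calculation. A minimal sketch, assuming a $200/month subscription (the rate implied by $4000 over 20 months) and an optional, purely hypothetical monthly electricity cost for running the card:

```python
def break_even_months(gpu_price, subscription_per_month, electricity_per_month=0.0):
    """Months until a one-off GPU purchase costs the same as a subscription.

    electricity_per_month is a hypothetical running cost for the local card;
    it reduces the monthly saving versus the subscription.
    """
    saving = subscription_per_month - electricity_per_month
    if saving <= 0:
        # Running costs alone match or exceed the subscription: never breaks even.
        return float("inf")
    return gpu_price / saving


# $4000 card vs an assumed $200/month plan, ignoring electricity.
print(break_even_months(4000, 200))  # 20.0
```

Any non-zero electricity cost pushes the break-even point further out, and the comparison says nothing about model quality, which is the commenter's main objection.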

by regularfry | 2 minutes ago

If you can do what you need with qwen3.6-27b, it starts to look really interesting. That model is crazy good for the size, but it's a pain tweaking the params to run it on a 4090 with decent context and decent token speed. A 5090 looks tasty from that point of view, and only more so if you think in terms of the probability of that model being roflstomped by something in the same weight class in the next couple of years. I reckon that probability is significantly non-zero, but fundamentally it's a guess.
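To see why a 27B model is tight on a 4090 and more comfortable on a 5090, here is a rough sketch of the weight footprint at different precisions. It assumes a 24 GB 4090 (not stated in the thread) and counts weights only; KV cache, activations, and quantization-format overhead all eat further into the headroom.

```python
def weight_gb(params_billion, bits_per_weight):
    """Approximate size of the weights alone, in GB (1 GB = 1e9 bytes)."""
    return params_billion * bits_per_weight / 8


# Assumed VRAM sizes: 24 GB for the 4090, 32 GB for the 5090.
cards = (("4090", 24), ("5090", 32))

for bits in (16, 8, 4):
    size = weight_gb(27, bits)
    for card, vram in cards:
        headroom = vram - size  # negative means the weights alone do not fit
        print(f"{bits}-bit on {card}: weights {size:.1f} GB, headroom {headroom:.1f} GB")
```

Under these assumptions, 8-bit weights (27 GB) fit only on the 32 GB card, and 4-bit weights (13.5 GB) leave noticeably more room for context on the 5090 than on the 4090.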

by echoangle | 40 minutes ago

Or you want to process private data or don’t have reliable connectivity. There are a few more reasons for local models I think.

by EnPissant | 31 minutes ago

Also, electricity isn't free.

by Galanwe | 36 minutes ago

The 5090 is crap for inference, unless you like dumb models; sure, those will run at light speed. All the rage nowadays is MoE with 500B-1T weights.

by mattmanser | 1 hour ago

It's gone up like 300% in cost in the last year.

by JacobAsmuth | 1 hour ago

Which surely is the highest it'll ever be! You're suggesting that the price will go down in the future? Would love to hear more about your thought process!

by bcrosby95 | 1 hour ago

Are you saying we're entering a period where tech increases in price instead of decreases? I guess it depends upon time horizon, but your statement isn't very specific.

by johnvanommen | 1 hour ago

I believe MSRP is $2000, right?

by EnPissant | 1 hour ago

There was only a very brief time when it sold at MSRP ($2000, last fall). Even if you use that as the previous data point, it's only at about 200% of that price (roughly a 100% increase), not a 300% rise.
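"Percent increase" and "percent of the original price" are easy to conflate in this subthread. A quick check, assuming the $2000 MSRP and the roughly $4000 price quoted in the thread (actual street prices vary):

```python
def percent_increase(old, new):
    """Relative change from old to new, as a percentage."""
    return (new - old) / old * 100


def percent_of_original(old, new):
    """New price expressed as a percentage of the old price."""
    return new / old * 100


print(percent_increase(2000, 4000))     # 100.0
print(percent_of_original(2000, 4000))  # 200.0
```

A genuine 300% increase over a $4000 price today would imply a starting price of $1000, which is below the MSRP mentioned above.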

by forrestthewoods | 1 hour ago

If you can buy one!

The RTX 5090 is faster than an H200. It just has less RAM (32 GB vs 141 GB), doesn't have NVLink, and technically isn't allowed to be used in a datacenter.

The datacenter GPUs sell at an 80% margin. They're incredibly overpriced. But the laws of supply and demand are undefeated and so here we all are.

by alphabeta3r56 | 1 hour ago

> The RTX 5090 is faster than an H200. It just has less RAM

The H200 has HBM and much more 64-bit (FP64) compute.

by forrestthewoods | 44 minutes ago

Let me try again.

RTX 5090 has more CUDA cores that run at a higher clock speed. H200 has more RAM and significantly more RAM bandwidth.

Which one is net faster depends on your use case. But you may be very surprised that many workflows are faster on an RTX 5090!
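One way to make "depends on your use case" concrete is the usual rule of thumb for single-stream LLM decoding, which tends to be memory-bandwidth bound: each generated token reads every weight of a dense model once, so tokens per second cannot exceed bandwidth divided by weight size. The sketch below uses approximate published peak bandwidths (about 1792 GB/s for the RTX 5090 and about 4800 GB/s for the H200), treated here as assumptions, and a hypothetical 13.5 GB model.

```python
def decode_tokens_per_s_upper_bound(bandwidth_gb_s, weights_gb):
    """Bandwidth-bound ceiling on single-stream decode speed for a dense model.

    Assumes every weight is read once per generated token and ignores
    compute, KV-cache traffic, and kernel overheads.
    """
    return bandwidth_gb_s / weights_gb


# Assumed peak memory bandwidths in GB/s (approximate published specs).
gpus = {"RTX 5090": 1792, "H200": 4800}
weights_gb = 13.5  # hypothetical: a 27B model at 4 bits per weight

for name, bandwidth in gpus.items():
    ceiling = decode_tokens_per_s_upper_bound(bandwidth, weights_gb)
    print(f"{name}: at most ~{ceiling:.0f} tokens/s")
```

By this estimate the H200 has the higher ceiling for bandwidth-bound decoding, while workloads limited by shader throughput rather than memory traffic are where the 5090's core count and clock speed can win.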