(Still compelling!)
To be fair, 2.5 kW does sound like too much for a single 3x3 cm chip; it would probably melt.
Yeah, though I suppose once we get properly 3D silicon I would not be surprised by a power rating like that; 3 cm^3 of compute would be something to behold.
With these speeds you could run it over USB 2.0, though power delivery might be the limiting factor.
But sure, the next generation could be much smaller. It doesn't require battery cells, (much) heat management, or ruggedization, all of which put hard limits on how much you can miniaturise power banks.
But as you said, the next generations are very likely to shrink (especially with them saying they want to run top-of-the-line models within two generations), and architecture improvements could probably shrink it even further.
Nowadays, your average cellphone has more computing power than those behemoths.
I have a microSD card with 256 GB capacity, and I think they go up to 2 TB now. On a device the size of a fingernail.
The form factor should be anything but a thumbdrive.
I haven't had my coffee yet. ;)
In fact, I was thinking that robots of the future could have such slots, letting them swap in different models depending on the task they're given. Like a hardware MoE.
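Something like this, as a very rough sketch (the Cartridge interface, slot names, and the task-to-slot mapping are all made up for illustration, not any real device API):

    # Hypothetical sketch: route a task to whichever model "cartridge" fits it.
    # Everything here is invented to illustrate the "hardware MoE" idea.
    from dataclasses import dataclass

    @dataclass
    class Cartridge:
        name: str
        specialty: str  # e.g. "vision", "navigation", "dialogue"

        def generate(self, prompt: str) -> str:
            # On real hardware this would hand the prompt to the chip in this slot.
            return f"[{self.name}] response to: {prompt}"

    class HardwareMoE:
        def __init__(self, slots: dict[str, Cartridge]):
            self.slots = slots  # task type -> installed cartridge

        def route(self, task_type: str, prompt: str) -> str:
            cartridge = self.slots.get(task_type)
            if cartridge is None:
                raise ValueError(f"no cartridge installed for {task_type!r}")
            return cartridge.generate(prompt)

    robot = HardwareMoE({
        "vision": Cartridge("vision-chip", "vision"),
        "dialogue": Cartridge("chat-chip", "dialogue"),
    })
    print(robot.route("dialogue", "Where should I put this box?"))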
Is this accurate? I don't know enough about hardware, but perhaps someone could clarify: how hard would it be to reverse engineer this to "leak" the model weights? Is it even possible?
There are some labs that sell access to their models (Mistral, Cohere, etc.) without having their models open. I could see a world where more companies do this if it turns out to be a viable way to ship models. Even to end customers, if reverse engineering is deemed impossible. You could have a device that does most of the inference locally and only "calls home" when stumped (think Alexa with local processing for intent detection and cloud processing for the rest, but better).
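Roughly this pattern, as a toy sketch (local_generate, cloud_generate, and the confidence threshold are placeholders I made up, not any real product's API):

    # Sketch of "local first, call home when stumped".
    CONFIDENCE_THRESHOLD = 0.7  # arbitrary cut-off, just for illustration

    def local_generate(prompt: str) -> tuple[str, float]:
        # Pretend the on-device model also reports a confidence score.
        return "local answer", 0.55

    def cloud_generate(prompt: str) -> str:
        # Network call to the vendor's hosted model; stubbed out here.
        return "cloud answer"

    def answer(prompt: str) -> str:
        text, confidence = local_generate(prompt)
        if confidence >= CONFIDENCE_THRESHOLD:
            return text                # good enough, never leaves the device
        return cloud_generate(prompt)  # stumped: fall back to the hosted model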
I doubt anyone would have the skills, wallet, and tools to RE one of these and extract model weights to run them on other hardware. Maybe state actors like the Chinese government or similar could pull that off.
I doubt it would scale linearly, but for home use 170 tokens/s at 2.5 W would be cool; 17 tokens/s at 0.25 W would be awesome.
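For what it's worth, the linear-scaling assumption just means constant energy per token, which is easy to sanity-check:

    # Energy per token under the (optimistic) linear-scaling assumption above.
    for watts, tok_per_s in [(2.5, 170), (0.25, 17)]:
        joules_per_token = watts / tok_per_s
        print(f"{watts} W @ {tok_per_s} tok/s -> {joules_per_token * 1000:.1f} mJ/token")
    # Both cases come out to ~14.7 mJ per token, i.e. the same efficiency,
    # just throttled down; real chips tend to get less efficient at low power.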
On the other hand, this may be a step towards positronic brains (https://en.wikipedia.org/wiki/Positronic_brain)