undefined

upvote

points

by sharkjacobs1 days ago |

upvote

by zgao1 days ago|

[-]

Chip CEO here. It really depends on what "design" or "production" means. Does "design" mean that the design was complete? Does "production" mean the beginning of production, i.e. tapeout? If measuring from RTL-freeze to tapeout, this is a fairly typical (even somewhat unimpressive) timeline (accounting for some unexpected issues) for a large, complex 3nm chip. If measuring from concept (no RTL at all, block diagram of architecture) to tapeout, this is an amazing timeline. The truth is probably somewhere in between. A more concrete statement would use actual technical milestones and gates.

reply

upvote

by otterdude23 hours ago|

[-]

Not a chip CEO, but I read this article and thought that they're working on some kind of application specific chip only for serving models. Similar to how an FPGA can optimize certain tasks.

Given constant weights / biases of a Transformer / DNN you could use pipelining to feed forward calculations through the array one layer at a time. For DNN's with thousands of layers you might see 1:1 speed up per layer channel.

I doubt they would undergo this process for marginal gains.

reply

upvote

by kmacdough9 hours ago|

[-]

With a striking lack of numbers, I'm not confident. I my experience, everything underspecified in a marketing release is unflattering. They're also not a chip designing company, but they're probably trying to keep up on the eyes of investors. As the article mentions, several of their competitors are chip designers and already have working procuction inference chips.

reply

upvote

by SwellJoe9 hours ago|

[-]

When you have a few billion dollars you can hire chip people and partner with a chip company.

That's not to say I expect they'll ship something competitive with Google's custom AI hardware on the first go, since Google has been at it for quite a while, but there's very few technical problems large sums of money won't solve.

reply

upvote

by IX-1037 hours ago|

[-]

Yeah, I'm not sure how competitive it is without any specs. Just from it being "inference only" that puts it on the same level as Google's 2015 TPUv1.

reply

upvote

by zgao21 hours ago|

[-]

Yes, my statement was not about the quality or performance of the chip -- simply the tapeout timeline that was stated, by itself.

reply

upvote

by xdavidliu23 hours ago|

[-]

i don't understand what the second paragraph is saying.

reply

upvote

by nine_k22 hours ago|

[-]

In very crude terms, AFAICT, if you have a bunch of matrix multiplications, but one of matrices (the one with model weights) doesn't change, you can seriously speed up the computation. One thing is that you don't need to re-fetch the elements of the constant matrix, you can keep it near the ALUs. Then you maybe can detect and ignore sparse / empty blocks by marking them once.

IDK how the custom hardware exploits this; would love to hear any ideas!

reply

upvote

by guyomes21 hours ago|

[-]

> IDK how the custom hardware exploits this; would love to hear any ideas!

You might like this article [1], titled "FPGA-based CNN Acceleration using Pattern-Aware Pruning". More context and details can be found in the PhD thesis of Léo Pradels [2].

[1]: https://inria.hal.science/hal-04689673/document

[2]: https://theses.hal.science/tel-05021575v1/file/PRADELS_Leo.p...

reply

upvote

by fulafel3 hours ago|

[-]

Current accelerators (TPUs, various onchip NPUs) are something close to this. Systolic array is the estabilished computer architecture term for flowing data from computation to computation without the overhead of a register file or von Neumann bottleneck.

reply

upvote

by cm218719 hours ago|

[-]

Random thought. Once models stabilise, could you possibly hardcode the model in gates? Or are they too large for a single chip?

reply

upvote

by 8note19 hours ago|

[-]

https://www.anuragk.com/blog/posts/Taalas.html

reply

upvote

by lsaferite14 hours ago|

[-]

https://taalas.com/

reply

upvote

by jwHollister13 hours ago|

[-]

wow if they can get something like this working, what happens to all this infrastructure? Hyperscalers have to be assuming the lifespan of that stuff wrong considering the next gen will be 1000x more efficient.

reply

upvote

by otterley13 hours ago|

[-]

The question isn’t whether it works (it does); the question is whether there are buyers for hardware that is obsolete the day it ships. Models evolve much more quickly than hardware can keep up.

reply

upvote

by simondotau12 hours ago|

[-]

Presumably at some point the rapid progress of models will plateau, at least insofar as a model could be frozen in time and remain economically useful for the expected life of hardware. Especially if it comes with compelling benefits e.g. dramatically lower latency and/or dramatically higher performance per watt.

If you can build chips that could run one specific LLM 100x faster than anything else, it would have a use case that nothing else could match.

reply

upvote

by lsaferite5 hours ago|

[-]

Those taalus chips apparently run at 1/10 the power as the current SOTA GPU setups. If they can execute even partially on their plan, it'll be a literal game changer.

reply

upvote

by fragmede11 hours ago|

[-]

https://www.cerebras.ai/ is exactly that! Holy shit it's fast.

reply

upvote

by otterley6 hours ago|

[-]

Cerebras is not that. Cerebras isn’t tied to a particular model like Taalas is. The latter is even faster than Cerebras.

reply

upvote

by wrsh076 hours ago|

[-]

Right, but there exist problems that need to be routinely solved and can be solved on glm 5.2. is the model state of the art when it is published? No. But when it comes out you could optimize it and let your solver run forever for quite cheap, and that could be useful if the only problems you want it to solve (for cheap) are solvable by that model.

And the high water mark of what can be solved by open models will keep going up.

reply

upvote

by indigo94510 hours ago|

[-]

One obvious use case is edge computing, such as in industrial applications that cannot tolerate the risk of a network link or cloud service going down. Even embedded use cases are possible, such as an image classifier model in a security camera.

reply

upvote

by cm21879 hours ago|

[-]

In fact any application where the task is stable and the model good enough to address that task. As you suggest, industrial applications where a robot must deal with variants of the same repetitive task. Or a military drone which needs to be jamming proof.

reply

upvote

by Someone8 hours ago|

[-]

> Or a military drone which needs to be jamming proof.

That, if used in war, I would think, would need the ability to be updated frequently. For example, your enemy might find out (say by running tests on hardware they captured from you) that painting some red paint in a particular shape (a smiley might even work) on their hardware prevented your drones from attacking them because it confuses that pattern with the Red Cross logo.

reply

upvote

by 5423542342352 hours ago|

[-]

Those are really two different things. One is the computer vision that could be “hard coded” and the other is the image library, that would be updated regularly. Look at facial recognition. You can download and run a facial recognition LLM on your GPU that looks at a library of your personal photos. The LLM doesn’t change when it scans your photos for faces, it just writes the data associated with a “face” to whatever library. When you add a new picture, it adds that face data and compares it to the library for a match. The actual LLM never needs to change. It is the same as the one I downloaded and ran on my GPU for my photos. If it was written on chips we both bought and installed, it would work the same way.[1]

[1] Yes, this is a massive simplification

reply

upvote

by TeMPOraL4 hours ago|

[-]

You keep the "reasoning core" burned and play the cat-and-mouse game at the I/O edge. Enemy invents a smiley shield, your R&D figures out some filtering step that defeats this effect without compromising general image recognition. Then the enemy figures out a new trick, your R&D invents a countermeasure, and so on - point is, this can happen for a long time in layers on top of the core model. If the enemy invents some robust way to attack the core that cannot be filtered out, it's game over for that hardware, but that is a much more difficult task and might take longer than expected service time of a given batch of drones.

reply

upvote

by SoftTalker2 hours ago|

[-]

Sort of mirrors how biological organisms work. E.g. in a bird, the core functionality of knowing how to fly is burned in. Hunting food is probably a combination of experiential learning on top of instinctive behavior, and is somewhat adaptable to local conditions.

reply

upvote

by largbae5 hours ago|

[-]

There may be all sorts of stable use case models that this could be interesting for. Imagine permanent voice translation circuits at a tiny fraction of the current price, glasses that subtitle the world with long battery life.

reply

upvote

by lsaferite6 hours ago|

[-]

They are betting on fast release cycles coupled with much lower costs (purchase and operations) mixed with the ability to have dynamic fine tunes on top of the static model.

reply

upvote

by SwellJoe9 hours ago|

[-]

The models have to run on something or they're useless. They can't run on future hardware today, and people want to use models today. So, if hardware is obsolete the day it ships, we're all using obsolete hardware, and there's no alternative to that.

reply

upvote

by otterley6 hours ago|

[-]

Taalas encodes the model into the hardware itself. The two are inextricably coupled. It’s like buying a CNC router that can’t be reprogrammed to build anything other than a specific predetermined kitchen cabinet. And the model used inside is frozen many months before the hardware ships, since the process from tapeout to production takes that long.

In contrast, tomorrow’s models will typically run, although perhaps more slowly, on general-purpose inference hardware that was released today or even years ago.

reply

upvote

by otterdude22 hours ago|

[-]

Basically getting around the branch predictor problem with generalized compute architectures https://en.wikipedia.org/wiki/Branch_predictor

reply

upvote

by pama17 hours ago|

[-]

If you look at the timelines for the hiring of the hardware team, this was an extremely fast and high risk implementation from concept to tapeout. Amazing it works at all during bringup.

reply

upvote

by nonethewiser23 hours ago|

[-]

>If measuring from RTL-freeze to tapeout, this is a fairly typical (even somewhat unimpressive) timeline (accounting for some unexpected issues) for a large, complex 3nm chip.

Even for a company’s first design?

reply

upvote

by hailwren23 hours ago|

[-]

I don't think you get the newcomer novelty buff when your val approaches 13 digits.

reply

upvote

by RugnirViking21 hours ago|

[-]

Big companies are lumbering behemoth, crude assemblages of barely cobbled-together incentives and principal agent problems in a trenchcoat. Getting them to change direction, or worse, try something new at scale, is a massive undertaking

reply

upvote

by mlinhares20 hours ago|

[-]

Nah, you just need to get the CEO behind it. Most coordination issues get solved when the CEO is breathing down your neck to get something done. Trouble is that they don't do this enough.

reply

upvote

by eru12 hours ago|

[-]

CEOs have limited bandwidth, and can only breath down so many necks at once.

reply

upvote

by NBJack19 hours ago|

[-]

Eh, zero guarantees on that one.

The Fire Phone was Jeff Bezos' personal baby, and we know how that went. Then there was the Apple G4 Cube with Steve Jobs, the Model X' Falcon Wing doors and Elon, and lets not even talk about the Metaverse and Zuck.

reply

upvote

by aleph_minus_one19 hours ago|

[-]

> The Fire Phone was Jeff Bezos' personal baby, and we know how that went.

I'd rather guess that Jeff Bezos' opinion on what makes a good phone is/was different on the opinion of many potential buyers.

reply

upvote

by AtlasBarfed2 hours ago|

[-]

An Amazon phone with Amazon Video, playing Amazon Music, making phone calls throug the Amazon messenger, with an Amazon Browser that overlays ads to Amazon products, and has Amazon Voice Recognition ... blah blah blah

I imagine when you are a billionaire from one company, every time you hear the name of the company you hear your name, so you can't really think about what Joe Schmoe wants in a phone independently of your ego.

I guess this is what Steve Jobs was better at. SOME focus on the customer independent of his ego and Apple Apple Apple. I did say ... SOME.

reply

upvote

by kQq9oHeAz6wLLS18 hours ago|

[-]

Actually, you've provided examples that prove the point. None of those were especially good (though everyone wanted the G4 Cube), and yet they made it to market anyway. Why?

Because the CEO was behind it, breathing down their necks.

reply

upvote

by NBJack13 hours ago|

[-]

Pretty much every example is considered an abysmal failure that often costed the actual workers their careers while their CEO carried on.

If you consider that outcome a worthwhile endeavor, I don't know what else to say.

reply

upvote

by inemesitaffia12 hours ago|

[-]

He's definitely not talking about worthy endeavour.

He's talking about an endeavour reaching the market.

I'm sure if Zuckerberg wants to spend $10B on Nuclear Fusion it will happen.

reply

upvote

by DANmode6 hours ago|

[-]

It’s fission, not fusion:

https://www.esgdive.com/news/meta-inks-nuclear-deals-terrapo...

…and if they do all of this, it’ll be closer to $20B than 10!

reply

upvote

by TeMPOraL4 hours ago|

[-]

If all it took to get viable fusion power was a FAANG CEO with $10B to burn, I'd be first to petition for it to happen, and even throw whatever money I can spare onto that pyre.

reply

upvote

by zgao21 hours ago|

[-]

The typical way a chip effort in a non-chip company works is that the "design" is the RTL (e.g. SystemVerilog that defines the behavior of the chip) and then this is handed off to a third-party "design house" (such as Broadcom) that turns that code into a real image of a chip, which is called a GDS (basically you can think of this as a very big layer by layer photoshop file) that can actually be sent to a fab. This is called "backend design", in contrast to the "frontend design" (the RTL itself).

As another commenter said, Broadcom is very experienced with backend design (as well as the supply chain management, testing, etc. that comes after the chip is taped out) and so this can't be regarded as a "first chip". Richard Ho (the head of hardware at OpenAI) is also extremely experienced and used to be the head of the Google TPU effort -- where he actually worked with Broadcom in a similar tapeout already. So yes, this is not a "first design"!

reply

upvote

by surajrmal19 hours ago|

[-]

I wonder if broadcomm borrowed IP between the Google tpu and this design. How would you ever know it didn't happen?

reply

upvote

by zgao16 hours ago|

[-]

There is no real way to prevent this, but there are ways to increase the cost of doing so. For example, one level of obfuscation is, OAI could internally run synthesis and adopt a “netlist-in” model in which Broadcom gets a netlist - a description of a huge amount of gates and wires and how they connect - instead of the plain Verilog (or other language). It is possible to reverse engineer the netlist, but it’s a certain level of indirection and effort.

A big part of the semiconductor industry also operates on a reputation basis. Broadcom (like TSMC) is a neutral party as a design house, but if they did something like this, it might ruin that reputation.

reply

upvote

by 14 hours ago|

[-]

deleted

reply

upvote

by kQq9oHeAz6wLLS18 hours ago|

[-]

More likely that the AI training set contained the IP of others, and we all know how that turns out.

reply

upvote

by formerly_proven23 hours ago|

[-]

This isn't Broadcom's first design.

reply

upvote

by swiftcoder22 hours ago|

[-]

Yeah, "first chip" here likely means they contracted Broadcom (or a firm with similar experience) to do all the heavy lifting. Building out your own in-house teams for this sort of thing is a decade-long project - just look how much inside Apple's early chips was licensed ARM / PowerVR cores

reply

upvote

by MisterTea21 hours ago|

[-]

Apple didn't have the talent in-house until they bought Intrincity who worked with Samsung on Apple's earlier Arm chips as well. https://en.wikipedia.org/wiki/Intrinsity

reply

upvote

by donavanm15 hours ago|

[-]

That’s not quite fair. As I recall there were about 1,500 people in that part of the hardware org circa mid 2000s. Before PA Semi there were pretty established teams already doing VLSI/PD/verification/validation, PCB, and of course analog/mixed hardware, in their own work and in conjunction with samsung, old broadcom, qualcomm, etc. Lots of inhouse work went in to all those bespoke monitors, phones, apple tv, airports, etc etc.

My recollection is that PA Semi was very much for the architectural and design talent, even though it was an “asset purchase” and all the existing Power & military chips were hived off.

For Intrinsity I recall a lot of interest was actually in their existing graphics work and EDA. ISTR that those early mobile GPUs were what they focused on.

I was in the mansfield org circa ‘07-11. I spent a lot of time flying between cupertino and austin/bee caves that first year.

reply

upvote

by selectodude21 hours ago|

[-]

I think the folks at PA Semi had some chops too.

reply

upvote

by reinitctxoffset19 hours ago|

[-]

The way I heard it PA Semi was the singular driving force that led to Apple Silicon, but I'm not any kind of insider that's just the chatter I heard.

Whoever it was, whooo, that's hot shit. I remember an M1 MacBook Air just cleaning the clock of an Intel MacBook Pro and thinking "x86_64 has real competition again".

Great silicon. I'm over it with not having root on my own machine, so I've left the ecosystem, but it's really nice hardware, can't dispute that.

reply

upvote

by markhahn2 hours ago|

[-]

it would be interesting to know apple's true/inside attitude towards people putting linux on their hardware. they don't seem very interested in helping, but donno whether they actively sabotage either.

reply

upvote

by re-thc17 hours ago|

[-]

> The way I heard it PA Semi was the singular driving force that led to Apple Silicon

And a lot of them are sitting under Qualcomm via the Nuvia acquisition.

reply

upvote

by stinkbeetle20 hours ago|

[-]

PA Semi group did the logic designs. I think they're talking about physical design though.

reply

upvote

by dndmfnfn17 hours ago|

[-]

[dead]

reply

upvote

by Aurornis1 days ago|

[-]

The hardware description languages (HDL) used in chip development are like programming languages. The existing models understand them and can do a lot with them. You don’t need to have separate, specialty models designed for this work to use LLMs in chip design workflows.

Design verification also involves a lot of traditional programming which benefits from LLMs.

So it’s not meaningless at all. You could download some of the open source chip design software today and the LLMs could even help you get started on your own tiny chip if you are so interested.

reply

upvote

by knicholes1 days ago|

[-]

I tried making a button using Claude entirely (including the 3D printed enclosure) and it effed up pretty hard with the traces and the header spacing. The project was a big red arcade button that plays the "ah-my-groin.mp3" when pushed (from Simpsons). It did cool work on saving battery life, and the 3d enclosure was awesome, but yeah, I'm convinced I'd have to do another version or two of the custom chip until it came back right. I used a Blender MCP for the 3d modeling. I used a KiCAD MCP server for the chip design/validation.

I think we're not there yet. I've been meaning to look at this flux.ai to see if it has the prompts/workflow worked out better than what I was able to cobble together in a few hours. Maybe Alteryx's MCP server would have been better. I'll try that this weekend for another board I've got.

reply

upvote

by Aurornis1 days ago|

[-]

> I tried making a button using Claude entirely (including the 3D printed enclosure) and it effed up pretty hard with the traces and the header spacing.

PCB design and 3D CAD design are different topics.

Hardware Description Languages are closer to programming languages than CAD. Look at some Verilog to get an idea - https://en.wikipedia.org/wiki/Verilog

reply

upvote

by knicholes1 days ago|

[-]

Right. KiCAD for PCB design. Blender for 3D CAD. Oh, are you saying I should have used something other than the KiCAD MCP server for better results?

reply

upvote

by VorpalWay22 hours ago|

[-]

Designing circuit board and 3D models (even using something like OpenSCAD) is a very spatial process today. You are dealing with coordinates one way or another.

This is very unlike how FPGA and (I assume) ASIC is done. That is more like a traditional programming language but everything happens all at once (no sequence of statements outside tests, if you need that you have to write a state machine yourself). You define logic expressions between signal, add stateful latches, etc. But you never specify the physical layout.

Instead you feed your description to a tool that acts a constraint solver/optimiser that computes the layout for you (this is for FPGAs called synthesising IIRC, it is akin to a compiler). Typically quite slow, even for small circuts like we did at university it took minutes, and for large circuits it might easily days.

Now, this raises the question, what if you design a PCB net list using AI, but then use traditional autorouting and layout? I believe that can also be done, but I have no experience designing PCBs, so I don't know how well it works.

reply

upvote

by dubbie999 hours ago|

[-]

Autorouting PCBs doesn’t really give usable results on all but the simplest cases. It seems to be a very difficult problem to solve even though a human doing it is only following a relatively simple bunch of rules and goals in his or her head.

reply

upvote

by TeMPOraL4 hours ago|

[-]

Simple bunch of rules and goals backed by extremely sophisticated visual intuition.

Pretty sure someone already tried throwing VLMs and diffusion models at this, wonder how that fared.

reply

upvote

by quadrature23 hours ago|

[-]

VHDL is not a language for spatial design. Its more akin to a programming language with circuit semantics.

reply

upvote

by y1n021 hours ago|

[-]

For digital chip design, research Verilog and/or SystemVerilog, and for tools, check out verilator and the OSS cad suite: https://github.com/YosysHQ/oss-cad-suite-build

reply

upvote

by dcrazy23 hours ago|

[-]

They’re saying that VHDL is an entirely different concept than physical modeling.

reply

upvote

by giancarlostoro23 hours ago|

[-]

You're comparing apples and oranges.

reply

upvote

by cwillu23 hours ago|

[-]

Meta: can we not downvote people who are clarifying what they're saying and asking questions, even if they're wrong about something, if the content isn't otherwise objectionable?

reply

upvote

by baq22 hours ago|

[-]

I didn’t downvote, but the OP is either a troll or someone who doesn’t want to notice he doesn’t know what he’s talking about. Either way we want less of that on HN.

reply

upvote

by knicholes22 hours ago|

[-]

I'll acknowledge that I don't know what I'm talking about. I really appreciated the clarity! Surely you find value in knowing that creating your own custom chips is almost doable by someone who doesn't know what they're talking about! (also, I am a troll, but in this case, just clueless)

reply

upvote

by Lukas_Skywalker22 hours ago|

[-]

Maybe the confusion stems from the word "chip". Creating a chip usually means designing and producing a microcontroller or a processor, not a printed circuit board that you populate with existing chips.

reply

upvote

by knicholes20 hours ago|

[-]

Ohhhhhh! Yes, that's exactly the problem. It all makes sense now. I was just piecing together an existing microcontroller and a mp3 module by printing a custom circuit board.

reply

upvote

by 20 hours ago|

[-]

deleted

reply

upvote

by tamimio23 hours ago|

[-]

One (kicad) make the board, the other (blender) make the casing for it. Both are “hardware” but is electronics and the other is mechanical. Electronic one AI can do a good job, I can’t wait for it to fully built the whole circuit for you based on your specs.

reply

upvote

by rpcope123 hours ago|

[-]

PCB layout is an art, and doesn't seem to map well to LLMs (I tried for shits and giggles recently). Claude in general, kind of like code, does a lot of redundant belt and suspenders stuff in the schematics it generates (if it can generate them at all). It's one of those things that's really not there yet outside of the simplest designs.

reply

upvote

by BioGeek18 hours ago|

[-]

DeepPCB has an AI autorouter [1] that uses reinforcment learning and works really well. Recently they also released an AI agent that analyzes your board, proposes plans and can route your board for you [2]. They have a KiCad plugin [3] and you can try it for free.

[1] https://deeppcb.ai/reinforcement-learning-pcb-routing-explai... [2] https://deeppcb.ai/cooper/ [3] https://deeppcb.ai/deeppcb-kicad-plugin-ai-pcb-routing/

Disclaimer: I work at InstaDeep, the company behind DeepPCB, but I don't work on this product.

reply

upvote

by chamomeal14 hours ago|

[-]

Sounds like a super cool project. Gonna post the design anywhere?

reply

upvote

by knicholes1 hours ago|

[-]

I'll update it this weekend with the updated AI-generated fun (and correct the flat-out ai-generated lies in the README). Meanwhile, you can see the project here. https://github.com/knicholes/ah-my-groin-button

reply

upvote

by ses19841 days ago|

[-]

The question isn’t whether or not they employed a particular tool, the question is how big of an impact did it have.

reply

upvote

[-]

deleted

reply

upvote

by nradov23 hours ago|

[-]

Most HDL code is locked up behind corporate firewalls and not available as training data. While LLMs can handle it to an extent there's a lot of room for improvement. I'll bet that OpenAI and their competitors are racing to license this IP from major hardware vendors in order to compete in the chip design vertical.

reply

upvote

by tonfa21 hours ago|

[-]

Does it work better when using compiler based ecosystem (e.g. https://github.com/llvm/circt)

reply

upvote

by bsder20 hours ago|

[-]

There is quite a lot of Verilog/SystemVerilog and VHDL code in the wild. And hardware description language code is very simple and straightforward relative to programming code.

And the two things that take up VAST amounts of time in ASIC design are testbenches and timing closure.

A LOT of hardware design is testbenches to verify things. AI is REALLY GOOD at generating things like testbenches. And nobody really cares if the quality of your testbench code sucks as long as it validates what it claims to.

I don't know how good AI is at timing closure, but I wouldn't necessarily be surprised if it is pretty good at it up to the physical point. That's lots of textual output which you can put a constraint on.

Everything involving physical design, though, tends to be a disaster waiting to happen if you let AI loose on it.

reply

upvote

by doxeddaily23 hours ago|

[-]

This reminds me of the dude on youtube building a chip fab in his shed.

reply

upvote

by einpoklum6 hours ago|

[-]

> The existing models understand them

No they don't.

reply

upvote

by holoduke20 hours ago|

[-]

One day we can design our own pcb with chips, hardware and other io. Companies will accept these as files and you can collect your pcb the same day. I think in China they are doing this already

reply

upvote

by remexre17 hours ago|

[-]

hasn't pcbway been doing this for years?

reply

upvote

by IshKebab1 days ago|

[-]

> The existing models understand them and can do a lot with them.

In my experience they are not especially good at SystemVerilog. There's a lot of knowledge about it that is locked behind paywalls and it's very niche.

My guess is the "from scratch" here is quite the exaggeration. Otherwise why did they need Broadcom?

reply

upvote

by whynotminot1 days ago|

[-]

Doesn’t Broadcom bring a lot more to bear here than just Verilog? Including relationships with the actual fabricators.

reply

upvote

by IshKebab11 hours ago|

[-]

I doubt that is really significant - fabs are happy to work with anybody. What they will likely bring is:

* Physical design team (stupidly known as the "backend"). This is extremely specialised knowledge and most chip companies don't really want to have to deal with it if they can avoid it.

* IP blocks. Especially for annoying things like phys, memory controllers, USB controllers, PLLs, power, etc. These things are difficult to do, difficult to test, and often critical (good luck if your clock doesn't work...) I would not at all be surprised if Broadcom supplied CPUs too.

My total guess at what happened is Broadcom supplied most of a SoC and OpenAI added an LLM coprocessor module to it, and probably asked them to add like 10x more DRAM interfaces.

reply

upvote

by aseipp22 hours ago|

[-]

Not having a free toolchain that can actually handle the real language has probably been pretty bad on the downstream public knowledgebase. Hopefully Verilator can eventually close that hole, and there can be more high-quality designs and codebases incorporated into future models. Claude is at least good enough to write SV that triggered a compiler crash or two. :)

reply

upvote

by cloudengineer948 hours ago|

[-]

Broadcom also has direct allocation with TSMC, which is a big win

reply

upvote

by 21 hours ago|

[-]

deleted

reply

upvote

by aurareturn22 hours ago|

[-]

Broadcom already has a ton of IP for AI SoCs. I'm guessing the hard parts of this inference chip was already designed by Broadcom and OpenAI simply told Broadcom what it wanted. It's likely very similar to Google's TPU.

  Early testing shows that the first-generation accelerator will deliver performance per watt substantially better than current state-of-the-art

What is substantial here? Vera Rubin is shipping in volume later this year and it is expected to be 10x more power efficient for inference than Blackwell.[0] Even if they're already taped out the chip, getting bugs fixed, getting chips manufactured, getting HBM allocation, getting a rack design, hooking them up together, putting them in a data center will likely take at least another 12 months or likely more. By the time this chip is in data centers in volume, they're likely competing against Vera Rubin Ultra or maybe even Feynman.

Personally, I don't think OpenAI should have invested in this project. It's too early for them. They should have focused on models like Anthropic and win there. When they're profitable, they can take on these projects.

The risk here is very high for OpenAI because AI has a hard cap in energy. If you have a gigawatt, you should only install the best chips. If Nvidia's chips are better, then this is a wasted project and likely wasted billions.

[0]https://developer.nvidia.com/blog/scaling-token-factory-reve...

reply

upvote

by cptskippy22 hours ago|

[-]

Why do you assume Broadcom has a ton of IP for AI SoCs but hasn't done any of the other work around data center scale deployments?

reply

upvote

by aurareturn22 hours ago|

[-]

They have. That's why OpenAI was able to get a working demo in 9 months. But going from a small scale system to a full fledged data center deployment is likely much harder.

I don't know how much of the things outside of the chip Broadcom has vs Google's proprietary tech that is not shared with Broadcom.

Nvidia's Vera Rubin has 6 unique chips working together in a single rack.[0]

[0]https://developer-blogs.nvidia.com/wp-content/uploads/2026/0...

reply

upvote

by threecheese21 hours ago|

[-]

I’m just happy to see diversity here; sometimes I feel like Nvidia is going to eat the world, with buying other fabs and branching out - or up, I guess - from chips and racks to models, frameworks, and end user stuff.

reply

upvote

by surajrmal19 hours ago|

[-]

I thought most of the Google tpu magic is on wiring up these chips into supercomputer like clusters with specialized interconnects and whatnot. The chips themselves are less interesting in isolation.

reply

upvote

by luma17 hours ago|

[-]

I know nothing of what is happening here but Broadcom has a lot of IP in high speed/low latency data transfer from chip to datacenter scales.

reply

upvote

by AtlasBarfed2 hours ago|

[-]

"Substantial" seems like a damning word.

So one of my pet theories I haven't seen in general discourse is that AI came from the massive vector processing jump available commercially in GPUs when it left CPU bound processing behind. That's a factor of 100x-1000x of processing power.

AI is not-quite-there, and to get even another leap might take another 10-100x processing power.

Now... what? ASICs probably won't deliver even a 10x? There's only so much you get out of node shrinks.

"Substantial" doesn't even mean twice IMO. "Substantial" almost sounds like ... 15% better?

reply

upvote

by dofm1 days ago|

[-]

Right. There are two possible meanings and shades in-between:

1) OpenAI genuinely have AI technologies that can improve chip design (bold, unlikely claim, needs evidence)

2) OpenAI designed test/verification models and kernels that could be run on the simulated hardware to test its performance

As you and others have said, it's hard to trust when they are happy to write something that could easily only mean the latter but sounds like the former.

reply

upvote

by lovasoa1 days ago|

[-]

3) The engineers working on the chip used ChatGPT from time to time.

reply

upvote

by Catloafdev1 days ago|

[-]

I'd be shocked if it was anything more than this.

reply

upvote

by changoplatanero1 days ago|

[-]

Browsing openai's job postings in the past few months is enough to contirm that it's more than this. They are for sure making serious efforts at building ai for chip design.

reply

upvote

by xnx23 hours ago|

[-]

Impossible to know. Could be fake/aspirational roles to impress investors with their grand vision.

reply

upvote

by NitpickLawyer22 hours ago|

[-]

Jesus. This is tinfoil hat territory now. Why would they fake something like that? ANY company in this field would try to become free from nvda. Goog has done it already, amazon has their own thing, so it can be done. Not saying they'll 0shot this vertical, but ffs, they don't need to fake anything. They are making an effort, and it would be insane to think they aren't. Might work, might not work, but to even think that the effort is fake is going too far.

reply

upvote

by kdheiwns8 hours ago|

[-]

Asking "why would a company lie?" is probably the funniest thing I've seen all month. Every big company puts out BS nonstop.

reply

upvote

by Planktonne21 hours ago|

[-]

They have a history of lying and making grandiose claims. It's unreasonable to extend them the benefit of the doubt again.

reply

upvote

by reinitctxoffset19 hours ago|

[-]

It kinda depends on what your prior is. Some companies do a press release and I immediately pay attention or even take action.

Other companies? Fool me once Altman, let's see the thing at scale making money.

Near frontier AI is clearly relevant to some kinds of logic design, I'm learning some Hardcaml at the moment and yeah, AI is super helpful.

Can it leapfrog a company without hardware experience to near the front of the pack of companies with decades of hardware experience? Less obvious.

Unrelatedly, would OpenAI dramatically overstate something to manipulate the press and public and capital markets?

It's arguably their core competency .

AI is going to matter in logic design and synthesis. How much, how soon, and where are open questions.

reply

upvote

by luqtas22 hours ago|

[-]

https://antoniocortes.com/en/2026/03/10/ghost-jobs-the-econo...

reply

upvote

by NitpickLawyer22 hours ago|

[-]

I'm not saying this isn't a thing. I'm saying oAI doesn't need to fake trying to make a chip or hiring people to make AI better at chip making, or dogfooding or anything like this. It's obvious they're doing it. They'd have 0 reason to fake something like this "for the investors". Come on!

reply

upvote

by signatoremo23 hours ago|

[-]

Do you have inside knowledge?

reply

upvote

by fl4regun1 days ago|

[-]

at the hardware company I work at, people are now using claude code and developing skills for it to do basic stuff like triage or do initial debug on failing tests, search for potential causes in RTL, generate skeleton documentation for designs etc

reply

upvote

by dofm23 hours ago|

[-]

But isn't this rather the ordinary product of an LLM, now?

Is it worth the claim that they are making in a press release?

reply

upvote

by girvo20 hours ago|

[-]

> Is it worth the claim that they are making in a press release?

Definitely, yes, because being vague about it like they have been lets investors fill-in-the-blanks with whatever they want it to mean.

reply

upvote

by reducesuffering1 days ago|

[-]

From time to time? Lol you must realize, frontier lab eng are using Codex/Claude-Code 99% in loops, on models the public doesn't have access to. Why? Because it works. Just a matter of time before humans are out of the loop and what comes next is a black hole

"The future is here, it's just not evenly distributed"

reply

upvote

by wongarsu1 days ago|

[-]

Or OpenAI accelerated the design and optimization process by summarizing emails exchanged during the design and optimization process, or made it possible to ask an AI questions about meeting notes

reply

upvote

by Aurornis1 days ago|

[-]

> 1) OpenAI genuinely have AI technologies that can improve chip design (bold, unlikely claim, needs evidence)

Chip design languages (HDLs like Verilog or VHDL) are well understood by LLMs. They don’t need specialty tools to use GPT-5.5 or other LLMs with them.

You could even try it yourself with open source chip design tooling if you wanted to see it.

reply

upvote

by dofm1 days ago|

[-]

Yes, obviously. But do we think LLMs without access to proprietary information do a better job with them than Broadcom's human experts or existing proprietary tools at this level of operations?

It is still a bold claim and it still needs evidence.

We would obviously get a bit more of the evidence if it were to be more useful for the upcoming IPO than this rather open-ended, reinterpretable phrasing.

reply

upvote

by fc417fc80217 hours ago|

[-]

> do a better job with them than Broadcom's human experts or existing proprietary tools

No, obviously. They'd be expected to do a substantially worse job and yet still drastically accelerate the design process.

LLMs make all sorts of dumb mistakes when writing c++ or python yet are nonetheless massively beneficial.

reply

upvote

by dpe821 days ago|

[-]

I don't understand why you're getting downvoted.

I've used GPT-5.5 and Opus both for FPGA design with good results. We built a lot of tooling around it to help the models, but even without that they're definitely capable of designing digital logic.

reply

upvote

by dmitrygr22 hours ago|

[-]

My guess: it is that those who KNOW the subject realize that LLMs suck at it, and those who do not, do not realize it, since their output is plausible, and sometimes even works.

This actually plays out across every field and is well documented. An expert can recognize the hallucinations and bullshit coming out of LLMs, while non-experts see plausible output and do not know enough to know it is BS.

reply

upvote

by stevenhuang20 hours ago|

[-]

Wrong. Myself and colleagues know the subject and they are useful in FPGA design. You should stop hallucinating about topics you don't have experience in.

reply

upvote

by wmf1 days ago|

[-]

https://dl.acm.org/doi/10.1145/3785362

https://developer.nvidia.com/culitho

https://www.synopsys.com/blogs/chip-design/analog-layout-syn...

https://arxiv.org/abs/2302.06415

reply

upvote

by etempleton23 hours ago|

[-]

I feel like they would be very specific if it was no.1.

reply

upvote

by scrollop1 days ago|

[-]

Perhaps they used gpt 5.5 mini to draft emails. Create a coffee schedule.

reply

upvote

by oceanplexian1 days ago|

[-]

> OpenAI genuinely have AI technologies that can improve chip design (bold, unlikely claim, needs evidence)

Why is that a bold and unlikely claim?

Are you saying that AI, which has been proven to cure diseases, solve our hardest math problems, write complex computer code and generate entire generated worlds and HD video from a simple prompt would somehow be like, my bad, I guess I can't design chips?

reply

upvote

by smokel1 days ago|

[-]

> solve our hardest math problems

We're not quite there yet :)

https://en.wikipedia.org/wiki/List_of_unsolved_problems_in_m...

reply

upvote

by dofm23 hours ago|

[-]

> Why is that a bold and unlikely claim?

Because they could have offered even slightly more evidence.

reply

upvote

by cess111 days ago|

[-]

Because then they'd likely have stfu and outperformed Intel, Nvidia and AMD, or at least one of them.

They're burning more cash than pretty much anyone else and doesn't have anything public that looks like a matching revenue stream so they probably need one very badly.

reply

upvote

by nullsanity23 hours ago|

[-]

[dead]

reply

upvote

by nixon_why691 days ago|

[-]

There is a lot of verilog out there, it's pretty feasible that they had AI assistance writing more to design their chip.

It doesn't have to be revolutionary, it could just be AI-assisted design and lined up well enough with their operations for a custom ASIC to be worth it.

reply

upvote

by KeplerBoy1 days ago|

[-]

Also there's some much boilerplate around everything. Writing a testbench with codex is extremely feasible. This is the kind of verifiable feedback loop the agents shine at.

reply

upvote

by blitzar22 hours ago|

[-]

> the use of email, spam filters and spellchecker to accelerate parts of the design and optimization process

honestly you don't realise how much more efficient it is until you are stuck using the wrong flavour of outlook, the spam filter breaks or sloppy spelling, punctuation and grammar force you to clarify details needlessly.

reply

upvote

by nickvec22 hours ago|

[-]

I feel like "the use of OpenAI models to accelerate parts of the design and optimization process" just means that engineers were using ChatGPT to sanity check their designs and suggest potential optimizations, though that's just my take (and I'm quite cynical about AI marketing in general!)

reply

upvote

by u1hcw9nx8 hours ago|

[-]

Written with AI is the new written in Rust. Both are nonsensical statements and tell noting about the quality of the software.

Without context, both are warnings about the quality of the developers.

reply

upvote

by Kiro9 hours ago|

[-]

I think this kind of "hard work" is a perfect fit for AI, and something where the complexity for a human is incorrectly extrapolated to LLMs.

Tirelessly wading through heaps of specifications and documentation with very clear goal definitions is hard for a human but easy for an AI. Meanwhile, taking UX and edge cases into account in a business application is easy for a human but hard for an AI.

reply

upvote

by SCUSKU22 hours ago|

[-]

My girlfriend works at Broadcom doing chip design, and based on what she's told me they JUST got claude code like 3 weeks ago, so I really doubt this means anything beyond them vibe coding some scripts or something...

reply

upvote

by 17 hours ago|

[-]

deleted

reply

upvote

by 22 hours ago|

[-]

deleted

reply

upvote

by figassis1 days ago|

[-]

VHDL, VLSI are well documented languages, with well build test and verification frameworks and harnesses. Even just by iteration you could get there if you have the money to pay for it.

reply

upvote

by FanaHOVA1 days ago|

[-]

NVIDIA already designs most of their chips using AI. Why would you assume it's meaningless marketing?

reply

upvote

by fecal_henge1 days ago|

[-]

Perhaps because they are suggesting what they are doing is novel.

reply

upvote

by DoctorOetker23 hours ago|

[-]

novel to whom, the reader or the industry?

something can be non-novel in the industry, yet novel to the reader, at which point it is useful ... for such readers.

reply

upvote

by nullsanity1 days ago|

[-]

[dead]

reply

upvote

by seydor1 days ago|

[-]

realistically, how hard are AI accelerators to design?

reply

upvote

by WithinReason9 hours ago|

[-]

The hardware? Not too difficult, there are dozens of startups. The software? Only NVIDIA could do it so far sufficiently well.

reply

upvote

by sentinalien6 hours ago|

[-]

How many profitable startups are there?

reply

upvote

by WithinReason5 hours ago|

[-]

0 because they lack the software, not the HW. The HW works and is relatively easy to make.

reply

upvote

by therealcamino15 hours ago|

[-]

Uh, pretty hard?

reply

upvote

by HarHarVeryFunny23 hours ago|

[-]

I would assume they've already made as big a deal of it as they can without outright lying too much. Read the rest of the press release.

FWIW, Google is now on their 8th generation TPU, having put out the last 4 generations on a 1-year cadence.

reply

upvote

by davidpapermill3 hours ago|

[-]

> Google is now on their 8th generation TPU

Remarkable that the TPU pre-dates the attention paper. Was a solid bet on energy efficient dense matrix multiplication and has stood the test of time.

reply

upvote

by HarHarVeryFunny14 minutes ago|

[-]

Yes, they'd been using machine learning internally since c.2011 with their DistBelief system, which was then replaced with TensorFlow in 2015.

reply

upvote

by vanyaland11 hours ago|

[-]

[dead]

reply

upvote

by napierzaza20 hours ago|

[-]

[dead]

reply

upvote

by xnx1 days ago|

[-]

AlphaChip is what a chip design with AI is. I'm very suspicious that OpenAI has anything like this or they would be bragging about it.

https://deepmind.google/blog/how-alphachip-transformed-compu...

reply