undefined

points

by Aurornis7 hours ago |

comments

by lr4444lr7 hours ago|

[-]

At this point, I trust LLMs to come up with something more secure than the cheapest engineering firm for hire.

by nozzlegear5 hours ago|

parent|

[-]

"Anyone else out there vibe circuit-building?"

https://xcancel.com/beneater/status/2012988790709928305

by godelski25 minutes ago|

parent|

[-]

Is there more context to this? I'm assuming Ben is experimenting and demonstrating the danger of vibe circuit designing? Mostly because I know he has a ton of experience and I'd expect him to not make this mistake (also seems like he told the AI why it was wrong)

by alexjplant1 hours ago|

parent|

prev|

[-]

People make these mistakes too. Several times in my high school shop class kids shorted out 9V batteries trying to build circuits because they didn't understand how electronics work. At no point did our teacher stop them from doing so - on at least one occasion I unplugged one from a breadboard before it got too toasty to handle (and I was/am an electronics nublet). Similarly there was also a lot of hand-wringing about the Gemini pizza glue in a world where people do wacky stuff like cook fish in a dishwasher or defrost chicken overnight on the counter or put cooked steak on the same plate it was on when raw just a few minutes prior.

LLMs are just surfacing the fact that assessing and managing risk is an acquired, difficult-to-learn skill. Most people don't know what they don't know and fail to think about what might happen if they do something (correctly or otherwise) before they do it, let alone what they'd do if it goes wrong.

by nozzlegear45 minutes ago|

parent|

[-]

Well said, but I'd add that LLMs are also surfacing the fact that there's a swathe of people out there who will treat the machines as more trustworthy than humans by default, and don't believe they need to do any assessment or risk management in the first place.

by godelski31 minutes ago|

parent|

prev|

[-]

What's your point?

The AI is being sold as an expert, not a student. These are categorically different things.

The mistake in the post is one that can be avoided by taking a single class at a community college. No PhD required, not even a B.S., not even an electricians certificate.

So I don't get your point. You're comparing a person in a learning environment to the equivalent of a person claiming to have a PhD in electrical engineering. A student letting the magic smoke escape from a basic circuit is a learnable experience (a memorable one that has high impact), especially when done in a learning environment where an expert can ensure more dangerous mistakes are less likely or non existent. But the same action from a PhD educated engineer would make you reasonably question their qualifications. Yes, humans make mistakes but if you follow the AI's instructions and light things on fire you get sued. If you follow the engineer's instructions and set things on fire then that engineer gets fired likely loses their license.

So what is your point?

by bethekidyouwant11 minutes ago|

parent|

[-]

No one thinks their breadboard wont catch on fire because an AI agent told them it wouldn’t. Its never been easier to learn because of these agents.

by azan_31 minutes ago|

parent|

prev|

[-]

What’s wrong with dishwasher salmon?

by Aurornis7 hours ago|

parent|

prev|

[-]

The cheapest engineering firms you hire are also using LLMs.

The operator is still a factor.

by jama2116 hours ago|

parent|

[-]

Yeah, but they’ll add another layer of complexity over doing it yourself

by Aurornis6 hours ago|

parent|

[-]

The people doing these kickstarters are outsourcing the work because they can’t do it themselves. If they use an LLM, they don’t know what to look for or even ask for, which is how they get these problems where the production backend uses shared credentials and has no access control.

The LLM got it to “working” state, but the people operating it didn’t understand what it was doing. They just prompt until it looks like it works and then ship it.

by caminante5 hours ago|

parent|

[-]

You're still not following.

The parents are saying they'd rather vibe code themselves than trust an unproven engineering firm that does(n't) vibe code.

by TeMPOraL3 hours ago|

parent|

[-]

> they'd rather vibe code themselves than trust an unproven engineering firm

You could cut the statement short here, and it would still be a reasonable position to take these days.

LLMs are still complex, sharp tools - despite their simple appearance and proteststions of both biggest fans and haters alike, the dominating factor for effectiveness of an LLM tool on a problem is still whether or not you're holding it wrong.

by Kiro4 hours ago|

parent|

prev|

[-]

LLMs definitely write more robust code than most. They don't take shortcuts or resort to ugly hacks. They have no problem writing tedious guards against edge cases that humans brush off. They also keep comments up to date and obsess over tests.

by thayne42 minutes ago|

parent|

[-]

> They don't take shortcuts or resort to ugly hacks.

That hasn't, universally, been my experience. Sometimes the code is fine. Sometimes it is functional, but organized poorly, or does things in a very unusual way that is hard to understand. And sometimes it produces code that might work sometimes but misses important edge cases and isn't robust at all, or does things in an incredibly slow way.

> They have no problem writing tedious guards against edge cases that humans brush off.

The flip side of that is that instead of coming up with a good design that doesn't have as many edge cases, it will write verbose code that handles many different cases in similar, but not quite the same ways.

> They also keep comments up to date and obsess over tests.

Sure but they will often make comments or tests that aren't actually useful, or modify tests to succeed instead of fixing the code.

One significant danger of LLMs is that the quality of the output is higly variable and unpredictable.

That's ok, if you have someone knowledgeable reviewing and correcting it. But if you blindly trust it, because it produced decent results a few times, you'll probably be sorry.

by godelski8 minutes ago|

parent|

[-]

  > Sure but they will often make comments or tests that aren't actually useful, or modify tests to succeed instead of fixing the code.

I've been deeply concerned that there's been a rise of TDD. I thought we already went through this and saw its failure. But we're back to we're people cannot differentiate "tests aren't enough" from "tests are useless". The amount of faith people put into tests is astounding. Especially when they aren't spending much time analyzing the tests and understanding their coverage.

by godelski15 minutes ago|

parent|

prev|

[-]

  > They don't take shortcuts or resort to ugly hacks.

My experience is quite different

  > They have no problem writing tedious guards against edge cases that humans brush off.

Ditto.

I have a hard time getting them to write small and flexible functions. Even with explicit instructions about how a specific routine should be done. (Really easy to produce in bash scripts as they seem to avoid using functions, but so do people, but most people suck at bash) IME they're fixated on the end goal and do not grasp the larger context (which is often implicit though I still find difficulty when I'm highly explicit. Which at that point it's usually faster to write myself)

It also makes me question context. Are humans not doing this because they don't think about it or because we've been training people to ignore things? How often do we hear "I just care that it works?" I've only heard that phrase from those that also love to talk about minimum viable products because... frankly, who is not concerned if it works? That's always been a disagreement about what is sufficient. Only very junior people believe in perfection. It's why we have sayings like "there's no solution more permanent than a temporary fix that works". It's the same people who believe tests are proof of correctness rather than a bound on correctness. The same people who read that last sentence and think I'm suggesting to not write tests or believe tests are useless.

I'd be concerned with the LLM operator quite a bit because of this. Subtle things are important when instructing LLMs. Subtle things in the prompts can wildly change the output

by BoorishBears2 hours ago|

parent|

prev|

[-]

I had 5.3-Codex take two tries to satisfy a linter on Typescript type definitions.

It gave up, removed the code it had written directly accessing the correct property, and replaced it with a new function that did a BFS to walk through every single field in the API response object while applying a regex "looksLikeHttpsUrl" and hoping the first valid URL that had https:// would be the correct key to use.

On the contrary, the shift from pretraining driving most gains to RL driving most gains is pressuring these models resort to new hacks and shortcuts that are increasingly novel and disturbing!

by devmor4 hours ago|

parent|

prev|

[-]

Interesting and completely wrong statement, what gave you this impression?

by dylanowen4 hours ago|

parent|

[-]

I know right. I kept waiting for a sarcasm tag at the end

by majorchord4 hours ago|

parent|

prev|

[-]

right and wrong don't exist when evaluating subjective quantifiers

by Kiro4 hours ago|

parent|

prev|

[-]

The discourse around LLMs has created this notion that humans are not lazy and write perfect code. They get compared to an ideal programmer instead of real devs.

by joe_mamba2 hours ago|

parent|

[-]

This. The hacks, shortcuts and bugs I saw in our product code after i got hired, were stuff every LLM would tell you not to do.

by gxs3 hours ago|

parent|

prev|

[-]

Amen. On top of that, especially now, with good prompting you can get closer to that better than you think.

by salawat3 hours ago|

parent|

prev|

[-]

LLM's at best asymptotically approach a human doing the same task. They are trained on the best and the worst. Nothing they output deserves faith other than what can be proven beyond a shadow of a doubt with your own eyes and tooling. I'll say the same thing to anyone vibe coding that I'd say to programmatically illiterate. Trust this only insofar as you can prove it works, and you can stay ahead of the machine. Dabble if you want, but to use something safely enough to rely on, you need to be 10% smarter than it is.

by lukan7 hours ago|

parent|

prev|

[-]

And the cheapest engineering firm won't use LLMs as well, wherever possible?

by fc417fc8024 hours ago|

parent|

[-]

The cheapest engineering firm will turn out to be headed up by an openclaw instance.

by TheRealPomax7 hours ago|

parent|

prev|