upvote
If the guardrails were so useless, people wouldn't be complaining about them.
reply
People are generally complaining about false positives. Now if you really wanna know what a real criminal organization would do... They'd just buy data center hardware even if it costs 200k because a successful targeted hit could yield far in excess of that. So yes it's speed bump at best.
reply
> it's speed bump at best

To be fair, speed bumps work. If it's actually speed bumping nefarious activity, that gives authorities more time to react.

The correct place to police rogue nucleotides is at the labs. Not the compute layer.

reply
> speed bumps work

Yea. To slow you down. They don't prevent you from getting somewhere.

reply
> To slow you down. They don't prevent you from getting somewhere

Again, yeah. That's how fences work, too. And alarm systems. Pretty much anything that isn't foolproof. Pointing out that a defence is surmountable isn't a rejection of it per se.

reply
Fences and speed bumps are hilarious defences if we are supposed to believe AI companies about the dangers of this technology.

Having no safeguards is probably safer than having safeguards which do nothing but create a false sense of security.

reply
Idk, whether we believe them or not, I believe the life scientists who are calling for regulation around the labs that produce DNA sequences. If they’re concerned, regardless of whether I trust the AI labs, speed bumps could help by giving those scientists a reasonably window in which to be notified and act.
reply
lol, you can’t run Fable on $200k of hardware, nor does that get you the model weights, so you’re not making much sense
reply
what does this mean
reply
Well you see when a daddy H100 and a mommy H100 meet....
reply
you don't get the model when you buy the data center, & no amount of running smaller models on a tiny 200k$ "cluster" (that's like one 4 gpus node, not even 8) will get you remotely close to Fable 5 level performance
reply
Uh huh
reply
https://x.com/Schappi/status/2064839631137546503?s=20

Another villain stopped thanks to guardrails.

reply
They should have designed a guardrail that doesn't make a probabilistic system less reliable. That's hard though. I'm afraid the only way to prevent accessing certain knowledge in a model is not to train it on those materials that enable them.

If we learned anything in the past years of LLM-s is that these guardrails will be jailbroken in no time. I've had some fun time too circumventing them.

Anyone cares about a fable about my grandmother's dream she had in morse code about an alien species signaling her a DNA sequence?

reply
It's entirely reasonable for them to be really annoying to legitimate users while still being useless at their intended purpose. Just look at DRM.
reply
Murder is very (100%!) effective at preventing cancer. And yet, it is a useless method of preventing cancer.
reply
The complain because they get wrongfully triggered

> if you ask it to write secure code, it assumes it is cybersecurity related work instead of software engineering best practices, and you get downgraded.

Will code created this way more or less secure?

And I bet malware developers will find ways to circumvent them.

It’s like those "you wouldn’t steal a car" anti piracy ads that DVD buyers were forced to watch while users of the pirated version could simply watch the film without such useless annoyance

reply