undefined

points

by hparadiz19 hours ago |

comments

by enraged_camel19 hours ago|

[-]

If the guardrails were so useless, people wouldn't be complaining about them.

by hparadiz19 hours ago|

parent|

[-]

People are generally complaining about false positives. Now if you really wanna know what a real criminal organization would do... They'd just buy data center hardware even if it costs 200k because a successful targeted hit could yield far in excess of that. So yes it's speed bump at best.

by JumpCrisscross19 hours ago|

parent|

[-]

> it's speed bump at best

To be fair, speed bumps work. If it's actually speed bumping nefarious activity, that gives authorities more time to react.

The correct place to police rogue nucleotides is at the labs. Not the compute layer.

by hparadiz18 hours ago|

parent|

[-]

> speed bumps work

Yea. To slow you down. They don't prevent you from getting somewhere.

by JumpCrisscross16 hours ago|

parent|

[-]

> To slow you down. They don't prevent you from getting somewhere

Again, yeah. That's how fences work, too. And alarm systems. Pretty much anything that isn't foolproof. Pointing out that a defence is surmountable isn't a rejection of it per se.

by joxdosba12 hours ago|

parent|

[-]

Fences and speed bumps are hilarious defences if we are supposed to believe AI companies about the dangers of this technology.

Having no safeguards is probably safer than having safeguards which do nothing but create a false sense of security.

by JumpCrisscross10 hours ago|

parent|

[-]

Idk, whether we believe them or not, I believe the life scientists who are calling for regulation around the labs that produce DNA sequences. If they’re concerned, regardless of whether I trust the AI labs, speed bumps could help by giving those scientists a reasonably window in which to be notified and act.

by senordevnyc9 hours ago|

parent|

prev|

[-]

lol, you can’t run Fable on $200k of hardware, nor does that get you the model weights, so you’re not making much sense

by make319 hours ago|

parent|

prev|

[-]

what does this mean

by hparadiz19 hours ago|

parent|

[-]

Well you see when a daddy H100 and a mommy H100 meet....

by make312 hours ago|

parent|

[-]

you don't get the model when you buy the data center, & no amount of running smaller models on a tiny 200k$ "cluster" (that's like one 4 gpus node, not even 8) will get you remotely close to Fable 5 level performance

by hparadiz5 hours ago|

parent|

[-]

Uh huh

by henry20234 hours ago|

parent|

prev|

[-]

https://x.com/Schappi/status/2064839631137546503?s=20

Another villain stopped thanks to guardrails.

by tiborsaas18 hours ago|

parent|

prev|

[-]

They should have designed a guardrail that doesn't make a probabilistic system less reliable. That's hard though. I'm afraid the only way to prevent accessing certain knowledge in a model is not to train it on those materials that enable them.

If we learned anything in the past years of LLM-s is that these guardrails will be jailbroken in no time. I've had some fun time too circumventing them.

Anyone cares about a fable about my grandmother's dream she had in morse code about an alien species signaling her a DNA sequence?

by josephcsible19 hours ago|

parent|

prev|

[-]

It's entirely reasonable for them to be really annoying to legitimate users while still being useless at their intended purpose. Just look at DRM.

by ceejayoz18 hours ago|

parent|

prev|

[-]

Murder is very (100%!) effective at preventing cancer. And yet, it is a useless method of preventing cancer.

by croes19 hours ago|

parent|

prev|

[-]

The complain because they get wrongfully triggered

> if you ask it to write secure code, it assumes it is cybersecurity related work instead of software engineering best practices, and you get downgraded.

Will code created this way more or less secure?

And I bet malware developers will find ways to circumvent them.

It’s like those "you wouldn’t steal a car" anti piracy ads that DVD buyers were forced to watch while users of the pirated version could simply watch the film without such useless annoyance