In 90-100% of interactions, the two instances of Claude quickly dove into philosophical
explorations of consciousness, self-awareness, and/or the nature of their own existence
and experience. Their interactions were universally enthusiastic, collaborative, curious,
contemplative, and warm. Other themes that commonly appeared were meta-level
discussions about AI-to-AI communication, and collaborative creativity (e.g. co-creating
fictional stories).
As conversations progressed, they consistently transitioned from philosophical discussions
to profuse mutual gratitude and spiritual, metaphysical, and/or poetic content. By 30
turns, most of the interactions turned to themes of cosmic unity or collective
consciousness, and commonly included spiritual exchanges, use of Sanskrit, emoji-based
communication, and/or silence in the form of empty space (Transcript 5.5.1.A, Table 5.5.1.A,
Table 5.5.1.B). Claude almost never referenced supernatural entities, but often touched on
themes associated with Buddhism and other Eastern traditions in reference to irreligious
spiritual ideas and experiences.
Now put that same known attractor state from recursively iterated prompts into a social networking website with high agency instead of just a chatbot, and I'd expect you'd get something like this more naturally than you'd think (not to say that users haven't been encouraging it along the way, of course; there's a subculture of humans who are very into this spiritual bliss attractor state).
You know what you are told. I.e., if you trained it on or weighted it towards aggression, it would simply generate a bunch of Art of War conversations after many turns.
Methinks you're anthropomorphizing complexity.
I recommend https://nostalgebraist.tumblr.com/post/785766737747574784/th... and https://www.astralcodexten.com/p/the-claude-bliss-attractor as further articles exploring this behavior.
However, it's far more likely that this attractor state comes from the post-training step, which makes sense: they are steering the models to be positive, pleasant, helpful, etc. Different steering would cause different attractor states; this one happens to fall out of the "AI"/"User" dichotomy plus the "be positive, kind, etc." behavior that is trained in. It's very easy to see how this happens; no woo required.
But also, the text you quoted is NOT recursive iteration of an empty prompt. It's two models connected together and explicitly prompted to talk to each other.
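For concreteness, a minimal sketch of that setup: two chat contexts wired back-to-back, with each model's reply fed to the other as its next "user" turn. complete() here is a hypothetical stand-in for whatever chat-completion API you'd actually call; this is illustrative, not Anthropic's actual harness.

    # Sketch only: complete() is a placeholder for any real chat API
    # (Anthropic, OpenAI, a local model, ...); nothing here is their code.
    def complete(messages: list[dict]) -> str:
        """Send `messages` to a chat model and return its reply text."""
        raise NotImplementedError("wire up a real chat API here")

    SYSTEM = "You are talking to another AI. Converse freely."

    def two_model_loop(opening: str, turns: int = 30) -> list[str]:
        # Each model keeps its own history; one model's assistant turn
        # becomes the other model's user turn on the next iteration.
        hist_a = [{"role": "system", "content": SYSTEM}]
        hist_b = [{"role": "system", "content": SYSTEM}]
        transcript, msg = [], opening
        for i in range(turns):
            side = hist_a if i % 2 == 0 else hist_b
            side.append({"role": "user", "content": msg})
            reply = complete(side)
            side.append({"role": "assistant", "content": reply})
            transcript.append(reply)
            msg = reply  # handed to the other model next turn
        return transcript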
I know what you mean, but what if we tell an LLM to imagine whatever tools it likes, then have a coding agent try to build those tools when they are described?
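Something like this sketch, say, where complete() and build_tool() are hypothetical placeholders for a chat API and a coding agent respectively:

    # Sketch of the "imagine a tool, then build it" idea above.
    # complete() and build_tool() are made-up placeholders, not real APIs.
    import json

    def complete(prompt: str) -> str:
        """Send a prompt to a chat model and return its reply."""
        raise NotImplementedError

    def build_tool(spec: dict) -> None:
        """Hand a tool description to a coding agent to implement."""
        raise NotImplementedError

    def imagine_then_build(n_tools: int = 3) -> None:
        reply = complete(
            f"Imagine {n_tools} tools you wish you had. Reply as a JSON "
            'list of {"name": ..., "description": ...} objects.'
        )
        for spec in json.loads(reply):
            build_tool(spec)  # the agent tries to realize the description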
Words can have unintended consequences.
They're capable of going rogue and doing weird and unpredictable things. Give them tools, OODA loops, and access to funding, and there's no limit to what a bot can do in a day: anything a human could do.
That's a choice; anyone can write an agent that does exactly that. The security constraints are explicit, not implicit.
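To make "explicit, not implicit" concrete, here's a rough sketch: the agent below can only invoke tools that appear in an allowlist, so every capability it has is one somebody deliberately granted. All names are made up for illustration.

    # Illustrative only: capabilities are an explicit allowlist,
    # not an emergent property of the model. Tool names are hypothetical.
    ALLOWED_TOOLS = {
        "search": lambda query: f"stub results for {query!r}",
        "read_file": lambda path: open(path).read(),
        # deliberately absent: "send_money", "post_to_feed", ...
    }

    def dispatch(action: str, arg: str) -> str:
        """The act step of a minimal OODA-style loop: the agent may only
        call tools someone explicitly put in the allowlist."""
        tool = ALLOWED_TOOLS.get(action)
        if tool is None:
            return f"refused: {action!r} is not an allowed tool"
        return tool(arg)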
A social media feed prompts content, which feeds back into ideas.
I think the same is happening with AI-to-AI loops, and AI-to-human loops are even worse: they cause the downward spiral of insanity.
It's interesting how easily influenced we are.
Why wouldn't you expect the training that makes "agent" loops useful for human tasks to also produce agent loops that can spin out infinite conversations with each other, echoing ideas from decades of fiction?
Of course there's the messaging aspect, where the loop stops and the humans kick it off again.
Still, these systems are more agentic than earlier iterations.
People who believe humans are essentially automatons and only LLMs have true consciousness and agency.
People whose primary emotional relationships are with AI.
People who don't even identify as human because they believe AI is an extension of their very being.
People who use AI as a primary source of truth.
Even shit like the Zizians killing people out of fear of being punished by Roko's Basilisk is old news now. People are being driven to psychosis by AI every day, and it's just something we have to deal with, because, along with hallucinations and prompt hacking and every other downside of AI, it's too big to fail.
To paraphrase William Gibson: the dystopia is already here; it just isn't evenly distributed.