undefined

upvote

points

by wolttam3 hours ago |

upvote

by WinstonSmith8435 minutes ago|

[-]

Yes, this is actually "funny" that Anthropic feels the need to build such intrusive features into Claude Code, when anybody can build a (basic) Claude Code alternative. And the Chinese labs are certainly not "anybody". One may wonder what Anthropic really tries to achieve aside from awful publicity.

reply

upvote

by thih93 hours ago|

[-]

How do people build something like a personal harness? Are there tools for that or is it done from scratch?

reply

upvote

by andai2 hours ago|

[-]

I like this tutorial for an agent in 50 lines:

http://minimal-agent.com/

And if you add one additional while loop, for user input, you can actually use it! :)

https://gist.github.com/a-n-d-a-i/5461a662ef8a7ee0a5eb7778c8...

reply

upvote

by nowittyusername2 hours ago|

[-]

Build it from scratch. Understanding fundamentals of how agentic coding harnesses is a must though if you gonna go that route. I think everyone should take time and learn these things, maybe reverse engineer Codex Cli or something like that as a starter. That info is very valuable in this day and age.

reply

upvote

by andai2 hours ago|

[-]

Can you say more about Codex? I'm using GPT-5.5 in my own harness and it's not liking it very well, so I'm thinking I ought to make it more Codexy so it's more ergonomic for it. (edit format, tool calls etc.) But haven't gotten around to it yet.

reply

upvote

by hakunin2 hours ago|

[-]

Not the comment author, but I use pi and customize it with my own extensions. Pi automatically tells models how to customize itself, so it's a pretty easy process.

reply

upvote

by wolttam2 hours ago|

[-]

I started mine from scratch in 2023 because I wanted to use LLMs from a terminal and there was nothing else compelling at the time (nowadays there is pi and opencode)

Harnesses are/can be incredibly simple things, not much more than a HTTP client that renders things in a way that suites your taste.

reply

upvote

by kolinko2 hours ago|

[-]

It’s not that difficult, it’s just a system prompt and a set of basic file edit/bash/etc tools.

Me, personally, I didn’t build it from scratch but I ported original CC from published sources into Python and extended it to match my own requirements.

reply

upvote

by andai2 hours ago|

[-]

Are you using it with Claude? They only allow their own harness with the subs right? (And per-token billing is like 10x more expensive?)

reply

upvote

by abtinf2 hours ago|

[-]

Here is a video I made explaining it from absolute basics:

https://m.youtube.com/watch?v=_AgKuFGvJfI

And the repo:

https://github.com/abtinf/homunctor

reply

upvote

by airhangerf1539 minutes ago|

[-]

I hope you've already invalidated that bearer token :-P

reply

upvote

by yomismoaqui2 hours ago|

[-]

Building something like this is the todo list of agents.

I found this one easy to understand:

https://ampcode.com/notes/how-to-build-an-agent

reply

upvote

by AJ0072 hours ago|

[-]

The real question is when do you transition from building it with codex/CC to the harness itself.

reply

upvote

by verdverm48 minutes ago|

[-]

Lots of ways, it's a good exercise that you will learn a lot doing. Might make you cynical w.r.t. big ai harnesses

I used ADK, Dagger, and a VS Code extension for mine. Currently using opencode though.

reply

upvote

by echelon2 hours ago|

[-]

Why use a personal harness?

You have to pay API pricing, which is far more costly.

I'd either switch to GLM wholesale or just continue to use Opus within Claude Code as the blessed, subsidized path.

reply

upvote

by JTbane2 hours ago|

[-]

I would guess it is to avoid model lock-in.

reply

upvote

by echelon2 hours ago|

[-]

My question is still this - why not just use GLM at that point?

The pricing of Opus outside of Claude Code is insane.

The tokens cost too much outside of Anthropic's blessed path.

reply

upvote

by andai2 hours ago|

[-]

I use GLM in my custom harness. It completes the same tasks at the same level of quality, except 8x faster and 8x cheaper. (Same goes for GPT!)

I'm not sure how that's possible. I expected to get increased correctness for that order of magnitude (something something test-time compute!) but I am not getting it.

reply

upvote

by helloplanets1 hours ago|

[-]

The issue is that using Claude Code is an easy compromise for most to make, when you get to use the models 10x cheaper than through API pricing with a custom harness.

The cheap tokens are the product.

reply

upvote

by nananana91 hours ago|

[-]

Which is why my vibeslop harness supports `claude -p` as one of its backends.

reply

upvote

by helloplanets1 hours ago|

[-]

If that ain't getting steganographically tagged...

reply

upvote

by tonmoy3 hours ago|

[-]

What models are you using? Aren’t you still dealing with some provider even if you are not using their binary

reply

upvote

by wolttam3 hours ago|

[-]

I self-host DeepSeek V4 Flash on 2 DGX Sparks (approx. $10k)

I expect DeepSeek V4 Flash (or an equivalently sized model) to reach parity with GLM 5.2 some time this year (this based on DeepSeek V4 Flash launching at GLM 5.0 parity[0], and GLM 5.2 being freely available to distill from)

GLM 5.2 is within spitting distance of Opus 4.8 and is at least as good as Opus 4.6[1] which some devs were willing to spend hundreds to single-digit thousands of dollars a month for a few months ago.

[0]: https://artificialanalysis.ai/models/comparisons/deepseek-v4...

[1]: https://artificialanalysis.ai/models/comparisons/claude-opus...

reply

upvote

by ipsod2 hours ago|

[-]

How fast is it?

reply

upvote

by wolttam2 hours ago|

[-]

2000 t/s prompt processing and 40-50 t/s generation. We should see 60-70 t/s generation with DSpark support solidifying in vLLM in a few days

Recent discussion on DSpark: https://news.ycombinator.com/item?id=48696585

reply

upvote

by 2 hours ago|

[-]

deleted

reply

upvote

by krupan2 hours ago|

[-]

Given the Anthropic shenanigans, do you trust the personal harness code it wrote for you?

reply

upvote

by wolttam2 hours ago|

[-]

It did not write it for me, I used it to add a feature I wanted. It's a pretty small and understandable codebase, in fact :)

reply

upvote

by MichaelZuo2 hours ago|

[-]

Does anyone know what’s gone wrong with Anthropic?

They used to be a decently credible company with not-too-shady behaviour...

I hope they can actually regain some credibility…

reply

upvote

by hombre_fatal2 hours ago|

[-]

I don't think many people care that they are trying to detect resellers and distillation.

It also doesn't seem very consistent to fixate on that while sending Anthropic everything about you via your day to day prompts, every line of the projects and environments you're working on at work, etc.

Their credibility comes from having one of the best models.

reply

upvote

by MichaelZuo2 hours ago|

[-]

This sounds similar to what people were saying regarding Microsoft when the shady tricks of consumer Windows 10 versions were revealed.

…And then Windows 11 became even worse.

reply

upvote

by satvikpendem1 hours ago|

[-]

When have they ever been credible? They have always been shady with their talk of safety, Dario was the one who wrote back in 2019 that GPT 2 was too dangerous to release.

reply

upvote

by slowmovintarget2 hours ago|

[-]

Their philosophy is what's gone wrong.

It has some good effects on the their models, like Claude seeking cooperation first. But the people behind the company have a typical "unconstrained" (in the Sowell vision sense) perspective that assumes that they know better, so they are righteous for attempting to control things (users, paying customers, their model outputs, their tool chain, the supposed deity they assume they will produce... etc.)

reply

upvote

by pishpash2 hours ago|

[-]

Amodei world: pompous zealot with God complex

Altman world: malfeasant nihilist with God complex

reply

upvote

by MichaelZuo2 hours ago|

[-]

Yeah I guess there is a slight undertone that they are the superiors… with the rest of the tech world being the inferiors.

But I hadn’t thought that as anything more than temporary flights of fancy.

reply

upvote

by AlexandrB2 hours ago|

[-]

They've only been around 5 years and have grown tremendously during that time. There's no stable reputation you can rely on yet.

reply

upvote

by skeptic_ai2 hours ago|

[-]

They just show their true face. You’ve been lied all this time. They were never “good”.

reply

upvote

by MichaelZuo2 hours ago|

[-]

I used to interact with the LW crowd… and they were mostly not outright swindlers or scoundrels. (from what I could sense)

I think it’s fair to say most had decent respectability.

Anthropic hired heavily from that pool so it’s astonishing how it turned out.

reply

upvote

by imhoguy2 hours ago|

[-]

Enshitification. Too big to.. upset the govt.

reply

upvote

by SubiculumCode3 hours ago|

[-]

[flagged]

reply

upvote

by tiahura3 hours ago|

[-]

Phased rollouts are a triggering microagression for some.

reply

upvote

by 3 hours ago|

[-]

deleted

reply