by holistio15 hours ago |

by ricardobeat5 hours ago|

My current setup:

    $20/month: Claude Code
    $10/month: Minimax
    $16/month: Xiaomi Mimo
    $10/month: Opencode Go

Opus at low/medium effort generates plans. Then several coordinator/worker pairs are possible: DeepSeek v4 Pro + Minimax M3, Mimo v2.5 Pro + Mimo v2.5, Mimo + Minimax, Sonnet 4.6 + Haiku. I've been running hundreds of long multi-agent sessions and topped up extra credits here and there, but haven't reached $200/month in spend yet. Relying entirely on Claude/Codex feels like a waste of cash now.

by holistio15 hours ago|

TIL: I just found out that base58 disallows I (capital i), l (lowercase L), O (capital o) and 0 (zero), so I could only generate GrxoJt4eNXE2QaQ55iPSa7hhiYdzCo8ZeAuokmh2Cai.

(don't send anything, sharing only because of the base58 fun fact I didn't know)

by IdiotSavage7 hours ago|

More fun facts:

Omitting those characters makes it good for generating passwords if they need to be typed in by hand.

Double-clicking a base58 string always selects the whole string and it doesn't wrap accidentally, thanks to missing / and +, so it's also convenient to copy and paste.
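For anyone curious what that looks like in practice, here is a minimal sketch of base58 encoding in Python. It assumes the Bitcoin-style alphabet (other base58 variants order the characters differently), and the names `ALPHABET`, `b58encode` and `is_base58` are made up for illustration, not taken from any library.

```python
# Bitcoin-style base58 alphabet (an assumption; other variants exist):
# digits and letters with 0, O, I and l left out.
ALPHABET = "123456789ABCDEFGHJKLMNPQRSTUVWXYZabcdefghijkmnopqrstuvwxyz"


def b58encode(data: bytes) -> str:
    """Encode bytes as base58 (hypothetical helper, no checksum)."""
    # Treat the bytes as one big-endian integer and repeatedly divide by 58.
    n = int.from_bytes(data, "big")
    digits = []
    while n > 0:
        n, rem = divmod(n, 58)
        digits.append(ALPHABET[rem])
    # Each leading zero byte is written as the first alphabet character, "1".
    pad = len(data) - len(data.lstrip(b"\x00"))
    return "1" * pad + "".join(reversed(digits))


def is_base58(s: str) -> bool:
    """Return True if every character of s is in the base58 alphabet."""
    return all(ch in ALPHABET for ch in s)
```

Since the alphabet has no `/`, `+` or other punctuation, `is_base58` is just a membership check, which is also why a double-click selects the whole string.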

by wasabi9910111 hours ago|

Unfortunately, no special characters means that a base58 string will often be rejected as not secure enough for a password.

by robertwt714 hours ago|

At this point I might just try Neuralwatt and see how many requests I can get with GLM5.2. I've read a lot of reviews saying that it's very cheap to run on Neuralwatt cloud.

by bicx5 hours ago|

I wish I only paid $200/mo for Anthropic! Multiply that by 20x.

by blks5 hours ago|

What are you getting out of it at $4000/month?

by maxdo4 hours ago|

I burned ~20k+/mo on Codex.

by blks2 hours ago|

Did you make that money back?

by maxdo37 minutes ago|

It is hard to stretch every single token to a win, but…

The two major deals it was meant for are still up in the air. If we win, sure, a 60x win.

by JumpCrisscross11 hours ago|

Does it work? I’m less interested in economics than fit with an MVP.

by da_grift_shift9 hours ago|

https://news.ycombinator.com/item?id=48625727

by someone_123414 hours ago|

Or use OpenRouter and switch to whichever model you want to use (I think so).

by ljlolel14 hours ago|

Or TrustedRouter if you want privacy and open source

by yorwba12 hours ago|

You ought to realize that shilling your product in the comments doesn't exactly come across as trustworthy.

by smusamashah8 hours ago|

Oh! I thought TrustedRouter was a joke/sarcasm. Very wrong placement of the comment.

by JumpCrisscross11 hours ago|

Disclosing affiliation hasn’t been a legal thing for a while. It’s reputational. Knowing that firm spams is a black mark.

by ljlolel6 hours ago|

It’s all open source and I say that it’s mine in all the sibling comments above

by rvz15 hours ago|

Pay $0 to run a local model or even a cheap DeepSeek V4 model via their API which is close to free per million tokens.

These prices are just going to get raced to $0.

by a212811 hours ago|

I used to have a $20/mo ChatGPT subscription and now I spend $12 per year using Kimi models on OpenRouter, and that's with zero-data-retention-only providers (some models sometimes have free providers with scary tracking). Maybe I just don't use that many tokens, I don't fill the context with more than what's needed for a specific request, but it goes to show how these subscriptions can be an absolute ripoff. The thought of spending 200x that is insane to me
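To make the ratios concrete, here is a quick sketch of the arithmetic using only the figures quoted in this comment ($12 per year on OpenRouter, $20/month for ChatGPT, $200/month for the top tier). The helper names `annual_cost` and `cost_ratio` are hypothetical.

```python
MONTHS_PER_YEAR = 12


def annual_cost(monthly_price: float) -> float:
    """Yearly cost of a subscription billed monthly."""
    return monthly_price * MONTHS_PER_YEAR


def cost_ratio(monthly_price: float, yearly_spend: float) -> float:
    """How many times more a monthly subscription costs per year
    than a given yearly pay-as-you-go spend."""
    return annual_cost(monthly_price) / yearly_spend


# Figures quoted in the comment above.
openrouter_per_year = 12.0
chatgpt_ratio = cost_ratio(20.0, openrouter_per_year)    # $240 vs $12 per year
top_tier_ratio = cost_ratio(200.0, openrouter_per_year)  # $2400 vs $12 per year
```

With those inputs the $20/month plan comes out at 20 times the yearly spend and the $200/month plan at 200 times, which matches the "200x" figure.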

by mark_l_watson6 hours ago|

The beauty of your approach: when people are not paying for an expensive subscription, they can decide to use models less and not feel like they are leaving money on the table.

by holistio15 hours ago|

Maybe. But for now it's fascinating how $200/month has kind of become a normal tier.

It's similar to how AirPods normalised all of us having $300+ headphones. All of us would have scoffed at the idea a decade ago.

by p1esk15 hours ago|

Many people here spent a lot more than $300 on headphones long before AirPods appeared.

by mc330114 hours ago|

Those were hobbyists, audiophiles, professionals, artists (recording, performing, etc.).

They are talking about a much larger group of people.

by klausa13 hours ago|

I think OP meant noise-cancelling headphones, which were fairly ubiquitous in tech circles in open offices before Apple launched AirPods.

by uberex11 hours ago|

AirPods Inc. would be very high up in the S&P 500 as a standalone business.

by holistio15 hours ago|

I had a really nice Sennheiser before that, too. But now you hop on the subway and everybody sports one.

by mark_l_watson7 hours ago|

But it is not all about cost: models like DeepSeek v4 flash (I use the US company Fireworks.ai and also buy tokens directly from DeepSeek) are very fast, with very low latency while working.

Would you want to use a text editor that updates the screen very slowly? Kind of the same thing for using agentic systems as coding assistants: don’t want a ‘sluggish’ experience.

by erispoe6 hours ago|

I mostly have long-running autonomous tasks, so it doesn't matter how slow inference is. If I optimize for latency, it means I'm turning into the limiting factor.

by sofixa13 hours ago|

The Sony WH-1000XM series and the Bose QC35 were the standard quality headphones years before AirPods were a thing, and both retailed at $300+.

by holistio12 hours ago|

Of course, premium headphones existed before. I have a WH-1000XM4 sitting right next to me.

But your aunt Josie didn't have one. Now Apple is selling 80 million units / year and the ~$300 price tag has become normal. Before that, most people had headphones that were 10 times cheaper.

by Hamuko11 hours ago|

$300 isn’t what AirPods cost though. You can get a pair of AirPods 4 for $129 on Apple.com, and I presume that is still the most popular model. If you’re paying ~$300, you are buying premium headphones.

by holistio4 hours ago|

The base model where I live (Central Europe) is $194. The Pro is $357. The Max is $779.

I just averaged it out.

by qainsights5 hours ago|

Not everyone can run local models. It is also expensive and will be outdated soon as the models evolve.

by kijin15 hours ago|

Not while the hardware required to run a local model at an acceptable speed costs way more than $200.

Guess what, the big players are hoarding all the RAM and GPUs so that other people can't afford decent hardware. It's working out beautifully for them!

by sofixa13 hours ago|

> Not while the hardware required to run a local model at an acceptable speed costs way more than $200

It's $200/month. You have to take into account energy costs and all the rest of a system, but if you break even within 1-2 years ($2400-$4800) it'd be a pretty good deal. And $4000 buys you a pretty decent system.
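A rough sketch of that break-even estimate, assuming the only inputs are the hardware price, the subscription price and a flat monthly running cost. The function name `break_even_months` and the $40/month running cost in the usage note are illustrative assumptions, not measured numbers.

```python
from typing import Optional


def break_even_months(
    hardware_cost: float,
    subscription_per_month: float,
    running_cost_per_month: float = 0.0,
) -> Optional[float]:
    """Months until owning the hardware is cheaper than subscribing.

    running_cost_per_month is a hypothetical flat figure for energy
    and upkeep. Returns None if the hardware never pays for itself.
    """
    saving = subscription_per_month - running_cost_per_month
    if saving <= 0:
        return None
    return hardware_cost / saving
```

With a $4000 system against a $200/month subscription and no running costs, `break_even_months(4000, 200)` gives 20 months, inside the 1-2 year window. Assuming $40/month in energy pushes it to 25 months.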

by kijin7 hours ago|

Sure, if you're going to keep using it long term.

But it's a hefty upfront investment for people who just want to experiment. The good thing about $200/month subscriptions is that you can cancel them any time and cut your losses. Not so with a $4000 computer that loses half of its resale value as soon as you plug it in.

I think the current sweet spot for people who don't already own a high-end gaming PC is to rent a server with a beefy GPU from Hetzner et al. and run local models there.

by emodendroket4 hours ago|

[dead]

by audreyt15 hours ago|

Happy user here, pairing it with Composer 2.5, with Fugu Ultra as advisor and Fugur as planner. For scope/architecture it's closer to useful Fable-style orchestration than to one chat thread.

I've been shipping production on archive.tw with Fugu Ultra in /advisor on oh-my-pi.

Advisor doesn’t slow the loop if the driver stays fast. Worth it if your harness can split advisor from worker.

by Bombthecat3 hours ago|

Which software are you using to do that?

Edit: nevermind, but which plugin or so?

by da_grift_shift10 hours ago|

Yo dawg, I heard you like agents, so we put agents in yo agents so you can burn tokens while you burn tokens.
