Right - I have a ton of coworkers who obsess over "skills" and different ways to run agents and whatnot, but I just... spend some time giving very thorough, detailed instructions and it just Does The Thing. I rarely fight with Claude Code these days.
reply
We probably need something like the WET principle for skills. If you need to explain the same thing to an agent more than twice, turn it into a skill (or add it to AGENTS.md, or CLAUDE.md, or to your docs folder, or your guides folder, or whatever method you use). If you haven't needed to explain it more than twice, it's probably fine. The context pollution from the skill would likely be worse than not having the skill.

Of course exceptions apply. Some basic information that will reliably be discovered is still worth adding to your AGENTS.md to cut down on token use. But after a couple obvious things you quickly get into the realm of premature optimization (unless you actually measure the effects)

reply
Same here. For me, this means a spec doc split into features/UX, technical requirements, and language-specific requirements, iterated before the model touches code.
reply
The trick is to "just use it", BUT every few weeks grab the logs (you do keep them, right?) and have a session with the model to find out if there are any repeated patterns.

If you find any, consider making them into skills or /commands or maybe even add them to AGENTS.md.

reply
Which logs do you use for that?
reply
I would assume those in ~/.claude/projects/**/*.jsonl. They contain the full conversation history, including the tool calls that were made, how many tokens were consumed, etc.
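A minimal sketch of the pattern-hunting step, assuming those logs hold one JSON object per line (the `type` field name is a guess here; check what keys your own logs actually use) - tally how often each kind of entry shows up across every session file:

```python
import json
from collections import Counter
from pathlib import Path

def tally_jsonl(root: str, key: str = "type") -> Counter:
    """Count the values of `key` across every .jsonl session log under `root`."""
    counts = Counter()
    for path in Path(root).expanduser().glob("**/*.jsonl"):
        for line in path.read_text().splitlines():
            if not line.strip():
                continue  # skip blank lines between entries
            entry = json.loads(line)
            counts[entry.get(key, "unknown")] += 1
    return counts

# e.g. tally_jsonl("~/.claude/projects") to see which entry types dominate
```

From there you could paste the most frequent patterns into a review session with the model, as the parent comment suggests.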
reply
Claude has a built-in /insights feature for this, but you can replicate it with any other tool that keeps the session logs on disk.
reply
I agree, it works nicely for me. In my experience it's not realistic to expect a one-shot every time, but asking it to build chunks and entering a review cycle with nudging works well. Once I changed my mindset from "it didn't one-shot, so it's crap" to treating it as an iterative tool that builds pieces I assemble, it's been working nicely without external frameworks or anything. Plan, review, iterate, split, build, review, iterate.
reply
You're wasting a ton of tokens doing that, though. Right now you don't realize it because they're being heavily subsidized, but you'll understand the point of good orchestration and memory files when you have to pay the real cost of your usage.
reply
Cost can't go up, only down over time (with occasional short-term fluctuations). Competition, including open-weight models and consumer hardware (e.g. the upcoming M5 Ultra), keeps pushing down the ceiling of what you can charge.
reply
If the cost is subsidized by another cash source (e.g. VC money), prices can definitely go up when that source stops.
reply
Company pays for company’s tokens, so company’s problem, not mine. I am happy to skill up and avoid overusing tokens for my personal sub, but if it’s getting results then I couldn’t care less how much my employer has to pay for it. They’re begging me to use it in the first place anyway.
reply
My experience as well on non-trivial stuff for personal projects: just talk. It makes mistakes, but considering the code I see in professional settings, I'd rather deal with an agent than with third parties.
reply
I love the IDE skins analogy. Very true.
reply
Everyone knows that a red UI skin goes faster
reply
How do you collect these stats?

Is it by characters typed by a human vs. generated by the AI, or by commit, or something else?

reply
> How do you collect these stats?

Cursor dashboard. I know they're incentivized to over-estimate, but it feels directionally accurate when I look at recent PRs.

reply
Are you mostly using the Composer model?
reply
> Are you mostly using the Composer model?

Don't really think about it. I think when I talk to it through Slack, Cursor uses Codex; in my IDE it looks like it's whatever the highest Claude model is. In GitHub comments, who even knows.

reply