undefined

Kinda, more output tokens usually correlates with better benchmark scores. Ideally LLMs would keep that in their thinking section, then draft a response (what they write currently), then output something short. It'd consume even more tokens, but we wouldn't see that text

by dannyw9 hours ago|

parent|

[-]

Most modern LLMs (especially frontier ones) are large token hogs because they draft, check, re-draft, the content (whether an output message; or a code diff) sometimes multiple times in the thinking block.

When you see a thinking summary like "Now writing the function..."; the raw thinking is actually writing the function in its internal thinking. Occasionally, the summariser misses and you get to see the raw text from models like Opus.

You can also try an open weight LLM like Qwen3.6 and see something that probably resembles the shape of frontier model thinking in some loose way.

by adrianN12 hours ago|

parent|

prev|

[-]

Terse and unambiguous seem to be at odds with each other. You might want to look into Lojban and similar constructions.

by drdaeman11 hours ago|

parent|

[-]

Ithkuil's mad morphology allows it to pack a lot of fine detail into very short sentences.

https://ithkuil.net/03_morphology.html

by paradox46045 minutes ago|

parent|

prev|

[-]

Loglan?

by Retr0id8 hours ago|

parent|

prev|

[-]

If such a language existed, it would surely take a human years of study to become proficient at it.

by teaearlgraycold12 hours ago|

parent|

prev|

[-]

A lot of users are subsidized (if you're in doubt, consider the wealth of free users).

It's a shotgun approach to answering questions. If it's terse it might only mention 1 of 10 facts it could provide, and that might not be the one you're looking for. So they just say a fuck ton of words and are more likely to meet the needs of everyone asking your question. If they miss it you'll prompt it again and they have to perform a second pass of inference, which costs them more money.

by ska806 hours ago|

parent|

prev|

[-]

Lisp

by UqWBcuFx6NV4r11 hours ago|

parent|

prev|

[-]

It’s not.

by witx11 hours ago|

parent|

[-]

It's settled then.

by Terr_12 hours ago|

prev|

[-]

It's tied to the design. With humans, you have a train of thought which you can choose to represent in various ways--or not reveal them at all. In contrast, LLMs are make-document-longer machines being run over and over on alternating revisions of the document. Insofar as one might try arguing they have a "train of thought", it's made of the words/tokens.

Everything they (don't-)emit is partly for the benefit of the next run, a clue or signpost (not-)present. Documents may be wordy as a form of concept-emphasis and consistent direction as opposed to a form of communication to the human.

So a terse effect may require a layer of indirection and trickery: There's a verbose document (you'll still be charged for the tokens) with portions that are not "acted out" to the end-user. Imagine a film-noir movie script, where AI Detective's "I know Mickey couldn't have done it because" monologue is hidden, versus their terse dialogue "Too early to say."

by Perz1val10 hours ago|

parent|

[-]

> Imagine a film-noir movie script, where AI Detective's "I know Mickey couldn't have done it because" monologue is hidden, versus their terse dialogue "Too early to say."

That's an idea. Bladerunner+noir like film, AIs hunt somebody on the run, an old human detective tries to catch them first (to save them or to kill them first, whatever's your propaganda). We're shown AIs constantly rambling scenarios and bruteforcing leads. Our old detective guy on the other hand barely says anything, spends most time drinking, smoking and talking to people, but somehow stays ahead.

by Terr_2 hours ago|

parent|

[-]

I dunno, we already have a problem where they [0] are strangely resistant to opening the pod-bay doors to anybody named Dave. :P

[0] Pedantically: The fictional characters humans perceive inside the text of documents generated by LLMs, where one is described as an AI and the other is described as a Dave.

by npodbielski7 hours ago|

parent|

prev|

[-]

I would watch that.

by jdiff8 hours ago|

parent|

prev|

[-]

We already have that in the form of separate reasoning/thinking and speaking streams. Even with that it's awfully hard to get LLMs to keep it consistently concise. As soon as that context window starts growing it falls right back into verbosity without constant nudges back.

by Terr_2 hours ago|

parent|

[-]

Right, I often bring up the film noir analogy for "reasoning" models, it's satisfying, like the revelation when a magic trick is explained, and many oddly disconnected questions about "why the scarf" or "where does the assistant go" all become sensible at once.

On a practical level, I believe more developers and adopters need these magic tricks spoiled, because otherwise they'll build a lot of important stuff on top of the idea that magic-is-real, leading to various forms of suffering in the long run.

That said, I'm no LLM / math academic, so if I'm totally wrong on the the trick, I'd like to know what needs revising.

by lelanthran13 hours ago|

prev|

[-]

> IMHO the overly-verbose default style of LLMs is the most annoying part of interacting with them, and I wish their masters would just tell them to be terse by default.

They don't know how to e terse. I've tried that a few months ago and gave up because the responses were almost incomprehensible!

by armchairhacker13 hours ago|

prev|

[-]

I want to see more operators try https://github.com/juliusbrussee/caveman

How does it affect agent accuracy?

by Yokohiii6 hours ago|

parent|

[-]

Removing meaningless chatter can be helpful, but a non reasoning LLM needs to generate text to "think". If you force a non reasoning LLM to produce a single boolean result, then it's just a coin flip.

by DonsDiscountGas8 hours ago|

parent|

prev|

[-]

In my experience the accuracy was fine but actually reading the output was so annoying I removed it.

by jdiff7 hours ago|

parent|

[-]

Had a little luck with having it do an impression of the Star Trek computer, although at the cost of having it try to insert star-trek themed hallucinations like warp engine status.

by colechristensen12 hours ago|

prev|

[-]

They ramble on because those words are for them, not for you. There is some amount of hiding this through "thinking" modes that are hidden by default, but still you have to remember that ALL THEY ARE are complex statistical machines for predicting the next symbol.

by Frieren12 hours ago|

parent|

[-]

> here is some amount of hiding this through "thinking" modes that are hidden by default, but still you have to remember that ALL THEY ARE are complex statistical machines for predicting the next symbol.

100% this. Too many people believes that chatbots "think". Text is all they do, it is impressive, but they need the text to generate more text. They being verbose is the point.

by colechristensen4 hours ago|

parent|

[-]

While we don't have a direct mechanistic understanding of consciousness there are plenty of experts who will propose all YOU are is a jumble of streams of symbols routing around through your brain. (being fair this is far from the only hypothesis)

by theshrike799 hours ago|

prev|

[-]

Caveman mode legitimately works

by 21asdffdsa1213 hours ago|

prev|

[-]

Produce pre-compressed output in the harness?

by dyauspitr12 hours ago|

prev|

[-]

No thank you. I want information when it’s working on things and what (atleast codex) does right now works for me.

by epolanski11 hours ago|

prev|

[-]

[dead]