undefined

points

by amelius9 hours ago |

comments

by guerrilla9 hours ago|

[-]

Why not? It did the work. Why should you expect it to be omniscient?

We can rank them based on how much they know and people will gravitate towards those that do know more.

It's a market after all.

by mmmattt9 hours ago|

parent|

[-]

If it’s a market, wouldn’t the incentive be to lie about knowing and thus to keep the hallucinations?

by BDPW9 hours ago|

parent|

[-]

If you had an llm that could accurately predict when a claim is uncertain it would be very popular, I think. I would pay for that kind of reliability tbh

by intended8 hours ago|

parent|

[-]

This would break reality. There’s some underlying physical law that prevents the existence of any algorithm of truth.

by embedding-shape8 hours ago|

parent|

[-]

> There’s some underlying physical law that prevents the existence of any algorithm of truth

Haven't heard about that law, but seems unlikely we can come up with ("discover") any sort of law that uses a concept ("truth") humans can't even agree what it means, and that's not for a lack of trying, we've been trying to figure it out for millenniums already with no end in sight.

by derektank8 hours ago|

parent|

prev|

[-]

If you accept certain axioms a priori, it’s fine. If you simply let the machine intelligence take it for granted that induction works because nature is uniform and give it some way to test its predictions, it would have all the building blocks it needs to reason out a lot of very useful information. Which as the parent comment points out, people would absolutely pay a lot of money for.

by Greenpants9 hours ago|

parent|

prev|

[-]

Up to the point where consumers notice and decide to stop using these models because of it.

Might be why we're already rarely seeing models output an "I don't know".

by nutjob28 hours ago|

parent|

prev|

[-]

According to your logic the market will produce an LLM that consists only of 'PRINT "I don't know."'.

by skillina8 hours ago|

prev|

[-]

"I don't know" has positive value, presumably you could prompt further to learn more about where it got stuck. It also increases the value of correct answers, by improving confidence that answers are actually correct.

"Confidently incorrect" has negative value. At best, a human realizes the answer is wrong and At worst, the incorrect information makes is not identified and can cause untold damage. By having the potential to be so severely wrong, it lessens the value of correct answers because there is a lower confidence value on their output.

by embedding-shape8 hours ago|

prev|

[-]

Depends on what your understanding of the product is.

If someone sold you a "Solved all your problems" machine, and it suddenly doesn't solve all your problems, then probably no, you shouldn't pay.

But the way I'm being sold LLMs, is basically "A text generator that gives your plausible-sounding human text that sometimes hallucinates and gets things wrong, based on your input", then regardless of what the outcome is, I still made use of the "Input > Output" part, which is what I bought into, so I should still pay for that.

Now of course bunch of people will say they been sold the former, but the companies themselves seem to be selling the latter. That's my perspective from a person who doesn't follow "influencers" and what not though, which seem to be selling the public on the former rather than the latter.

by al_borland7 hours ago|

parent|

[-]

Let's pretend I am someone who has heard people talk about ChatGPT, but have on idea what it actually is. I go to the website and am not presented with any information, just a prompt. So I ask it what it is and what it can do for me.

My ask:

> In a couple sentences, explain to me the product I'm being sold with ChatGPT. What does it do for me?

The Reply from ChatGPT:

> ChatGPT is a conversational AI that helps you think, create, learn, analyze, and get things done faster. You can use it to answer questions, draft and edit writing, summarize information, brainstorm ideas, learn new topics, write code, plan projects, and increasingly act as an assistant that can search for information, work with documents, generate images, and help complete tasks.

> In simple terms: you're buying access to an AI that turns natural language into useful work—saving time, expanding your capabilities, and giving you an always-available collaborator for both everyday tasks and specialized knowledge work.

This sounds much more like the former, a "solve all your problems" machine.... not a plausible-sounding text generation machine.

Only two weeks ago Sam Altman said their new data center "could" be where cancer gets cured[0]. It is only the people who deeply understand AI who see it as a text generator of plausible-sounding text. That isn't what the marketing department, the CEO, or the product itself seem to be saying. I'm using OpenAI as the example here, but the others don't seem much different.

[0] https://www.youtube.com/watch?v=9-tOtbDDrJA

by embedding-shape7 hours ago|

parent|

[-]

In this hypothetical case of a us being new users, you now know it's a conversational AI, so you continue asking:

> Can I trust the output you give me?

And I assume it explains what to trust VS not.

I think in the bottom you should also see something like "Any text can contain mistakes" or similar too, which I know is a far cry from what some people push in the press in regards to capabilities, but I still don't see the platforms themselves as lying about this, while I do see a bunch of people constantly over-hyping the possibilities.

by al_borland3 hours ago|

parent|

[-]

I don't think coming at it from the perspective of a new user is that hypothetical. All current users were new users in just the last 3 years. There are still a significant number of people who have heard of it, but haven't used it, or are still very new to it.

I'm not sure why "can I trust the output you give me?" would be a logical followup to the first response it gave me, seeing as it's response didn't say anything about hallucinations or mistakes. It said it could do "useful work" with all kinds of examples, including "specialized knowledge work".

The note under the text field, in gray as to not draw the user's attention, feels more like a CYA line from the lawyers, rather than an instruction they really want users to take to heart. That line also doesn't appear on the main home page. I only shows up after the first prompt is submitted and focus shifts to the conversation. I don't think a CYA line in gray fine print is enough to make users understand it's a plausible-sounding text generation machine instead of an answer machine. Even if I ask that point blank it gives a wordy... yes, but not really, it's being debated by philosophers... response.

by skinfaxi7 hours ago|

parent|

prev|

[-]

The marketing materials are very much the former though. From claude.com:

> If you can dream it, Claude can help you do it. Claude can process large amounts of information, brainstorm ideas, generate text and code, help you understand subjects, coach you through difficult situations, simplify your busywork so you can focus on what matters most, and so much more.

What marketing copy have you read for LLMs that is like you mentioned?

> But the way I'm being sold LLMs, is basically "A text generator that gives your plausible-sounding human text that sometimes hallucinates and gets things wrong, based on your input"

by amelius3 hours ago|

parent|

prev|

[-]

They are selling the former to investors, while selling the latter to us.

by ludwik7 hours ago|

prev|

[-]

I would be very willing to pay more! The choice between “you may get a correct answer, or you may get lied to, without a clear way to distinguish between the two” and “you may get a correct answer, or a clear indication that the answer was not found” is pretty clear. One is a much more useful tool than the other. I don’t see any real incentives for companies making LLMs to keep their AI factually unreliable. (Full disclosure: I work for one, but I’m definitely not in the rooms where such decisions would be made.)

by 7 hours ago|

prev|

[-]

deleted

by maxbond6 hours ago|

prev|

[-]

Would you rather pay for a nonsensical explanation?

by nutjob28 hours ago|

prev|

[-]

'I don't know' is the correct answer for infinitley more questions than those that can be answered.