by chunky1994 9 hours ago

> Does anyone use LLMs in such a manner that they believe it always has the most up to date information (without web search tools?).

Probably just 95% of the users. You know, the non-techies.

by bookofjoe 9 hours ago

https://archive.ph/YwpVw

by Peritract 8 hours ago

The AI hype and overstatement of capabilities are at least as strong amongst the 'techies' as amongst the people they treat as more credulous than themselves.

by sidrag22 8 hours ago

Without a doubt, yes. I'd encourage you to try a session on a free ChatGPT account, asking the kinds of questions you think a parent or someone unfamiliar with the space would ask.

Not only will it give confidently incorrect answers, it also won't run a web search in obvious scenarios where it should.

The words here aren't meant as a warning for people in this type of community; they're more for the general public that doesn't grasp the tools they are using, the people who won't ever come across this article.

This, I think, is a huge reason we need LLM basics classes or something similar as soon as possible. Someone others consider "smart" will talk about how great ChatGPT or a similar tool is, and a person who respects them will go try it out, assuming they must be right. They'll hop on the free model, get an absurdly inferior product, and not grasp why. They'll ask something that requires a web search to augment the model's info, not get that web search, and assume the confidently incorrect agent is correct.

The thesis, I think, is also not entirely about lacking up-to-date info at query time; it's broader than that. Someone asks what product they should use to mash potatoes, and a tool is suggested. Everyone who asks then receives that same recommendation, and instead of having a range of different styles of mashing potatoes, we all drift towards one style, and the variety in how food is prepared is slowly lost.

by xlii 9 hours ago

Gemini can be asked about current events. I was quite surprised that it was able to give structured information about a live boxing event in real time.

by vorticalbox 9 hours ago

Most agents/chats have access to web search. I’m not overly surprised that it can do it, but it is very nice when it actually works.

by layer8 8 hours ago

Most users probably don’t ask themselves the question and are simply, unwittingly, affected by how the model happens to be wired.

by amluto 9 hours ago

Why do you expect web search tool calls to continue to be useful in the presence of modern AI slop farms, AI-assisted SEO, and search engines largely turning themselves into AI-based question-answering engines?

(At present, Gemini's question-answering capability (which Google kind of makes its users use) seems extremely error-prone -- much worse than competing LLMs when asked the same question.)

by fl4regun 8 hours ago

I agree with you, this is a huge concern, and we are still in an age where most content on the internet isn't AI-generated yet. What about 10 years from now? We already have many instances of people writing posts on Reddit or uploading videos and blogs using AI-generated text. What happens when that is a significant percentage of content?

I recently saw a video discussing a researcher who published a fake scientific article about a fictitious disease, with bogus author names, even a warning IN the article itself that stated "This is not a real disease, this article is not real" (paraphrasing) but still AI ended up picking up this article and serving information from it as if it was a real disease.

It even got cited in papers (which were later retracted, of course), but the fact that those papers got published in the first place is a serious issue.

by amluto 8 hours ago

> I recently saw a video discussing a researcher who published a fake scientific article about a fictitious disease, with bogus author names, even a warning IN the article itself that stated "This is not a real disease, this article is not real" (paraphrasing) but still AI ended up picking up this article and serving information from it as if it was a real disease.

Isn’t a lot of pretraining done by chopping sources up into short-context-window-sized pieces and then shoving them into the SGD process? The AI-in-training could be entirely incapable of correlating the beginning with the end of the article in its development of its supposed knowledge base.
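
To make that concrete, here is a toy sketch of the chunking concern. It assumes fixed, non-overlapping windows and uses a whitespace split as a stand-in for a real tokenizer; the article text, the `chunk_tokens` helper, and the window size of 8 are all made up for illustration. Actual pretraining pipelines vary (many pack or overlap documents, and context lengths are far larger), so this only shows how a disclaimer and a claim can end up in separate training samples, not how any particular model was trained.

```python
def chunk_tokens(tokens, window):
    """Split a token list into consecutive, non-overlapping windows."""
    return [tokens[i:i + window] for i in range(0, len(tokens), window)]


# Hypothetical article: the disclaimer comes first, the claims come later.
article = (
    "WARNING this disease is not real . "
    + "filler " * 20
    + "the disease causes fever and fatigue ."
)

tokens = article.split()  # whitespace split stands in for a real tokenizer
chunks = chunk_tokens(tokens, window=8)  # tiny window, purely for illustration

# Indices of the chunks that contain the disclaimer and the medical claim.
warning_chunks = [i for i, chunk in enumerate(chunks) if "WARNING" in chunk]
claim_chunks = [i for i, chunk in enumerate(chunks) if "fever" in chunk]

print("chunks with the warning:", warning_chunks)
print("chunks with the claim:  ", claim_chunks)
```

If each chunk is consumed as an independent sample, the one carrying the claim contains no trace of the warning, which is the failure mode described above.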