undefined

points

by gbro3n6 hours ago |

comments

by weird-eye-issue6 hours ago|

[-]

Let me let you in on a little industry "secret"

You can't trust those results no matter what

The pages that they pull in to source that data all contain affiliate links and companies contact websites to get their tools to the tops of those lists by paying money often monthly. I know this because I do this...

It's basically standard SEO but it also manipulates AI like ChatGPT very very easily

by SlinkyOnStairs4 hours ago|

parent|

[-]

> It's basically standard SEO but it also manipulates AI like ChatGPT very very easily

There are key differences.

1) Google doesn't get paid for the SEO, so even is crime is involved, Google isn't directly responsible.

2) AI ads are unmarked, which is illegal pretty much everywhere. And because of the way LLMs work, it is impossible to tell where a given output came from, neither which part of the prompt/context nor whether it's from the prompt or training.

by jmathai4 hours ago|

parent|

[-]

> 1) Google doesn't get paid for the SEO, so even is crime is involved, Google isn't directly responsible.

Google doesn't get paid directly for the SEO but they definitely benefit monetarily. Do a recipe search and ask yourself if these are the results the user would like to see. Google benefits by not penalizing sites which litter themselves with ads. It's not that indirect.

by threetonesun1 hours ago|

parent|

prev|

[-]

Why would AI ads be unmarked? Most of the Google AI search results I get show sources. They're just summarizing top results for you, injecting a ad shown as an ad into that isn't tremendously different than how Google worked before.

by weird-eye-issue4 hours ago|

parent|

prev|

[-]

I'm just talking about the methods that business owners can use for getting good SEO or AI recommendations are basically the same thing, not sure what point you are trying to make?

by faangguyindia6 hours ago|

parent|

prev|

[-]

Simplest way to do is by running affiliate program for your SaaS and shady marketers will do everything to get sales if it's profitable.

by weird-eye-issue5 hours ago|

parent|

[-]

Eh not really

They won't get you on any worthwhile list unless it's their own because it's too risky for them and any site they would publish it on would want to use their own affiliate link. Unless of course we are talking about something like Medium or YouTube which does work

And then of course there's the fraudsters who will bid on branded keywords we have banned dozens of people for that

by nunez2 hours ago|

parent|

prev|

[-]

The cheat code for that used to be Reddit before they got growth-hacked 10+ years ago.

by weird-eye-issue2 hours ago|

parent|

[-]

Actually Reddit is receiving more organic traffic than ever before and is more valuable to game than ever

But yes actually I was doing this about 15 years ago in the men's fashion subreddit for one of my companies lol

by kalaksi1 hours ago|

parent|

[-]

Can you elaborate a bit on how that looks like in practice?

by reactordev6 hours ago|

parent|

prev|

[-]

This is why local AI is so important

by bayindirh6 hours ago|

parent|

[-]

It's already being trained on "public" (ethical or otherwise) data. So, it already has ingested that kind of "optimization" during pre-training and training.

I don't think you can fine-tune your way out of it.

by ToucanLoucan5 hours ago|

parent|

[-]

People still think these things are smart. That if their word generator eats enough of the Internet, it will somehow give them the real information that's otherwise hidden. Or perhaps a better word; filter the bullshit.

To filter bullshit it would first have to understand bullshit, and it doesn't. That's why an LLM will tell you the solution to a problem that doesn't work, and argue with you when you correct it.

by reactordev1 hours ago|

parent|

[-]

Sadly critical thinking skills have atrophied steeply in the last decade.

by bayindirh5 hours ago|

parent|

prev|

[-]

This is what bothers me a lot. For the people who doesn't know how it's made or want to believe, it's a miracle.

For me, it's a resource wasting text generator. I'll not lie, I don't use OpenAI, Mistral or Anthropic's models, even for coding. I prefer to read my API docs and cry once.

I used Gemini, five or six times in total. Twice I asked a couple of very specific things, and it unearthed them. Since they were not products, but information, that was helpful. Twice, it has given wrong information. When I "told" it, there was another way, it said "of course there are two ways", etc. Tasteless and time wasting.

I don't like using an LLM all day long, or offload my thinking to them. It's the ultimate self-poisoning incident.

And as you say, these algorithms can't know right/wrong/logical/bullshit, etc. They just spew out text.

by satvikpendem3 hours ago|

parent|

[-]

I was just reading another post yesterday and your comment reminds me of this one [0], same sort of format and experience of the submitted article of the HN post that comment is on.

[0] https://news.ycombinator.com/item?id=48211730

by latexr4 hours ago|

parent|

prev|

[-]

Something I’ve also seen multiple times is an LLM giving wrong information, I tell it it’s not right, then it tells me I’m “absolutely right” and it provides the exact same answer and tells me that one will work.

by fsflover5 hours ago|

parent|

prev|

[-]

This is far from widespread at the moment, so it'll be possible to at least use the current cutting-edge models locally in the future.

by bayindirh5 hours ago|

parent|

[-]

Far from widespread? SEO has seeped to all crevices of the internet for the last 20 years.

by fsflover4 hours ago|

parent|

[-]

By this measure, any information you can get whatsoever is biased and there is no reason to trust anything at all.

by satvikpendem3 hours ago|

parent|

[-]

All information has some sort of bias, as no information can truly be unbiased. There is no reason to trust any specific piece of information but taken in aggregate one can disambiguate the biases.

by fsflover2 hours ago|

parent|

[-]

So we are in agreement.

by latexr4 hours ago|

parent|

prev|

[-]

The major difference is that right now when you land on a page you can do your due diligence and decide if you trust the source. You can still be tricked, but it’s harder and you can get better at the detection.

With LLMs, everything is given the same importance so you have no idea if the data came from a reputable source or an obvious SEO junk website.

by fsflover3 hours ago|

parent|

[-]

AI can also provide the sources. And if you need to be certain, you should ask for that.

by rplnt6 hours ago|

parent|

prev|

[-]

That doesn't solve this particular problem. Your local model was trained on reddit comments written by bots.

by soloto5 hours ago|

parent|

prev|

[-]

Local AI will have the bias that existed at the time of its training, which is different from no bias. For stuff that needs to be current, a local LLM would need to search the net regardless.

by embedding-shape5 hours ago|

parent|

[-]

And since "no bias" isn't something that actually exists in reality when it comes to language or even anything near humans, "bias in local model I can introspect" will always be miles ahead of "bias I know is there, but cannot introspect".

by Schweigerose5 hours ago|

parent|

prev|

[-]

How do you make sure that the model you run locally is not tainted? Is there even a way to confirm this without providing the complete training set?

by psb55 hours ago|

parent|

[-]

Fwiw I just run kiwix/zeal locally which has old school search index of all articles in wiki/stackoverflow etc. That seems enough for most of my day to day use.

by jondea5 hours ago|

parent|

prev|

[-]

It's less compromised, but it's still basing the answer on compromised queries. This is why I pay for independent reviews (e.g Which) where their incentives are more aligned with yours.

by 4 hours ago|

parent|

prev|

[-]

deleted

by rdtsc5 hours ago|

parent|

prev|

[-]

Not if the models come from Google. The ads will be implicit in the model. X is better that Y an Z would be easy to add to a the training set.

by pautasso3 hours ago|

parent|

[-]

Does this mean the model must be retrained every time a new ad is posted? How much are AI ads going to cost?

by rdtsc3 hours ago|

parent|

[-]

Yeah, I meant not individual ads but implicit forced/influenced preference for certain brands. Let’s say it always picks Coke vs Pepsi when giving an example of a soft drink. Or picks BMW when asked to pick the best car. Which cloud provider is the best? -Why, GCP of course, etc.

Companies then get to bid for a preference “place”. This is more like Google paying to be the search engine default in Firefox.

by FergusArgyll6 hours ago|

parent|

prev|

[-]

How does that help if it's using search? You get whatever the search engine outputs

by weird-eye-issue5 hours ago|

parent|

prev|

[-]

Local AI models pull in search results just like ChatGPT does ...

And they are trained on web data just like any other model...

by nekzn6 hours ago|

prev|

[-]

Sorry to tell you that all websites you get when you google "what is the best tool for doing x" are already manipulated, including reddit conversations.

by _heimdall6 hours ago|

parent|

[-]

Don't forget the YouTube videos, those "top 5 x" robot videos are the worst.

by adverbly6 hours ago|

prev|

[-]

Those sort of things are already highly biased because of the marketing spam that the modelsmare trained on.

I'd be more worried about AI convincing you that you need a product or expensive solution when you actually don't.

by HEX4AGON3 hours ago|

prev|

[-]

This has always been the case but with AI its going to get even worse. I mean a lot of people associate AI with higher "intelligence" sorta say, now you sprinkle in some political propaganda there from the highest bidder and you are going to have a big problem in the future especially if the populace ended up trusting these corpo AI blindly.

by thinkingtoilet21 minutes ago|

prev|

[-]

What were you doing before hand? Do you believe you were getting perfectly unbiased information?

by LastTrain5 hours ago|

prev|

[-]

Then you already can’t use it because it already doesn’t give you a result like that.

by justincormack5 hours ago|

prev|

[-]

There is no “true answer given all available infomation” maybe unless you give an eval function.