undefined

[-]

What if you substituted "steel" with "asbestos" in your argument.

by izucken10 minutes ago|

[-]

Yeah but well you see, humans did not go extinct from just asbestos!

by irishcoffee1 hours ago|

[-]

Asbestos, lead paint, cigarettes, heroin(perscribed generously for basically whatever the doc felt like), "Radithor" (patent medicine containing radium-226 and 228, marketed as a "perpetual sunshine" energy tonic and cure for over 150 diseases), bloodletting, mercury treatments for syphilis, tobacco smoke enemas (yep that was a real thing), milk-based blood transfusions.

Didn't understand those either and used the fuck out of them because "the experts" said we should.

by JoshGG2 hours ago|

[-]

Which year did we use steel to replace human workers and automate decision-making?

[-]

Around 1928ish

by carlosjobim2 hours ago|

[-]

The entire industrial revolution was steel replacing human workers. And that is still the backbone of the world today. We are still living the industrial revolution.

Just like the invention of fire happened ages ago, but is still a crucial part of life today.

by surgical_fire13 minutes ago|

[-]

No, it was actually engines.

The mechanism behind engines were fully understood, any experiments with engines were reproducible and measurable. You could get an engine and create schematics by reverse engireening it.

LLMs, useful as they may be, are not that.

by almostdeadguy2 hours ago|

[-]

Famously Andrew Carnegie spent years trying to get the steel to stop talking about goblins.

by gus_massa14 minutes ago|

[-]

Steel is almost magic. Stainless steel is beyond magic.

I had a specialization in Chemistry in High School. For some analysis, the fist step is to dissolve everything in boiling Nitric Acid. But stainless steel has Chrome is like a spell of protection, so you must use boiling Hydrochloric Acid instead. I have no idea why. It's just like magic. It may have Nickel, Molybdenum, and other metals, that give it more magical properties.

A few years ago there was a nice post about copying a normal steel alloy for knives to get an equivalent made of stainless steel. You need to reduce the the Carbon content to make it less brittle. And they had to add Vanadium so it keeps the sharpness of the knives. I have no idea why. It's just like magic.

If you have half an hour, it's worth reading, but beware that it has too many technical details that are close to magical https://knifesteelnerds.com/2021/03/25/cpm-magnacut/ (HN discussion https://news.ycombinator.com/item?id=29696120 | 375 points | Dec 2021 | 108 comments)

[-]

Famously Andrew Carnegie dodged the point

by almostdeadguy1 hours ago|

[-]

That the industrial revolutions use of steel to augment or replace labor was similar in every way to using LLMs to do the same? Seems on point to me.

by qwery1 hours ago|

[-]

Assuming your timeline and metallurgical claims to be true, you're conflating engineering and (materials) science.

Humans have been using steel for however long, when and where it was understood to be an appropriate solution to a problem. In some sense, engineering is the development and application of that understanding. You do not need to have a molecular explanation of the interaction between carbon and iron to do effective engineering[-1] with steel.[0] Science seeks to explain how and why things are the way they are, and this can inform engineering, but it is not prerequisite.

I think that machine learning as a field has more of an understanding of how LLMs work than your parent post makes out. But I agree with the thrust of that comment because it's obvious that the reckless startups that are pushing LLMs as a solution to everything are not doing effective engineering.

[-1] "effective engineering" -- that's getting results, yes, but only with reasonable efficiency and always with safety being a fundamental consideration throughout

[0] No, I'm not saying that every instance of the use of steel has been effective/efficient/safe.

by ashtonshears1 hours ago|

[-]

Poor correlation comparing physical material to computer technology

by idiotsecant45 minutes ago|

[-]

Why

by datsci_est_201517 minutes ago|

[-]

Let me just quickly use absurdism to illustrate why argument by analogy is weak (and unfortunately overused on HN):

“”” Humanity has been using celibacy for over a millenia, however it's only in the past 100 years or so we have a good understanding of not having sex affects the psychology of a person, turning them into an ubermensch. Based on this argument, we should have never stopped having sex, until we had a complete first principles understanding. “””

Analogies can produce a lot of words, making it appear to be a high effort comment, but it also shifts the argument to why or why not an analogy is good or not, and away from the points the original poster was trying to make. And, by Sturgeon’s Law, most analogies are utter crap on top of being an already weak way to form an argument.

by surgical_fire16 minutes ago|

[-]

This is a very low-effort argument.

Humans could understand properties of steel long before they knew how Carbon interacted with Iron. Steel always behaved in a predictable, reproducible way. Empirical experiments with steel usage yielded outputs that could be documented and passed along. You could measure steel for its quality, etc.

The same cannot be said of LLMs. This is not to say they are not useful, this was never the claim of people that point at it's nondeterministic behavior and our lack of understanding of their workings to incorporate them into established processes.

Of course the hype merchants don't really care about any of this. They want to make destructive amounts of money out of it, consequences be damned.

by nutjob23 hours ago|

[-]

That's not his point at all. He advocates using LLMs.

The correct analogy is: if we just scale and improve steel enough, we'll get a flying car.

by lukan3 hours ago|

[-]

Well, we did build airplanes out of steel, but there are better (lighter) materials avaiable. But the developement of car engines did directly enabled airplane engines. Not sure if this is the right analogy path, but I kind of suspect similar with LLM's/transformers. They will be a important part.

by jagged-chisel2 hours ago|

[-]

An important stepping stone, perhaps. But I don’t think the final AGI thing will necessarily contain LLMs.

by lukan1 hours ago|

[-]

I don't know. I know I used to be pretty AI sceptic, until they became good enough to help with non trivial code problems on their own.

I strongly suspect, that we will come to a point, where it gets impossible to tell if something is AGI and consciouss or not.

by dreis_sw58 minutes ago|

[-]

History shows continuous evolution, there won't be a "final AGI thing". The definition of AGI is so vague anyways that any conversation around it is hardly useful. 5 years ago, what we have today would have been considered AGI.

by nutjob255 minutes ago|

[-]

> Well, we did build airplanes out of steel, but there are better (lighter) materials avaiable.

That's exactly my point. In this analogy LLMs are steel, but the flying things are made out of aluminum, lithium and titanium and not steel. We need a better idea than LLMs because LLMs's are not suddenly going to turn into something they are not.

[-]

We literally did that though. Walk outside and look up.

by hansmayer1 hours ago|

[-]

Oh for crying out loud! Let's stop inventing fake analogies to justify the inherent LLM shortcomings! Those of us who are critical - are only using the standards that the LLM companies set themselves ("superintelligence", "pocket phds" bla blabla), to hold them accountable. When does the grift stop?

by 35 minutes ago|

[-]

deleted

by dakolli3 hours ago|

[-]

pro LLM people are the kings of ad hoc fallacy. Why did you type this? You can consistently test steel and get a good idea of when and where it will break in a system without knowing its molecular structure.

LLMs are literally stochastic by nature and can't be relied on for anything critical as its impossible to determine why they fail, regardless of the deterministic tooling you build around them.

[-]

> LLMs are literally stochastic by nature and can't be relied on for anything critical

Ahh, yes, unlike humans, who are completely deterministic, and thus can be trusted.

[-]

Humans can be governed by rules with consequences and replaced with individuals with a appropriate level of risk taking / rule following for the role.

by somewhatgoated1 hours ago|

[-]

Rules and consequences seem to apply to humans in a similar way as prompts and harnesses govern LLMs. The greater the level of power a human possesses the less they are governed by these restraints, this doesnt apply to LLMs so at least in that aspect they are an improvement. But yea we can’t really punish or inflict pain on them - this seems like a problem

[-]

I think a simpler model is variety.

There are billions of people, you can interview/hire/fire until you get the right match.

There are 2? frontier LLM providers. 5? if you are more generous / ok with more trailing edge.

Everyone thought OpenAI was great, until Claude got better in Q1 and they switched to Anthropic, and then Codex got better and a good chunk moved back to OpenAI.. Seems kind of binary currently.

by handoflixue1 hours ago|

[-]

Why does it matter if you can inflict pain on them? Is that normal and acceptable in your line of work?

by wild_egg1 hours ago|

[-]

Being able to fire someone, thus causing potentially significant hardship, is considered quite normal and acceptable in most lines of work.

by somewhatgoated41 minutes ago|

[-]

Yea I didn’t mean actual physical violence but rules need painful consequences in some way to be meaningful?

by GigaDingus1 hours ago|

[-]

Which has, famously, been a great consolation for people who suffered the consequences of human failure in the past

by handoflixue1 hours ago|

[-]

That seems like it applies just fine to LLMs as well: You can replace an LLM with a different model, different prompts, etc. for the appropriate level of risk taking. Rule following is even easier, given you can sandbox them.

[-]

Theres at best a handful of frontier models vs billions of people and millions of SWEs.

[-]

You clearly have never met a human

[-]

If you cannot get humans to do roughly what you want as a manager, good luck with LLMs.

by hansmayer1 hours ago|

[-]

Wow, such a nasty view to hold. What's next, the Altman's bullshit argument about "all the food" that the humans need to grow up and develop brain ? Humans are intelligent. Humans can generalise and invent new concepts, ideas and art. LLMs are none of that.

by keybored2 hours ago|

[-]

What is the ad hoc fallacy? From googling I didn’t find any convincing definitions (definitions that demonstrate that it is a logical fallacy).

by jibal2 hours ago|

https://finmasters.com/ad-hoc-fallacy/

[-]

> Ad hoc fallacy is a fallacious rhetorical strategy in which a person presents a new explanation – that is unjustified or simply unreasonable – of why their original belief or hypothesis is correct after evidence that contradicts the previous explanation has emerged.

https://cerebralfaith.net/logical-fallacy-series-part-13-ad-...

> An argument is ad hoc if its only given in an attempt to avoid the proponent’s belief from being falsified. A person who is caught in a lie and then has to make up new lies in order to preserve the original lie is acting in an ad hoc manner.

It should be clear why the ad hoc fallacy is a fallacy.

by abcde6667773 hours ago|

[-]

Where did he say not to use LLMs? Oh that's right: he didn't.

by jsenn2 hours ago|

[-]

The article you are responding to showed that a strange LLM behaviour was caused by a training signal that was explicitly designed to produce that type of behaviour. They were able to isolate it, clearly demonstrate what happened, and roll out a mitigation using a mechanism they engineered for exactly this type of thing (the developer prompt). That doesn’t sound like sorcery to me. If anything I’m surprised you can so easily engineer these things!

by harrouet2 hours ago|

[-]

The article I am responding to (which I've read) shows that these LLMs come with all sorts of hacks (= context bits) to make it behave more like this or more like that.

There is probably a whole testing workflow at AI companies to tweak each new model until it "looks" acceptable.

But they still don't understand what they are doing. This is purely empirical.

by airstrike47 minutes ago|

[-]

That all of their model outputs should be influenced by whatever personality prompt voodoo the wise artisan at OpenAI decided to stuff it with during RL should give everyone pause.

That Nerdy personality prompt made me gag. As a card-carrying Nerd, I feel offended

by LeonB2 hours ago|

[-]

…months after it began.

by killerstorm3 hours ago|

[-]

What does LLM need to do for you to consider it "smart"?

To me they seem to be pretty damn smart, to put it mildly. They sometimes do stupid things - but so do smart people!

by benrutter2 hours ago|

[-]

Not OP, but I think the argument here would be not that LLMs "are not smart" but that smart is just the wrong category of thing to describe an LLM as.

A calculator can do very complex sums very quickly, but we don't tend to call it "smart" because we don't think it's operating intelligently to some internal model of the world. I think the "LLMs are AGI" crowd would say that LLMs are, but it's perfectly consistent to think the output of LLMs is consistent/impressive/useful, but still maintain that they aren't "smart" in any meaningful way.

[-]

> "we don't think it's operating intelligently to some internal model of the world"

Okay, but you have to actually address why you think LLMs lack an "internal model of the world"

You can train one on 1930s text, and then teach it Python in-context.

They've produced multiple novel mathematical proofs now; Terrance Tao is impressed with them as research assistants.

You can very clearly ask them questions about the world, and they'll produce answers that match what you'd get from a "model" of the world.

What are weights, if not a model of the world? It's got a very skewed perspective, certainly, since it's terminally online and has never touched grass, but it still very clearly has a model of the world.

I'd dare say it's probably a more accurate model than the average person has, too, thanks to having Wikipedia and such baked in.

by dgellow1 hours ago|

[-]

They aren’t smart, they approximate language constructs. They don’t have believes, ideas, etc. but have a few rounds of discussions with any LLMs and you see how they are probabilistic autocompletes based on whatever patterns from rounds of discussions you feed them

by lxgr1 hours ago|

[-]

At what point does autocomplete stop being "just autocomplete"?

Clearly there's a limit. For example, if an alien autocomplete implementation were to fall out of a wormhole that somehow manages to, say, accurately complete sentences like "S&P 500, <tomorrow's date>:" with tomorrow's actual closing value today, I'd call that something else.

by dgellow44 minutes ago|

[-]

You can call it however you want. The point of using the term autocomplete is to make the underlying technology relatable and remove the mystic from it. In any case, your alien autocomplete wouldn’t be an LLM if it can predict the future

> At what point does autocomplete stop being "just autocomplete"?

Every single discussion on the internet is a repeat of https://en.wikipedia.org/wiki/Loki%27s_wager it seems…

by bilekas2 hours ago|

[-]

> To me they seem to be pretty damn smart

That's the sorcery mentioned in the GP, the issue comes when people believe it to be smart however in reality it is just a next word prediction. Gives the impression it's actually thinking, and this is by design. Personally I think it's dangerous in the sense it gives users a false sense of confidence in the LLM and so a LOT of people will blindly trust it. This isn't a good thing.

by jeremyjh1 hours ago|

[-]

I'm curious how you think "word predictor" meaningfully describes an instruct model that has developed novel mathematical proofs that have eluded mathematicians for decades?

edit:

You cannot predict all the actions or words of someone smarter than you. If I could always predict Magnus Carlsen's next chess move, I'd be at least as good at chess as Magnus - and that would have to involve a deep understanding of chess, even if I can't explain my understanding.

I can't predict the next token in a novel mathematical proof unless I've already understood the solution.

by GigaDingus1 hours ago|

[-]

I think that's more of a limitation in how people think about word predictors

If you can predict the words a bright person will say about X... Isn't that some truly astounding tool? That could be used in myriad useful ways if one is a little creative with it

Since it's also "alien" it can also detect and explore paths that we simply haven't noticed since their biases aren't quite the same as ours

by 1 hours ago|

[-]

deleted

[-]

What's the difference between "smart" and "next word prediction", at this point? Back when they first came out, sure, but now they can write code and create art.

What would it take for you to concede a future model was smart?

by bilekas1 hours ago|

[-]

My personal take would always be that it produces something that isn't in the training set, ie: Demonstrable Creativity, or innovation.

For example, it's training set it purely engineering and code with general language data set, would be "aware" what art is, but has never seen an artistic image, aware what colours are and able to create something it never saw before.

Like a child with a paintbrush, there is an intuitive behavior that happens.

by handoflixue1 hours ago|

[-]

Can you name any examples of a human doing this? I learned about colors, color theory, and so forth in school. I've definitely seen artistic images before.

They can already create something they've never seen - you can prompt ChatGPT to generate images, and there's a few dedicated models for it: https://chatgpt.com/images/

Terence Tao feels like they've done innovative work on mathematics: https://www.scientificamerican.com/article/amateur-armed-wit...

by hansmayer1 hours ago|

[-]

How about writing "all code" this June, as Dario Amodei announced in January this year?

by steve19771 hours ago|

[-]

Are they smart or are they imitating things smart people did? (and if so, is there a difference?)

by nutjob23 hours ago|

[-]

LLMs are amazing. You can call them 'smart', but they're not intelligent and never will be.

They are useful but a cul de sac for heading toward AGI.

[-]

HN sober AI take of the day coming from a guy with nutjob for his handle, thank you.

by jiggawatts2 hours ago|

[-]

You can always redefine "intelligent" so that humans meet the requirements but AIs don't.

A better model to use is this: LLMs possess a different type of intelligence than us, just like an intelligent alien species from another planet might.

A calculator has a very narrow sort of intelligence. It has near perfect capability in a subset of algebra with finite precision numbers, but that's it.

An old-school expert system has its own kind of intelligence, albeit brittle and limited to the scope of its pre-programmed if-then-else statements.

By extension, an AI chat bot has a type of intelligence too. Not the same as ours, but in many ways superior, just as how a calculator is superior to a human at basic numeric algebra. We make mistakes, the calculator does not. We make grammar and syntax errors all the time, the AI chat bots generally never do. We speak at most half a dozen languages fluently, the chat bots over a hundred. We're experts in at most a couple of fields of study, the chat bots have a very wide but shallow understanding. Etc.

Don't be so narrow minded! Start viewing all machines (and creatures) as having some type of intelligence instead of a boolean "have" or "have not" intelligence.

by slumberlust1 hours ago|

[-]

> A calculator has a very narrow sort of intelligence.

Have you ever heard anyone refer to a calculator as intelligent?

These companies have a vested interest in making the product appear more human/smart than it is. It's new tech smeared with the same ole marketing matter.

by skydhash2 hours ago|

[-]

Would you say that a display and a printer are a perfect painter because they can render images? And a speaker is a very good musician because they can produce sound?

The LLM tasks is to produce a string of words according to an internal model trained on texts written by humans (and now generted by other LLMs). This is not intelligence.

[-]

Okay, but why isn't it "intelligence"? What part of the definition does it fail? What would convince you that you're wrong?

by skydhash1 hours ago|

[-]

I wouldn’t say it’s a general definition, but the consensus (according to my opinion) is that intelligence is being able to define problems (not just experience them), discern the root cause, and then solve that.

Where it fails is generally the first step. It’s kinda like the old saying “you have to ask the right question”. In all problem solving matters, the definition of problem is the first step. It may not be the hardest (we have problems that are well defined, but unresolved), but not being able to do it is often a clear indication of not being able to do the rest.

> What would convince you that you're wrong?

Maybe when I can have the same interaction as with my fellow humans, where I can describe the issue (which is not the problem) and they can go solve it and provide either a sound plan to make the issue disappear. Issue here refer to unpleasantness or frustrating situation.

Until then, I see them as tools. Often to speed up my writing pace (generic code and generic presentation), or as a weird database where what goes in have a high probability to appear.

by squidbeak2 hours ago|

[-]

Your argument doesn't seem to allow that the intelligence & versatility within that mystery could exceed ours to such a degree that AGI would be the only term that makes sense for it. By your own logic, if we don't understand how these things really work, it's foolish to declare there's a limit to their potential.

by jgilias1 hours ago|

[-]

It’s not sorcery tech at all. Nothing in their “goblin post mortem” is surprising the least bit if you have a working high-level mental model of what an LLM is.

It’s a fancy autocomplete that takes a bunch of text in and produces the most “likely” continuation for the source text “at once and in full”. So when you add to the source text something like: “You’re an edgy nerd”, it’s very much not surprising that the responses start referencing D&D tropes.

If you then use those outputs to train your base models further it’s not at all surprising that the “likely” continuations said models end up producing also start including D&D tropes because you just elevated those types of responses from “niche” to “not niche”.

The post-mortem is hilarious in that sense. “Oh, the goblin references only come up for ‘Nerdy’ prompt”. No shit.

by ZunarJ53 hours ago|

https://writings.stephenwolfram.com/2023/02/what-is-chatgpt-...

[-]

by chris_st1 hours ago|

[-]

...it came as a surprise that [leaving a Petri dish out with a window open] would end up with interesting [molds] (called [penicillin]). _It was not planned at all_.

by hypendev2 hours ago|

[-]

Not sure if we read the same post, as I cannot agree with this claim, especially under this post that exactly goes into details of what happened.

>LLM is a sorcery tech that we don't understand at all

We do, and I'm sure that people at OpenAI did intuitively know why this is happening. As soon as I saw the persona mention, it was clear that the "Nerdy" behavior puts it in the same "hyperdimensional cluster" as goblins, dungeons and dragons, orcs, fantasy, quirky nerd-culture references. Especially since they instruct the model to be playful, and playful + nerdy is quite close to goblin or gremlin. Just imagine a nerdy funny subreddit, and you can probably imagine the large usage of goblin or gremlin there. And the rewards system will of course hack it, because a text containing Goblin or Gremlin is much more likely to be nerdy and quirky than not. You don't need GPT 5 for that, you would probably see the same behavior on text completion only GPT3 models like Ada or DaVinci. They specifically dissect how it came to this and how they fixed it. You can't do that with "sorcery we dont understand". Hell, I don't know their data and I easily understood why this is going on.

>they want you to think that LLMs are smart beasts (they are not)

I mean, depends on what you consider smart. It's hard to measure what you can't define, that's why we have benchmarks for model "smartness", but we cannot expect full AGI from them. They are smart in their own way, in some kind of technical intelligence way that finds the most probable average solution to a given problem. A universal function approximator. A "common sense in a box" type of smart. Not your "smart human" smart because their exact architecture doesn't allow for that.

>and that we know what LLMs are doing (we don't)

But we do. We understand them, we know how they work, we built thousands of different iterations of them, probing systems, replications in excel, graphic implementations, all kinds of LLM's. We know how they work, and we can understand them.

The big thing we can't do as humans is the same math that they do at the same speed, combining the same weights and keeping them all in our heads - it's a task our minds are just not built for. But instead of thinking you have to do "hyperdimensional math" to understand them 100%, you can just develop an intuition for what I call "hyperdimensional surfing", and it isn't even prompting, more like understanding what words mean to an LLM and into which pocket of their weights will it bring you.

It's like saying we can't understand CPU's because there is like 10 people on earth who can hold modern x86-64 opcodes in their head together with a memory table, so they must be magic. But you don't need to be able to do that to understand how CPU's work. You can take a 6502, understand it, develop an intuition for it, which will make understanding it 100x easier. Yeah, 6502 is nothing close to modern CPU's, but the core ideas and concepts help you develop the foundations. And same goes with LLM's.

>personally side with Yann Le Cun in believing that LLM is not a path to AGI

I agree, but it is the closest we currently have and it's a tech that can get us there faster. LLM's have an insane amount of uses as glue, as connectors, as human<>machine translators, as code writers, as data sorters and analysts, as experimenters, observers, watchers, and those usages will just keep growing. Maybe we won't need them when we reach AGI, but the amount of value we can unlock with these "common sense" machines is amazing and they will only speed up our search for AGI.

by jeremyjh1 hours ago|

https://arxiv.org/html/2210.13382v5

[-]

We understand the low level details of how they are constructed. But we do not fully understand how higher-level behavior emerges - it is a subject of active research.

For example:

https://arxiv.org/abs/2109.06129

by hypendev1 hours ago|