undefined

points

[-]

I'm trying to recall a quote. Some war where all defeats were censored in the news, possibly Paris was losing to someone. It was something along the lines of "I can't help but notice how our great victories keep getting closer to home".

Last year I tried using an LLM to make a joke language, I couldn't even compile the compiler the source code was so bad. Before Christmas, same joke language, a previous version of Claude gave me something that worked. I wouldn't call it "good", it was a joke language, but it did work.

So it sucks at writing a compiler? Yay. The gloriously indefatigable human mind wins another battle against the mediocre AI, but I can't help but notice how the battles keep getting closer to home.

by sjsjsbsh6 hours ago|

parent|

[-]

> but I can't help but notice how the battles keep getting closer to home

This has been true for all of (known) human history. I’m gonna go ahead and make another bold prediction: tech will keep getting better.

The issue with this blog post is it’s mostly marketing.

by sebzim45007 hours ago|

prev|

[-]

Can one man really make a C compiler in one week that can compile linux, sqlite, etc.?

Maybe I'm underestimating the simplicity of the C language, but that doesn't sound very plausible to me.

by dmitrygr7 hours ago|

parent|

[-]

yes, if you do not care to optimize, yes. source: done it

by Philpax7 hours ago|

parent|

[-]

I would love to see the commit log on this.

by rustystump6 hours ago|

parent|

[-]

Implementing just enough to conform to a language is not as difficult as it seems. Making it fast is hard.

by dmitrygr7 hours ago|

parent|

prev|

[-]

did this before i knew how to git, back in college. target was ARMv5

by Philpax6 hours ago|

parent|

[-]

Great. Did your compiler support three different architectures (four, if you include x86 in addition to x86-64) and compile and pass the test suite for all of this software?

> Projects that compile and pass their test suites include PostgreSQL (all 237 regression tests), SQLite, QuickJS, zlib, Lua, libsodium, libpng, jq, libjpeg-turbo, mbedTLS, libuv, Redis, libffi, musl, TCC, and DOOM — all using the fully standalone assembler and linker with no external toolchain. Over 150 additional projects have also been built successfully, including FFmpeg (all 7331 FATE checkasm tests on x86-64 and AArch64), GNU coreutils, Busybox, CPython, QEMU, and LuaJIT.

Writing a C compiler is not that difficult, I agree. Writing a C compiler that can compile a significant amount of real software across multiple architectures? That's significantly more non-trivial.

by bwfan1234 hours ago|

prev|

[-]

> I can already feel the contracts coming to fix LLM slop

First, the agents will attempt to fix issues on their own. Most easy problems will be fixed or worked-around in this manner. The hard problems will require a deeper causal model of how things work. For these, the agents will give up. But, the code-base has evolved to a point where no-one understands whats going on including the agents and its human handlers. Expect your phone to ring at that point, and prepare to ask for a ransom.

by small_model7 hours ago|

prev|

[-]

Claude is only a few years old so we should compare it to a 3 year old human's C compiler

by notnullorvoid1 hours ago|

parent|

[-]

Claude requires many lifetimes worth of data to "learn". Evolution aside humans don't require much data to learn, and our learning happens in real-time in response to our environment.

Train Claude without the programming dataset and give it a dozen of the best programming books, it'll have no chance of writing a compiler. Do the same for a human with an interest in learning to program and there's a good chance.

by zephen6 hours ago|

parent|

prev|

[-]

Claude contains the entire wisdom of the internet, such as it is.

by sjsjsbsh7 hours ago|

prev|

[-]

> I can already feel the contracts coming to fix LLM slop like this when any company who takes this seriously needs it maintained and cannot

Honest question, do you think it’d be easier to fix or rewrite from scratch? With domains I’m intimately familiar with, I’ve come very close to simply throwing the LLM code out after using it to establish some key test cases.

by dmitrygr6 hours ago|

parent|

[-]

Rewrite is what I’ve been doing so far in such cases. Takes fewer hours