Don't keep up. Much like with news, you'll know when you need to know, because someone else will tell you first.
reply
This is only good advice if you don't need to understand what's happening at the edge of the frontier. If you do, you lose the compounding benefit of staying engaged with the major developments.
reply
Not all developments are equal. Many are experimental branches, testing things out, that usually get merged back into the core, so to speak. For example, I knew someone who was all in on building their own harness, implementing the Ralph loop, and various other things, spending a lot of time on it. And now, guess what? All of that is in Claude Code or another harness, and I didn't have to spend any time on it, because ultimately they're implementation details.
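
For reference, the Ralph loop is basically just re-running an agent against the same prompt until the work converges. A minimal sketch in Python, with the `agent` CLI name and its exit-code convention made up purely for illustration:

    import subprocess

    # Minimal "Ralph loop": keep re-invoking a coding agent with the same
    # prompt until it signals there's nothing left to do. The `agent` CLI
    # and its exit-code convention are placeholders, not any real tool.
    with open("PROMPT.md") as f:
        prompt = f.read()

    while True:
        result = subprocess.run(["agent", "--prompt", prompt])
        if result.returncode == 0:  # assume 0 means "task complete"
            break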

It's like ricing your Linux distro: sure, it's fun to spend that time, but don't make the mistake of thinking it's productive. It's just another form of procrastination (or, to put it more charitably, a hobby).

reply
The players barely ever change. People don't have problems following sports; you shouldn't struggle so much with this once you accept that the top spot changes.
reply
I didn't express this well, but my interest isn't "who is in the top spot"; it's more the _why_ and _how_ of the results the various labs get. This is magnified by the fact that I'm interested not only in hosted inference providers but in local models as well. What's your take on the best model to run for coding on 24GB of VRAM locally after the last few weeks of releases? Which harness do you prefer? What quants do you think are best? To use your sports metaphor, it's not just following the national leagues but also following college and even high school leagues. And the real interest isn't even who's doing well but WHY, at each level.
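
To make the 24GB constraint concrete, here's the back-of-envelope math I use, treating quantization as bits per weight (illustrative only; KV cache, context length, and runtime overhead add several GB on top):

    # Rough VRAM estimate: weight memory ~= params x bits-per-weight / 8.
    # Numbers are illustrative; KV cache and overhead add several GB.
    def weight_gb(params_b: float, bits: float) -> float:
        return params_b * bits / 8  # ~1 GB per billion params at 8-bit

    for params_b, bits, label in [(32, 4.5, "32B at ~Q4"),
                                  (14, 8.0, "14B at Q8"),
                                  (70, 4.5, "70B at ~Q4")]:
        verdict = "fits" if weight_gb(params_b, bits) < 24 else "doesn't fit"
        print(f"{label}: ~{weight_gb(params_b, bits):.0f} GB weights, {verdict} in 24 GB")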
reply
The technical report discussing the why and how is here: https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro/blob/main...
reply
Follow the AI newsletters. They bundle the news along with op-ed commentary and summarize it better.
reply
Tips on which newsletters are worth signing up for?
reply
Can you suggest some good ones?
reply
I really like latent.space and simonwillison.net.

Also (shameless self-promo) I publish a 2x weekly blog just to force myself to keep up: https://aimlbling-about.ninerealmlabs.com/treadmill/

reply
It is funny seeing people ping-pong between Anthropic and ChatGPT, with similar rhetoric in both directions.

At this point I would just pick the one whose "ethics" and user experience you prefer. The difference in performance between these releases has had no impact on the meaningful work one can do with them, unless perhaps they're on the fringes of some domain.

Personally I'm trying out cloud-hosted open models, since I'm not interested in being rug-pulled by the big two providers. They've come a long way, and for all the work I actually trust to an LLM they seem to be sufficient.
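
The nice part is that most of these hosts expose an OpenAI-compatible endpoint, so switching is mostly a config change. Roughly like this, where the base URL, key, and model name are placeholders for whichever host you pick:

    from openai import OpenAI

    # Most open-model hosts speak the OpenAI-compatible chat API, so moving
    # off the big two is often just a base_url + model swap. Placeholders only.
    client = OpenAI(
        base_url="https://example-host.com/v1",  # hypothetical host
        api_key="YOUR_API_KEY",
    )

    resp = client.chat.completions.create(
        model="some-open-weights-model",  # whatever the host serves
        messages=[{"role": "user", "content": "Summarize this diff in one line."}],
    )
    print(resp.choices[0].message.content)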

reply
The financial projections that a big part of their valuation and investor story is built on involve actually making money, and lots of it, at some point. That money has to come from somewhere.
reply
I find ChatGPT mostly annoying.
reply
Open Settings > Personalization. Set the base style to "efficient". Turn off enthusiasm and warmth. You're welcome.
reply
Yeah, but even then it's still annoying. It's not the enthusiasm and warmth so much as the general tone.
reply
Setting “base style and tone” to “efficient” works fine for me.
reply
It honestly has all kinda felt like more of the same ever since maybe GPT-4?

New model comes out, has some nice benchmarks, but the subjective experience of actually using it stays the same. Nothing's really blown my mind since.

Feels like the field has stagnated to a point where only the enthusiasts care.

reply
For coding, Opus 4.5 in Q3 2025 was still the best model I've used.

Since then it's just been a cycle of the old model being progressively lobotomised and a "new" one coming out that, if you're lucky, might be as good as the OG Opus 4.5 for a couple of weeks.

Subjective, but as far as I can tell there's been no progress in almost a year, which is a lifetime on 2022-25 LLM timelines.

reply
Another annoyance (more on the API side) is summarized/hidden reasoning traces. It makes prompt debugging and optimization much harder, since you have very little visibility into the real thinking process.
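
For example, with the OpenAI Responses API (caveat: going from memory on the exact shapes here), the most you can get back is a summary of the reasoning, never the raw trace:

    from openai import OpenAI

    client = OpenAI()

    # Request reasoning summaries; the raw chain of thought is not returned,
    # so prompt debugging can only work from the summarized trace.
    resp = client.responses.create(
        model="o4-mini",
        reasoning={"effort": "medium", "summary": "auto"},
        input="Why does this regex fail on multiline input?",
    )

    for item in resp.output:
        if item.type == "reasoning":
            for s in item.summary:  # summary parts, not the actual trace
                print(s.text)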
reply
Holy shit, I'm right there with you.
reply