This is sort of what their first sentence states? Except your line implies that they are fast in both training and inference, while they imply they are focusing on inference and dropping training speed for it.

It's a nice opening as it is imo

reply
They don't say anything about dropping training speed.
reply
> a departure from Mamba-2, which optimized for training speed.

?

reply
The first sentence basically does though, no?
reply
Of course, my only objection was the language. LLMs are now old enough to leave the jargon behind and talk in simple, easy-to-understand terms.
reply
I’d argue the opposite: the terminology is fairly mainstream by now, and “inference” has a much more specific sense than “making predictions”.
reply
The blog is technical, so technical terms in the TL;DR seem relevant to me.
reply
I don't get the downvotes, as I had trouble understanding the intro as well. It seems it was written for a very specific audience.
reply
Yes, it is written for a specific audience.

That is not a reason for snark.

As other commenters have noted, it’s well written.

reply
> I don't get the downvotes

Because the blog post is a technical one, the intro contains very common jargon, and the proposed alternative was wrong.

reply
I don’t know why you’re being downvoted. As a longtime editor, I think your version is immensely better. Looks like the original was probably not human-written.
reply