upvote
Firstly, congrats! As a Brit this looks cool, and I'm happy to see it. I wish you every success.

Secondly: I get that 'sovereign' is probably an important sales term for your company. But this, in common with the government's 'sov/ai' fund, does not deserve to be described as sovereign. This is other countries' models served on chips designed and manufactured abroad, powered by a grid which imports 44% of its power.

Of course this isn't your company's fault. Last week I went to an event where the sovereignai.gov.uk people presented. In a very Keir Starmer way (spiritually, he wasn't there), they said in as many words 'oh but I'm sure all reasonable people would agree _really_ sovereign AI would be too hard. So let's all agree to pretend that just popping a bit more money into the AI startup ecosystem is a sovereign AI strategy'.

I'm unsure if the UK does need to be sovereign in anything; it certainly doesn't seem to want to be. But I will continue to poke fun at anything using the pompous phrase 'sovereign' for anything that isn't.

If sovereign AI is a problem you're in earnest about, I hope you go after it seriously, and fix the rest of the stack. I'll cheer you on!

reply
Sovereign capability just means that no foreign government can pull the plug. Who cares where it was copied from?

If it were somehow legal for a company to provide MS Office (not a clone) fully in the UK with no control from Microsoft, that would also count as a sovereign capability, even though none of the code was written in the uk.

Maybe that's not how you like the term to be used but it's widely used that way and widely understood.

reply
> Sovereign capability just means that no foreign government can pull the plug

OK, fair enough on 'pull the plug ~instantly'. But models and chips age fast. If another country can stop you getting new models and chips, this means you're sovereign in state-of-the-art AI for only a window of a year or two (maybe this will widen if model progress tails off).

If it is a short window, strategically, that doesn't seem worth much given the timelines of: a) inter-state conflict, or trade wars b) cold-start time to be able to make your own models and chips

> Maybe that's not how you like the term to be used but it's widely used that way and widely understood.

Noted. But as a data-point, the audience at the event I mentioned (various AI builders and founders) made it clear from their questions to the speaker that the 'sovereign' that sov/ai was aiming at was hollow, for exactly the reasons I've stated.

reply
> But models and chips age fast. If another country can stop you getting new models and chips, this means you're sovereign in state-of-the-art AI for only a window of a year or two

OK, that's a fair point.

reply
The UK grid does not import 44% of its energy.

Gell-Mann suggests I should treat the rest of your post with skepticism.

EDIT: maybe you meant the UK total primary energy? I feel like that's extending the boundary a bit far. Should we start digging for uranium? Or stick to renewables, but only with locally sourced silica and rare earths?

reply
If I could give prizes for comments you'd get one. Too much fart sniffing goes on in these parts, it's always a pleasant change to see dissent
reply
Tbf the title only says sovereign _inference_
reply
Hey Ben. I find communication like this fairly off-putting. In so far the 80% cheaper per token (or any part of it) is something of your own making/ingenuity, by all means, do tell, but it requires comparing token cost fairly with comparable models on i.e. OpenRouter and not across different models and pretending it's the same thing.
reply
Marketing aside, why are you using the term "UK sovereign"?

I assume UK based DCs, so why not just say that, UK based LLM inference.

Is it a DC owned/ran by HM Gov? Is that why it's sovereign?

Not a criticism, more of a critique.

reply
Much the same way the word "patriotism" is more common in US national discourse, "sovereignty" is very common in UK national discourse.
reply
You're thinking of when chavs used to wear sovereign rings... it's fell out of fashion now tbh
reply
Data Sovereignty as a term is now fairly well established term that doesn't have specific government connotations e.g. https://events.linuxfoundation.org/kubecon-cloudnativecon-eu...
reply
He does say that? "running on NVIDIA Blackwell GPUs in the UK" is in there and that's pretty unambiguous.
reply
The problem is if those GPUs are running on an AWS server (or any other American provider), even if it the server is in the UK the sovereignty claim is null and void.
reply
Doesn't "We built it on fully UK sovereign cloud infrastructure, so data never leaves UK jurisdiction" cover that?
reply
In theory it should, but I've seen that language describing Azure "sovereign cloud" servers before. The data might indeed be stored in the UK, the problem is the CLOUD act which supersedes it.
reply
Is your business plan essentially to run mid tier models on hardware in the UK?

I do see the value in this as some enterprises need local data residency, the UK energy grid realistically can't handle new multi GW xAI-style data centres, and many applications don't need frontier models (but do need more than small local ones).

reply
Hi Ben - how are you positioning yourself vs LocAI? I had a few chats with them and they have a fairly similar pitch.
reply
We’re closely partnered with the LocAI model lab, we’re looking forward to running their models on the platform in the next few months!
reply
Ah great! Best of luck. They're a nice bunch.
reply
So the pricing is 12.50/month for unlimited chat, or 60p per million tokens output/10p per million input? For use with a coding assistant it would be the latter?
reply
good prices, but I don't see info on the token cache hits prices (in/out), are they available?.
reply
Congrats on the launch. More options for consumers in this space the better!
reply
why use this over openrouter?
reply
I'd expect for workflows where there is value in knowing that the data is processed in the UK. From a contractual/data protection standpoint, that could be very useful, depending on the use case.
reply
Avoiding routing through US or US-based companies.
reply
openrouter is an US based company, so falls under CLOUD Act
reply
good question, it's going to take a lot to dislodge openrouter from my workflows
reply