For general use, I personally don’t see much justification for paying a per-token fee just to avoid creating a few accounts with my trusted providers and adding them to an instance for users. It is transparent to users beyond them having a single internal API key (or multiple, if you want to track per-app usage) for all the models they have access to, with limits and logging. They wouldn’t even need to know which provider is hosting the model, and the underlying provider could be swapped without users noticing.
It is certainly easier to pay a per-token fee on a small scale and not have to run an instance, so less technical users could definitely find an advantage in just sticking with OpenRouter.
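A minimal sketch (all names hypothetical) of what such an instance does: each app holds one internal key, and a public model alias maps to whichever upstream provider currently backs it, so the provider can be swapped without users noticing.

```python
# Hypothetical internal-key and alias tables; in a real proxy (e.g.
# LiteLLM) these live in its config/DB, not in code.
INTERNAL_KEYS = {"sk-int-app1": "app1", "sk-int-app2": "app2"}  # per-app keys
MODEL_ALIASES = {"chat-large": ("openai", "gpt-4o")}  # alias -> (provider, model)

def route(internal_key: str, model_alias: str) -> tuple[str, str, str]:
    """Resolve an internal key and model alias; users never see the provider."""
    app = INTERNAL_KEYS.get(internal_key)
    if app is None:
        raise PermissionError("unknown internal key")
    provider, upstream_model = MODEL_ALIASES[model_alias]
    return app, provider, upstream_model
```

Swapping the upstream is then one change to `MODEL_ALIASES`, invisible to every key holder.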
1. The LLM provider doesn't know it's you (unless you have personally identifiable information in your queries). If N people are accessing GPT-5.x using OpenRouter, OpenAI can't distinguish the people. It doesn't know if 1 person made all those requests, or N.
2. The ability to ensure your traffic is routed only to providers that claim not to log your inputs (not even for security purposes): https://openrouter.ai/docs/guides/routing/provider-selection...
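For point 2, here's roughly what that looks like in a request body, going by my reading of the provider-selection doc linked above (the `provider` field names and the model slug are assumptions to verify against those docs):

```python
import json

# Restrict routing to providers that claim not to retain inputs;
# "data_collection" / "allow_fallbacks" are taken from OpenRouter's
# provider-selection docs (treat as assumptions, check the link above).
payload = {
    "model": "openai/gpt-5.1",  # hypothetical model slug
    "messages": [{"role": "user", "content": "hi"}],
    "provider": {
        "data_collection": "deny",  # skip providers that log inputs
        "allow_fallbacks": False,   # fail rather than reroute elsewhere
    },
}
body = json.dumps(payload)  # what you'd POST to the chat completions endpoint
```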
It's been forever since I played with LiteLLM. Can I get these with it?
FWIW this is highly unlikely to be true.
It's true that the upstream provider won't know it's _you_ per se, but most LLM providers strongly encourage proxies like OpenRouter to distinguish between downstream clients for security and performance reasons.
For example:
- https://developers.openai.com/api/docs/guides/safety-best-pr...
- https://developers.openai.com/api/docs/guides/prompt-caching...
For prompt caching, OpenRouter already says it's permitted and doesn't count it as "logging" (i.e. even with zero data retention turned on, requests will still go to providers that do prompt caching).
The underlying provider can still rate-limit you. What OpenRouter provides is automatic switching between providers for the same model.
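A rough sketch of that switching (hypothetical names; the real routing also weighs price and uptime, not just rate limits):

```python
class RateLimited(Exception):
    """Raised by a provider call when its rate limit is hit."""

def complete_with_fallback(providers, send):
    # Try each provider backing the same model in order; fall through
    # on rate limits, re-raising the last error if all are exhausted.
    last_err = None
    for provider in providers:
        try:
            return send(provider)
        except RateLimited as err:
            last_err = err
    if last_err is None:
        raise ValueError("no providers configured")
    raise last_err
```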
(I could be wrong.)
Not true in any non-startup where there is an actual finance department.
If you're only using flagship model providers, then OpenRouter's value-add is a lot more limited.
The minus is that context caching works only moderately well at best, rendering the savings nearly useless.
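For reference, here's what an explicit cache breakpoint looks like for Anthropic-style models through this kind of API layer, as I recall the caching docs (the model slug is hypothetical, and exact `cache_control` support varies by provider, which is part of why the savings rarely materialize):

```python
# A large shared prefix is marked cacheable so later requests can
# reuse it; whether the routed provider honors this is the gamble.
payload = {
    "model": "anthropic/claude-sonnet-4",  # hypothetical slug
    "messages": [
        {
            "role": "system",
            "content": [
                {
                    "type": "text",
                    "text": "LONG SHARED CONTEXT ...",
                    "cache_control": {"type": "ephemeral"},  # cache up to here
                }
            ],
        },
        {"role": "user", "content": "question about the context"},
    ],
}
```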
But if OpenRouter does better (even though it's the same sort of API layer) maybe it's worth it?
well worth the 5% they take
if that wasn't the reason, hey that's actually a great way to launder money (not financial advice).
Or what are you really saying here? I don't understand how that's related to "you don't have the right to do what you want with the API Key", which is the FUD part.
Quote from their own ToS: "access the Site or Service for purposes of reselling API access to AI Models or otherwise developing a competing service;"
When you say "you don't have the right to do what you want with the API key," it makes it sound like specific use cases are disallowed, or something similar. "You don't have the right to go against the ToS, and they'll block you if you do!" would have been very different, and of course it works like that.
Bit like complaining that Stripe is preventing you from accepting credit card payments for narcotics. Yes, just because you have an API key doesn't mean somehow you can do whatever you want.
Are we allowed, yes or no, to make a service that charges end-users per token, like giving end-users access to Kimi K2.5 through OpenRouter on a pay-per-token basis?