It's a real concern! We take this stuff super seriously (
https://trust.mycroft.io/weave) and tbh most of our customers opt for the hosted version because it's much simpler on their end + they're already trusting us with a bunch of sensitive data.
But of course since the source is available you can also run it locally or self host