Or something that tries to teach 60 languages but does so poorly ?
It's a browser extension that finds English sentences in webpages, and translates the ones at your difficulty level into the language you're learning.
"Translate Isolated Words" allows it to translate "sentences" of only one word, but it doesn't disable full sentences.
And yeah, atm it word splits by spaces for the dictionary. I hadn't thought to do it with LLMs, though that's a good idea. There's a somewhat related problem when doing Furigana, where it has a hashmap of strings-to-pronunciations, and it starts with a 4-character sliding window looking for matches, then a 3 character, etc.
- There's a global blacklist of sites, as well as phrases in the title/URL (e.g. "bank")
- You can blacklist sites yourself
- Each sentence is run against filters checking for medical/legal/etc info, as well as checks for addresses, card/social security numbers, etc. All the checks are done client side
- There are also some special implementations, e.g. it looks at the source code of websites to work out if they're an instance of an American health portal that I've forgotten the name of - each doctor's surgery self-hosts it.
- Websites can add `nuenki-ignore=true` on their end, if they'd like to disable it.
And of course it doesn't log anything, though there is an anonymous cache in order to make it economical.
At the moment I'm focused on translation quality, but I intend to add that.