> Wait, really? I honestly would have thought this was a solved problem by now, especially the high-quality transcription bit. Just out of curiosity, is the problem that the quality isn't high enough?
If I had to guess, all of those apps are probably vibecoded, hence the variable quality.