upvote
This is already the case with genealogical sites that have ML OCR creating searchable indices of handwritten documents.
reply
>automate transcription

this also means trusting the LLM to decide what things mean. but there is very likely a great middle ground of having LLMs take their best guesses and then verifying the output on significant finds. the risk is in LLM understating something important, false negatives, leading to putting stuff at the bottom of the pile that appears mundane but isnt

reply