I don’t have the docs on-hand now on my cellphone, but here’s the kind of errors it was making (and blaming the document’s OCR from an original PDF scan):
View attachment 1770982
I expect the premium language model to recognise it is creating junk data: if it doesn’t know basic validation (like don’t place parts of physical addresses under a column labelled “Tel. no” and don’t place “Tel. Email” under the email column) then I can’t help it. It just doesn’t understand context correctly, and that’s gonna catch it out if it doesn’t scan for it and correct itself).