Acrobat XI OCR: access hidden layer?
Hi all,
I have historic documents (German, English) that I want to OCR so the text is searchable *without* changing their appearance. I've tried this with previous versions of Acrobat, which did not quite work, so I thought I'd give Acrobat XI (Windows 7) a try.
I use "Searchable Image" in the correct language ("ClearScan" and "Exact" are not useful here). The OCR'ed text is in a hidden layer. Since an old-fashioned font is used, the OCR result is expectedly faulty. When I need to correct the results, problems arise. A little sub-menu allows me to have "problem areas" marked in red. Individual entries can be corrected one by one. However, these changes do not seem to be transferred to the hidden layer. This is evident either when trying to search for a term (Ctrl+F), or when exporting to Word. Both yield the original, not the corrected, OCR result.
Second problem: once I mark a problem area as solved, there is no way to access that word again, other than starting all over again.
Third problem: the keyboard shortcuts in the submenu don't work.
Improving the scan quality is no solution, because the older characters have no equivalent anyway.
The solution, it seems to me, would be to access the hidden text in some way and edit it directly there. I did not find any mention of this in Acrobat's help or the forums, however. I expect it's still not possible?
(As an aside, I'd submit the problem to Adobe, but I don't know how.)
What you're asking for is not possible with the native tools in the Acrobat family, but it can be done with plugins.
When you run Searchable Image on a file, the text objects created on the page have a tag that sets text rendering mode 3 (which tells compliant PDF applications that they are invisible but selectable; in effect they have no active stroke or fill). Acrobat doesn't let you change the text rendering mode, but there are third-party 'COS editor' plugins (Google for vendors) that can change any tags in the file stream. You don't have 'real' text in the file, but a pseudo-workflow would be to search and replace the "/tr/3" tag with "/tr/0" so the text is visible, edit the stuff, then reset it.
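For anyone who wants to see what that search and replace amounts to, here is a rough sketch in Python. It assumes you already have a page's content stream as decoded (uncompressed) bytes, for example copied out of a COS editor; it does not open or parse a PDF file, and page streams are usually Flate-compressed, so running it on the raw file will not work. In a content stream the rendering mode is normally written as the operand followed by the operator, e.g. `3 Tr`, so that is what the pattern looks for. The function names and the sample stream are made up for illustration, and a naive regex like this could also hit a literal "3 Tr" inside a text string.

```python
import re

# Operand followed by the Tr operator, e.g. b"3 Tr".
# The lookarounds keep "13 Tr" or a longer operator name from matching.
_MODE_3 = re.compile(rb"(?<![\w.+-])3(\s+)Tr(?![A-Za-z])")
_MODE_0 = re.compile(rb"(?<![\w.+-])0(\s+)Tr(?![A-Za-z])")


def show_hidden_text(stream: bytes) -> bytes:
    """Switch rendering mode 3 (invisible) to 0 (fill) in a decoded stream."""
    return _MODE_3.sub(rb"0\1Tr", stream)


def hide_text(stream: bytes) -> bytes:
    """Switch rendering mode 0 back to 3. Note: this also hides any text
    that was visible (mode 0) before, so only use it on OCR-only streams."""
    return _MODE_0.sub(rb"3\1Tr", stream)


# Hypothetical content stream, roughly what an OCR text object looks like.
sample = b"BT /F1 12 Tf 3 Tr 72 700 Td (Hello) Tj ET"
visible = show_hidden_text(sample)
print(visible)
```

After editing the now-visible text, `hide_text(visible)` would put the mode back to 3, which is the "reset it" step.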