4n6geek0, posted October 31, 2014:
Can Intella index a PDF of a scanned document? What are the limitations on PDF indexing?
admin, posted October 31, 2014:
Hi 4n6geek0, Intella can index all PDFs except image-only scans and content-protected files. For those you need OCR. Intella can work in conjunction with tools like ABBYY when you do need to OCR such documents.
Chris, posted November 3, 2014:
Note that a lot of scanners nowadays do the OCR on the fly as part of the scanning process and add the extracted text to the PDFs that they create. You can easily recognize such documents: they show the scanned image, but the blurry text in it can still be selected when viewed in e.g. Acrobat. When this is the case, you don't need to redo the OCR; Intella should extract the text added by the scanner.
4n6geek0, posted November 12, 2014:
Is there any way I can easily identify that a PDF item contains a scanned image?
admin, posted November 12, 2014:
You can do this via the Empty document facet, or by loading all PDFs and then viewing their thumbnails.
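If you want to triage PDFs outside Intella, the same "no extracted text" idea can be sketched in a few lines of Python. This is only a rough illustration, not how Intella's Empty document facet is implemented: it assumes you have already extracted the text of each page with a PDF library of your choice, and the function names and the 20-character threshold are made up for the example.

```python
from typing import List, Sequence


def _visible_chars(text: str) -> int:
    """Count characters after discarding all whitespace."""
    return len("".join(text.split()))


def is_probably_image_only(page_texts: Sequence[str], min_chars: int = 20) -> bool:
    """Guess whether a PDF lacks a usable text layer.

    page_texts holds the text already extracted from each page.
    min_chars is an arbitrary, hypothetical threshold; tune it for your data.
    """
    total = sum(_visible_chars(text) for text in page_texts)
    return total < min_chars


def pages_without_text(page_texts: Sequence[str], min_chars: int = 20) -> List[int]:
    """Return 1-based page numbers whose extracted text is below the threshold."""
    return [
        number
        for number, text in enumerate(page_texts, start=1)
        if _visible_chars(text) < min_chars
    ]
```

A document flagged this way is a candidate for OCR. A scan whose scanner already added a text layer will not be flagged, which matches Chris's point above that those files do not need to be OCRed again.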