The latest version of our solution, DocsQuality, has been enhanced with the OCRIndex value calculation feature. This enables users to verify whether a PDF file submitted to an LLM model or a document circulation system will be correctly processed by an Optical Character Recognition (OCR) engine.
OCRIndex is a numerical measure indicating how well OCR (Optical Character Recognition) software can read text from electronic documents, including images or scanned writings. It considers the image quality, particularly font characteristics, and detects document defects such as compression, blurring, contrast, etc. A higher OCRIndex suggests a higher likelihood of accurate character recognition.