Comparison and Verification Tools

PDF Processor provides specialized tools for verifying extraction accuracy and comparing different versions of documents.


1. Visual Diff View

The Visual Diff tab is essential for verifying that the extraction engine has captured the document layout accurately.


2. Compare Diff View

The Compare Diff tab is activated when you load a second PDF file. It is designed for document versioning and change tracking.

Comparing Two PDFs

  1. Load your primary document using Open PDF.
  2. Load the second version using Compare PDF (✚).
  3. The Compare Diff tab will become active.

View Options

Diff Precision


3. Extraction Verification Workflow

To ensure 100% accuracy in your extracted documents:

  1. Perform the extraction.
  2. Switch to Visual Diff to check for missing paragraphs or misaligned tables.
  3. Use the HTML view for final proofreading.
  4. If comparing versions, use Compare Diff to ensure that changes in the PDF source are reflected correctly in your structured output.
Use the Status Bar at the bottom of the screen to monitor extraction progress. If the progress hangs, check the original PDF for excessive image-based text which may require more time for OCR processing.