Overview

This feature helps in understanding the results generated or the classifications done by DOCUMENT_ASEMBLER plugin. The user can input some files for classification in the “test-classification” folder present in the batch class folder. The input tiff/pdf files could be single page or multi-page files. All the files present in the folder will be processed according to the settings in the batch class. All the required plugins which are required for DOCUMENT_ASEMBLER plugin to execute for a particular configuration need to be present in the batch class workflow. By default the classification set in DOCUMENT_ASEMBLER plugin will be performed. For further classifications a drop-down will be provided through which four set of classifications can be performed: Search Classification, Barcode Classification, Image Classification and Automatic Classification. The results are shown in a tabular form with following details about the images put for classification:

  1. Classification Type: The type of classification performed on the set of inputs.
  2. Document Type: the type of document classified for this input page.
  3. Document Identifier: The identifier of the document classified.
  4. Document Confidence: The confidence value for this document generated.
  5. Page Name: The name of the page given as input for classification.
  6. Page Identifier: The identifier of the page in accordance to the inputs provided.
  7. Page Classification: The classification done for the page. This contains the name of document type for which image is classified with the name of page type i.e. FIRST_PAGE, MIDDLE_PAGE or LAST_PAGE.
  8. Page Confidence: The confidence generated for this page after classification.
  9. Classification Sample: The name of the classification/learned file name from the “lucene-search-classification-sample” folders. This value is only populated in case of Search Classification. For all other classification it is set as “NA”.

 

400px-3.1_TableExtractionSugggestionBoxSwitch_1001

 

Characteristics

  • This feature provides the results of the classification performed on the provided set of inputs.
  • The classification results which would be generated via batch processing can be viewed in a tabular form.
  • Using this functionality, users can configure their batch class for optimum and best fit classification results. This classification can be executed for different settings and results can be compared.
  • The color code functionality highlights the document/page rows for which document confidence is lower than the threshold value of the classified document. These rows depict the documents which would require user interference in Review Validate modules.
  • The feature also provides us the functionality of download the resulting batch xml structured output which contains details of all the details of processing.
  • Clear button functionality clears all the input files, the results generated and backup folders for the processing.

Steps of execution/working

  • Input the files have to be put in the “test-classification” folder. If the folder is not existing then click of “Test Classification” button on Document Type tab will generate the folder itself.
  • After copying the files for classification in the “test-classification” folder, open the batch class in which the files have been put.
  • In the batch class configure the workflow in which classification results need to be tested. Set the properties of different plugins required for proper classification of input files into documents.
  • On click of classification button the classification type set in the batch class for DOCUMENT_ASSEMBLER plugin is executed.
  • For checking results of any other classification for same inputs, set the desired classification type in the drop-down and click “Classify” button. On click of “Classify” button the selected classification is executed on the input images and the results are displayed similarly.
  • The results of classification can be downloaded by the click of “Download” button.
  • The input files could be cleared by use of “Clear” button.

Troubleshooting

Following are few common error messages received due to mal-functioning of the plugin:

 

Just test table
S. No. Error message Possible root cause
1 Unable to perform Classification. Test Classification Folder does not exist. The folder is created now. Please put files for classification. There is no "test-classification" folder in the batch class folders in Shared Folders. The folder is now generated.
2 There are no proper input files present. Please upload tif, tiff or pdf files. There are no proper input files present in the "test-classification folder" for classification.
3 Incomplete properties of the Search Classification plugin for the specified batch class. Plugin configuration is incorrect or the plugin does not exist in the batch class workflow.
4 Problem fetching the properties of Barcode plugin. Please verify the plugin in batch class. Plugin configuration is incorrect or the plugin does not exist in the batch class workflow.
5 Please check Classify Images plugin in Page Process module for its properties in batch class. Plugin configuration is incorrect or the plugin does not exist in the batch class workflow.

 

Was this article helpful to you?

wikiadmin

Comments are closed.