Folder Import

Plugin Name Notes Can I remove this?
IMPORT_MULTIPAGE_FILES It makes sure all the files are single pages which is a requirement for Ephesoft. Red
IMPORT_BATCH_FOLDER Do not remove this plugin. This plugin creates the BI folder and copies the single page tiff files into it.
Red.gif

Page Process

Plugin Name Notes Can I remove this?
CREATE_OCR_INPUT This can be removed if the client is not processing color documents. Green
RECOSTAR_HOCR This can be removed if Tesseract is used for Full Page OCR.
Green.gif
TESSERACT_HOCR This can be removed if Recostar is used for Full Page OCR.
Green.gif
NUANCE_HOCR This plugin is used only in the Linux Version. It can be removed if there is no requirement for OCR
Green.gif
CREATE_DISPLAY_IMAGE This can be removed ONLY if this batch class does not contain Review and Validation.
Green.gif
CREATE_THUMBNAILS This can be removed ONLY if this batch class does not contain Review and Validation.
Green.gif
CLASSIFY_IMAGES This can be removed if Image/Layout based Classification is not used.
Green.gif
HTML_TO_XML This plugin is used for content/search classification and free form extraction. It can be removed if your on Ephesoft version 3.1 and higher. There is no need for this plugin in the later versions
Green.gif
BARCODE_READER This can be removed if barcode classification is not used. Please note that RECOSTAR_HORC can also do barcode detection. If Recostar is configured for Barcode, this plugin should be removed.
Green.gif
SEARCH_CLASSIFICATION This can be removed if content based classification is not used.
Green.gif
KV_PAGE_PROCESS This can be removed if Extraction based classification is not used.
Green.gif
PAGE_PROCESS_SCRIPTING_PLUGIN This can be removed if there is no scripting needed at the Page Processing level.
Green.gif

Document Assembly

Plugin Name Notes Can I remove this?
DOCUMENT_ASSEMBLER This plugin decides where the documents starts and ends based on classification methods used in Page Processing i.e. Barcode, Search/Content, Image Classification. Do not remove this plguin.
Red.gif
DOCUMENT_ASSEMBLER_SCRIPTING_PLUGIN This can be removed if there is no scripting needed at the Document Assembly level.
Green.gif

Review

Remove this module and plugin in it ONLY if there is no reason to review documents after classification. We recommend keeping this module and plugin if the client is using Search Classification.

Extraction

Plugin Name Notes Can I remove this?
BARCODE_EXTRACTION This can be removed if barcode extractions not used or is handled by Recostar Extraction Plugin.
Green.gif
RECOSTAR_EXTRACTION This can be removed if there are no fixed form projects assigned to any document type.
Green.gif
REGULAR_REGEX_EXTRACTION This can be removed if there are no Patterns defined at each Index fields.
Green.gif
KEY_VALUE_EXTRACTION This can be removed if there are no Free Form Extraction, Key Value Extraction, defined for any field.
Green.gif
FUZZYDB This can be removed if there are no document type configured to match content to database.
Green.gif
TABLE_EXTRACTION This can be removed if no document type is configured to capture line items.
Green.gif
EXTRACTION_SCRIPTING_PLUGIN This can be removed if there is no scripting needed at the Extraction level.
Green.gif

Automated Validation

Plugin Name Notes Can I remove this?
AUTOMATED_REGEX_VALIDATION This plugin decides if the document should go to Validation or not based on index field validation rules/regular expressions. This can be removed if there are no document index fields for any document type.
Green.gif
AUTOMATED_VALIDATION_SCRIPTING_PLUGIN This can be removed if there is no scripting needed at the Automated Validation level.
Green.gif
  • If both plugins are removed, module itself can also be removed.

Validation

Remove this module and plugin in it only if there are no document level fields for any of the document types.

Export

Plugin Name Notes Can I remove this?
EXPORT_SCRIPTING_PLUGIN This can be removed if there is no scripting needed at the Export level.
Green.gif
CREATEMULTIPAGE_FILES Do not Remove this Plugin. It creates multi page files (PDF/TIF) for each document.
Red.gif
CSV_FILE_CREATION_PLUGIN This can be removed. It creates a special CSV file export.
Green.gif
TABBED_PDF This can be removed, it creates a PDF with Tabs. Needs to be configured. Used in Mortgage solutions.
Green.gif
IBM_CM_PLUGIN This can be removed if the client is not exporting documents to IBM Content Manager.
Green.gif
COPY_BATCH_XML This plugin exports PDFs and TIFFs and XML files to final drop folder. It can be removed if documents are exported to another repository using other export plugins such as CMIS/Alfresco.
Green.gif
DOCUSHARE_EXPORT This can be removed if the client is not exporting documents to Xerox Docushare.
Green.gif
CMIS_EXPORT This can be removed if the client is not exporting documents to CMIS Compliant repository.
Green.gif
FILEBOUND_EXPORT This can be removed if the client is not exporting documents to Filebound Repository.
Green.gif
NSI_EXPORT This can be removed if the client is not exporting documents to NSI Auto Store.
Green.gif
KEY_VALUE_LEARNING_PLUGIN This can be removed if the client does not want Ephesoft to create Key Value Extraction rule based on Validation users input.
Green.gif
DB_EXPORT This can be removed if the client is not exporting documents fields to Database table.
Green.gif
CLEANUP Do not Remove. It cleans the BI and Import folders after successful export.
Red.gif

Notes

  • Once all the desired plugins and modules are removed, one must click on Validate and Deploy Workflow buttons for changes to take effect for all the new batch instances. See How_To?#How_To_Remove_Plugins_from_Workflow_V3_onwards for detailed instructions.
  • Some plugins cannot be removed because it may have dependencies to each other. Dependencies can be modified by going to Workflow Management Tab. Once the dependencies are removed, Plugins can also be removed from the batch class.
  • CREATE_THUMBNAILS Plugin creates two thumbnail size images one for display and one for image classification. If Image Classification is not used, The switch can be turned on which will prevent thumbnails to be created for image classification.
  • When a plugin is in the workflow but it is turned off, it is still executed. Removing unnecessary plugins from the ephesoft capture workflow will allow workflow to be executed faster.

In regards to processing a single batch at a time

You can control how many batches Ephesoft will process at the same time in the dcma-workflow.properties file. It is called Pickup Service because it picks the batches up and processes it. We recommend keeping the Pickup settings in this file to be 3 or 4. Having said that in the same file, you will also find a CronJob for the pickup. The default value is to pickup and process batches every minute on the 45th second. Which means if a batch is imported at 46th second, it is possible that Ephesoft will wait 59 seconds to process this batch. To shorten the time, you can configure this cron job to pickup batches every 10 seconds using  dcma.pickup.cronjob.expression=0/10 0/1 * ? * *

Was this article helpful to you?

Walter Lee

Comments are closed.