For best results, follow these general guidelines before placing TIF files in the “Learn” folder:

  • TIFs should be named with the .tif extension, not .tiff (some older versions of Ephesoft handle .tif only)
  • TIFs should be single-page, not multipage
  • TIFs should be flattened, not multilayered
  • Normalize TIFs to specific settings (compression: Group 4; bit depth: 1-bit, since Group 4 compression applies only to bilevel images)

Ideally, batch process all of your TIF images with the settings above to ensure consistency. This is especially important when the images come from sources beyond your control.
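Two of the checks above (the file extension and single page versus multipage) can be scripted before the files reach the “Learn” folder. The following is a minimal sketch in Python using only the standard library, not an Ephesoft tool. The function names are hypothetical. It assumes classic TIFF files (not BigTIFF) and counts pages by walking the chain of image file directories in each file. It does not check layers, compression, or bit depth.

```python
import struct
from pathlib import Path


def count_tiff_pages(path: Path) -> int:
    """Count pages by walking the chain of image file directories (IFDs)."""
    data = path.read_bytes()
    if data[:2] == b"II":      # little-endian byte order
        endian = "<"
    elif data[:2] == b"MM":    # big-endian byte order
        endian = ">"
    else:
        raise ValueError(f"{path} is not a TIFF file")
    magic, offset = struct.unpack_from(endian + "HI", data, 2)
    if magic != 42:            # classic TIFF only; BigTIFF uses 43
        raise ValueError(f"{path} is not a classic TIFF file")
    pages = 0
    seen = set()               # guards against a corrupt, looping IFD chain
    while offset and offset not in seen:
        seen.add(offset)
        (entries,) = struct.unpack_from(endian + "H", data, offset)
        # Each entry is 12 bytes; the next-IFD offset follows the entries.
        (offset,) = struct.unpack_from(endian + "I", data, offset + 2 + 12 * entries)
        pages += 1
    return pages


def normalize_extension(path: Path) -> Path:
    """Rename a .tiff file to .tif; leave any other file untouched."""
    if path.suffix.lower() != ".tiff":
        return path
    target = path.with_suffix(".tif")
    if target.exists():
        raise FileExistsError(f"{target} already exists")
    path.rename(target)
    return target


def check_folder(folder: Path) -> list[Path]:
    """Rename .tiff files to .tif and return the files that hold more than one page."""
    multipage = []
    for path in sorted(folder.iterdir()):
        if path.suffix.lower() not in {".tif", ".tiff"}:
            continue
        path = normalize_extension(path)
        if count_tiff_pages(path) > 1:
            multipage.append(path)
    return multipage
```

Calling `check_folder(Path(...))` on a staging folder returns the multipage files that still need to be split. Run it on a copy of the images first, because the rename happens in place.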


If you still have issues with “Learning” files, try the following:

Step-by-step guide

1. Press Create Folders in the batch class. Missing folders can cause errors.
2. Leave the good XML files from previous attempts in place.
3. Search for *.XML and delete files smaller than 2 KB. These are items that failed to process correctly.
4. Delete all existing .bd files: search the parent folder for *.bd, press CTRL+A, and delete the highlighted files.
5. Delete all existing .rsbatch files: search the parent folder for *.rsbatch, press CTRL+A, and delete the highlighted files.
6. Press Learn Files in the batch class.
7. Watch the Windows Task Manager patiently while the RecoStar process traverses the Lucene folders.
8. If the process stops on a folder for an extended period of time, that folder requires special handling.
9. You may need to use End Process on the RecoStar process in the Windows Task Manager to get the batch process to iterate past a problematic folder. By doing this repeatedly, you can get the system to learn files without the problematic folders. The resulting Lucene index is incomplete, so use this only to move through the overall process and identify problematic folders.
10. Repeat the special handling for each problematic folder in succession until you have successfully learned all files.
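The manual cleanup in steps 3 to 5 can also be scripted. The sketch below is one possible approach in Python, not an Ephesoft utility. The function names are hypothetical. It assumes that “2 KB” means 2048 bytes and that the search should cover all subfolders of the parent folder, as a Windows Explorer search does. It defaults to a dry run that only lists the files.

```python
from pathlib import Path

SMALL_XML_LIMIT = 2 * 1024            # assumption: "2 KB" means 2048 bytes
STALE_SUFFIXES = {".bd", ".rsbatch"}  # files removed in steps 4 and 5


def find_cleanup_candidates(parent: Path) -> list[Path]:
    """Collect small XML files plus all .bd and .rsbatch files under parent."""
    candidates = []
    for path in parent.rglob("*"):
        if not path.is_file():
            continue
        suffix = path.suffix.lower()  # match *.XML and *.xml alike
        if suffix in STALE_SUFFIXES:
            candidates.append(path)
        elif suffix == ".xml" and path.stat().st_size < SMALL_XML_LIMIT:
            candidates.append(path)
    return sorted(candidates)


def clean_learn_folder(parent: Path, dry_run: bool = True) -> list[Path]:
    """List the candidates, and delete them only when dry_run is False."""
    candidates = find_cleanup_candidates(parent)
    for path in candidates:
        if dry_run:
            print(f"would delete {path}")
        else:
            path.unlink()
    return candidates
```

Review the dry-run listing first, then call `clean_learn_folder(parent, dry_run=False)` to delete the files. XML files of 2 KB or more are left in place, which matches step 2.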


Walter Lee
