1. Nuance

Nuance OCR Command: <Path to NuanceOCR executable> <path to input image> <path to output HOCR xml file> <path to Nuance lczx license file> <Page No Identifier (used to assign PG ID value in HOCR)> <path to Nuance Settings file i.e. SETTINGS.STS> <Auto Rotate Switch> <OCR Confidence Switch> <Optional ZON file if NUANCE extraction is to be used> <Optional NUANCE Extraction Switch> Example: ./NuanceOCR ./a-0001.tif NuanceOCR.xml ./lcxz PG0 ./SETTINGS.STS ON ON

A

Nuance Extraction Command: <Path to NuanceOCR executable> <path to input image> <path to output HOCR xml file> <path to Nuance lczx license file> <Page No Identifier (used to assign PG ID value in HOCR)> <path to Nuance Settings file i.e. SETTINGS.STS> <Auto Rotate Switch> <OCR Confidence Switch> <ZON file if NUANCE extraction is to be used> <NUANCE Extraction Switch> Example: ./NuanceOCR ./a-0001.tif NuanceExtraction.xml ./lcxz PG0 ./SETTINGS.STS ON ON ./docsample.zon ON

B
2. Ghostscript

PDF to TIFF Conversion Command: gs -dNOPAUSE -r300 -sDEVICE=tiff24nc -sCompression=lzw -dBATCH -sOutputFile=”output-tiff-filename-%04d.tif” “input pdf file path” In Linux, Ephesoft uses –sDEVICE as tiff24nc instead of tiffscaled24 as GS in Linux does not support tiffscaled24 device. Thus default Batch Classes in Linux uses tiff24nc as configured device.

3

Example: gs -dNOPAUSE -r300 -sDEVICE=tiff24nc -sCompressionlzw -dBATCH -sOutputFile=”a-%04d.tif” ./multipage-pdf.pdf PDF Optimization PDF optimization is not supported by Ghostscript on Linux thus PDF optimization command is no there.

3. ImageMagick

Convert Command: convert conversion-param “input-file-path” “output-file-path”

Example: [TIFF TO TIFF Conversion] [Multipage TIFF TO Single Page TIFF Conversion] convert -limit area 100mb .\multipage-tif.tif -compress LZW a-%04d.tif”

4

[TIFF TO PNG Conversion] [Multipage TIFF TO Single Page PDF Conversion] convert .\multipage-tif.tif -colorspace gray -alpha off a-%04d.png

5

[TIFF TO PNG Thumbnail Conversion] convert .\sample.tif -colorspace rgb -thumbnail 200×150 a-%04d.png

6

[COLORED TIFF TO PDF Conversion] convert test.tif -quality 100.0 -compress LZW out.pdf

7

[NON-COLORED TIFF TO PDF Conversion] convert test.tif -quality 100.0 -monochrome -compress LZW out.pdf

8

[PDF TO TIFF Conversion] convert -limit area 100mb sample.pdf -compress LZW a-%04d.tif

9
4. Tesseract

Tesseract OCR Command: tesseract “input TIFF file path” “output html file path without .html extension” “-l eng” +”hocr.txt file path” Example: tesseract ./a.tif out –l eng +./hocr.txt

5. Zxing

Zxing Barcode Command: java -cp zxing-1.6.0.jar:.: com.google.zxing.client.j2se.CommandLineRunner “png file path”

Was this article helpful to you?

Engineering

Comments are closed.