Document processing
With IQ4docs, you can edit documents extensively and forward them to different destinations. The document can be available as a paper version and be fed into the process by scanning – but you can also feed the documents into the system when they are in file form (document input).
To be able to use the document input, all devices with Embedded Client must have an OCR license and the license option OCR From E-Mail / Directory must be active, see Licensing.

When using a document input, the files are not sent to document processing via the device, as is the case with scanning, but are already available as a file and are fed in via a directory or an e-mail mailbox, see Document Input.

The image enhancements are carried out in the workflow by the Image enhancement module.

- Do not remove: The pages remain unchanged.
- Remove: The margins of the sheet are replaced by white. This can remove the dark shadows that appear at the edges of the page during a book scan, for example. This function can only be used on white pages.

- Do not remove: The pages remain unchanged.
- Remove: Attempts are made to remove clearly visible remnants of punched holes scanned into the sheet margins, which are usually caused by shadows.

- Do not correct: The pages remain unchanged.
- Correct: An attempt is made to straighten the content of a page scanned at a slight angle. This function is always performed automatically during OCR recognition to improve the character recognition result.

- Do not remove: The pages remain unchanged.
- Remove: Blank pages are removed from the document. The paper must not be too dirty or creased, as the image processing system might detect this as desired page content.

- Do not remove: The pages remain unchanged.
- Remove: An attempt is made to detect and automatically remove white borders around the scanned image.

- Off: The pages remain unchanged.
- On: An attempt is made to fill up the border pixels of lines, thereby slightly thickening the line. This function necessarily requires the b/w format (for grayscale or color the option is ignored).

- Off: The pages remain unchanged.
- On: An attempt is made to remove the border pixels from lines, thereby making the line slightly thinner. This function necessarily requires the b/w format (for grayscale or color the option is ignored).

To get a better image enhancement result (e.g. when removing punch holes) the image can be inverted.
The inversion is done internally and is used purely for processing. The scan itself is then always output normally (i.e. not inverted). Try out which setting gives the best result for your scans.
- Auto: The system itself decides whether the processing is done with a normal or inverted image.
- Normal: The processing is done with a normal image.
- Inverse: The processing is done with an inverted image.

The conversion into the desired file format is done by the The Document Processing Module.

- JPG
- XPS
- PNG
- TIFF
- PDF - PDF file contains JPEG image with configurable compression.
- PDF (image) - PDF file contains TIF image with configurable compression.
- PDF/A (image) - PDF/A compliant file contains TIF image with configurable compression.
- hcPDF - High compression PDF.
OCR formats (non-license-free OCR formats are only selectable on the device if the device has an OCR license, see also OCR and barcode recognition)
- Portable Document Format (pdf)
- PDF TS (OCR + image) (license-free OCR with Tesseract)
- PDF/A TS (OCR + image) (license-free OCR with Tesseract)
- PDF/A (OCR only)
- PDF/A-1b (OCR only)
- PDF/A-2b (OCR only)
- PDF/A-2u (OCR only)
- PDF (OCR + image)
- PDF/A (OCR + image)
- PDF/A-1b (OCR + image)
- PDF/A-2b (OCR + image)
- PDF/A-2u (OCR + image)
- Word (doc) OCR
- Word (docx) OCR
- Excel (xls) OCR
- Excel (xlsx) OCR
- PowerPoint (ppt) OCR
- PowerPoint (pptx) OCR
- Rich Text Format (RTF) OCR
- Text (txt) OCR

- Off: File separation is switched off, all pages scanned together in the document feeder are stored in one file. This does not work if the file format cannot contain several pages or with any image formats (e.g. TIFF, JPG).
- Number of pages: File separation is active and saves the specified number of pages in one file (e.g. if 6 pages are scanned and 2 pages is selected, this results in 3 files with 2 pages each). B/W TIFF documents are compressed using LZW.

JPEG quality lets you specify the quality of JPEG files embedded in PDF files. Consequently, this setting only affects PDF files. This setting also allows you to influence the size of the resulting PDF file.
- 50%
- 75%
- 90%
- 100%

Similar to the Combine function when copying, this function allows two pages to be scanned vertically and reduced in size to a landscape page.
- Off: Two on One is off, the pages remain unchanged.
- On: Two pages are displayed vertically and reduced in size on a landscape page side by side.

The PDF file format allows a file to be password protected. This setting is ignored for other file formats. The password is required to open the PDF file.
- Without: The file is created without password protection.
- Set: A password was specified. This password is assigned to the file.
According to the PDF definition, password protection is not possible for any archive formats (PDF/A...).

The files can be stored at different destinations, the subsequent modules for the destinations are available.

When using the document input Folder, you can pass an XML file to a document workflow in addition to an image file, which can be used to pass variables to the workflow. The variables can then be used in the workflow like the variables of a user interview (see also Document processing).
The name of the variable must not contain special characters. Allowed are small and capital letters (no umlauts or similar), numbers and the dot ".".
If the variable name is used in several places, all places can influence the variable value.
The XML file must have the same name as the associated image file (test.tif > test.xml).
<DocumentOptions>
<UserLogin>administrator</UserLogin>
<Variables>
<variable name="Name1" value="value1" type="string"/>
<variable name="Name2" value="222" type="int"/>
</Variables>
</DocumentOptions>
- The main area <DocumentOptions> must be present.
- The <Variables> area contains variable definitions.
- <variable name="Name1" value="value1" type="string"/> defines a variable by the name of Name1 with the content value1 of the type string. A numerical value can also be defined as an int (integer).
The type is of importance for formatting the variable (if it is missing, string is assumed), see also Variables.