Zone OCR, Barcode, QRCode module

Delete zone

The Delete Zone button allows you to remove a zone. All zone settings and the zone marking on the page image are removed.

Test Zone

The test function tests the zone in the text window which appears. The text that was recognized or read from a barcode appears there. All relevant settings for the text that has been read are taken into account (number of words or trimming spaces).

Name

The name of the zone is displayed above the zone marking. The name is important for addressing the zone contents with variables (variables contain the zone name). For this reason, care should be taken when changing the name, as the variables (see Variables) might have been used for scan destinations in the metafile generation (see also The Scan Destination SMB Module). The name of the zone is displayed above the zone marking.

The name of the variable must not contain special characters. Allowed are small and capital letters (no umlauts or similar), numbers and the dot ".".
If the variable name is used in several places, all places can influence the variable value.

When changing the name of the zone, note that it will no longer be possible to resolve any variables used in metafiles.

Type

For Type, specify what is contained in the zone. The following options are available.

Barcode 1D

Barcode 1D is a code that essentially consists of bars. The following types are supported.

Page Separation

The contents of the zone that is read can be used to cause the scan to be terminated at this point. The decisive factor is the condition for the page separation (this setting is only displayed if page separation is active). B/W TIFF documents are compressed with CCITT Group 4.

Technically speaking, as soon as a separation is performed, a new instance of the workflow is started for each separated document. These instances then run across the separating zone OCR module and continue the workflow (another zone OCR module can follow here if necessary).

Do not separate: Page separation is not activated.

Separate and keep: The scan is separated at the pages where the zone content condition applies. The page with the separating zone is used as the first page of the new document.

Separate and remove: The scan is separated at the pages where the zone content condition applies. The page with the separation zone is removed from the scan.

Group by content: The separated sections with the same zone content will be merged into one file (in the order in which they were scanned). This means there will be as many files as there are different zone contents in the scanned batch.

If a page has multiple zones, enter the page separation information for one zone only. Entries in multiple zones can lead to an unexpected result.

Condition for page separation

Enter the condition here that is to trigger the page separation. You can use wildcards or regular expressions (not both at the same time).

Filter zone content

The content of a zone can be filtered using a regular expression. This means that not the complete content of a zone is transferred to the target variable, but only the result of the regular expression applied to the content.

Regular expressions can be used to implement very complex searches and filters. For example, the invoice number, invoice amount or IBAN can be read from an invoice. These values do not have to be in the same place in an invoice to do this. On an invoice, the invoice amount may be centered. On another it might be on the right. However, as long as the position can be found by a regular expression (e.g. because net always precedes it), the desired values are found and stored in the zone variable.

Example: Between the word net or amount and the word EUR or €, all number characters are found and output with two digits after a period or comma.

(?<=Net|Amount)(:)*([\s]*|(EUR|€))*(((\d*(,|.))*\d{2}))*

Since one cannot be one hundred percent sure that all values searched for in this way have been correctly recognized, the Document Review feature is very well suited for checking the values in the WebClient and correcting or supplementing them if necessary, see also Document review.

Options

Number of words from the beginning

Specify here how many words from the beginning of the recognized text should be used (e.g. 1 uses only the first word of the recognized text as zone text (visible in the zone preview).