Skip to main content
Skip table of contents

Classify Document

The Classify Document pre-processor is used to take the input document and pass it to the classification engine (Document Classification Assistant - DCA) in order to either predict or classify the document.

DCA can be used to classify documents against a list of known document types. This is a locally trained model that is built using your own documents.

The Classify Document pre-processor screen is shown below:

image-20260126-234534.png

Stage Settings

image-20260126-234627.png

Action

Choose from one these actions:

image-20260126-235548.png

Specify an action for the pre-processor to perform against a DCA page:

Classify

Passes the text layer to the engine and adds it to the training data for the specified classification.

Setting

Description

Page

The DCA page that contains the trained model you want to target.

image-20260127-050403.png

Classification

The classification category to add the current text layer to training data for.

image-20260127-050238.png

Predict

Passes the text layer to the engine and returns the top three classifications.

Setting

Description

Page

The DCA page that contains the trained model you want to target.

image-20260127-050403.png

Minimum Confidence Level

Consider a specific category a “successful prediction” if the confidence value is greater than this value.

Eg: 80

image-20260127-050709.png

Confidence Difference Level

If two “successful predictions” are returned that exceed the minimum confidence level, then this is the difference that must exist between the two results for the higher confidence result to still be considered a “successful prediction”.

Eg: 20

image-20260127-050923.png

Target metadata ID

The id of the metadata which the classification result will be saved into

image-20260127-050952.png

Route successful prediction

Enable this option to take the top “successful prediction” and route the indexing item to the categories target page/queue.

image-20260127-051058.png

Get Classification Target

Queries the classification engine to return target information for a specific classification.

Setting

Description

Page

The DCA page that contains the trained model you want to target.

image-20260127-050403.png

Classification

The classification category to retrieve the target information for.

image-20260127-050238.png

Target metadata ID

The id of the metadata which the result will be saved into.

image-20260127-050952.png

Saving the Pre-Processor

Press the Apply button image-20260114-021549.png to save the pre-processor settings

Saving the RIA Page

Press the Save Button image-20260114-021648.png to save the RIA page changes.

 

Testing the Pre-Processor

  1. Make sure its enabled.

  2. Run an input document through the queue.

  3. Check that the document type has been correctly identified by the Classify Document pre-processor.

JavaScript errors detected

Please note, these errors can depend on your browser setup.

If this problem persists, please contact our support.