DocumentRecognitionSettings

DocumentRecognitionSettings class

Settings for PDF recognition. Contains properties that customize the recognition process.

public class DocumentRecognitionSettings

Constructors

| Name | Description |
| --- | --- |
| DocumentRecognitionSettings(int, int) | Initializes a new instance of the DocumentRecognitionSettings class with a short set of properties. |
| DocumentRecognitionSettings(int, int, Language, bool, bool, int) | Initializes a new instance of the DocumentRecognitionSettings class with the full set of properties. |

Properties

| Name | Description |
| --- | --- |
| Allowed { get; set; } | Allowed character set. Determines which types of characters may appear in the recognition result. |
| AutoContrast { get; set; } | Enables an additional contrast correction algorithm that is applied to the image before recognition. |
| AutoDenoising { get; set; } | Enables an additional neural network that improves the image by reducing noise. Useful for images with scan artifacts, distortion, spots, flares, gradients, or foreign elements. |
| AutoSkew { set; } | Gets or sets a flag indicating whether automatic image skew correction should be enabled. Enabled (true) by default. |
| DetectAreas { set; } | Gets or sets a flag indicating whether automatic text area detection should be enabled. Enables the Document Structure Recognition module. This takes more time and memory to process an image, but gives more accurate results in complex cases. Disable (set to false) for faster processing or for images with a simple structure. |
| DetectAreasMode { get; set; } | Selects the area detection mode best suited to the document type: document, photo, plain text, column, image. |
| IgnoredCharacters { get; set; } | Sets a blacklist of characters to exclude from the recognition result. |
| Language { set; } | Gets or sets the language used for OCR. Determines the alphabet used during recognition. Multi-language by default. |
| LinesFiltration { get; set; } | Allows recognizing text in tables (regions surrounded by lines). |
| PagesNumber { get; set; } | Sets the number of pages to recognize in a multipage PDF file. |
| StartPage { get; set; } | Sets the first page to recognize. |
| ThreadsCount { set; } | Gets or sets the number of threads for processing. The default, 0, means that the image is processed with as many threads as there are processors. ThreadsCount = 1 means that the image is processed in the main thread. |
| ThresholdValue { set; } | Gets or sets a custom threshold value for image binarization. Range: 1 to 255. |
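
The properties listed above might be combined as in the following minimal sketch. It is not taken from the library's documentation and rests on several assumptions: that the class lives in the `Aspose.OCR` namespace, that the two `int` constructor arguments are the start page and the page count (in that order), and that the page and thread properties are `int` while the flags are `bool`. All values are illustrative only.

```csharp
using Aspose.OCR; // assumed namespace; this page does not state it

class DocumentRecognitionSettingsExample
{
    static void Main()
    {
        // Assumption: the two int arguments are the start page and the page count.
        // The initializer sets both properties explicitly, so the intent does not
        // depend on the argument order.
        var settings = new DocumentRecognitionSettings(0, 1)
        {
            StartPage = 0,        // illustrative; the page indexing base is not stated here
            PagesNumber = 1,      // recognize a single page
            DetectAreas = false,  // skip area detection for a simply structured page
            AutoSkew = true,      // keep skew correction on (the stated default)
            ThreadsCount = 1,     // process in the main thread
            ThresholdValue = 128  // custom binarization threshold, within 1 to 255
        };
    }
}
```

The resulting object would then be passed to whichever recognition call accepts a DocumentRecognitionSettings instance. Disabling DetectAreas trades accuracy on complex layouts for speed, as the property description notes.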

See Also