DocumentRecognitionSettings

DocumentRecognitionSettings class

Settings for PDF recognition. Contains elements that customize the recognition process.

public class DocumentRecognitionSettings : BaseRecognitionSettings

Constructors

Name Description
DocumentRecognitionSettings(int, int) Initializes a new instance of the DocumentRecognitionSettings class with a short set of properties.
DocumentRecognitionSettings(int, int, Language, bool, bool, int) Initializes a new instance of the DocumentRecognitionSettings class with the full set of properties.
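
As an illustration of the short constructor, the sketch below builds a settings object for a page range. It assumes the class lives in the Aspose.OCR namespace and that the two int arguments are the start page and the number of pages; neither is stated in the table above. SettingsExample and ForPages are hypothetical names, and whether page numbering starts at 0 or 1 is not specified here.

```csharp
using Aspose.OCR;

public static class SettingsExample
{
    // Hypothetical helper, not part of the library.
    public static DocumentRecognitionSettings ForPages(int startPage, int pagesNumber)
    {
        // Assumption: the two int arguments are (start page, number of pages).
        var settings = new DocumentRecognitionSettings(startPage, pagesNumber);

        // Assigning the documented StartPage and PagesNumber properties
        // makes the result independent of the argument-order assumption.
        settings.StartPage = startPage;
        settings.PagesNumber = pagesNumber;
        return settings;
    }
}
```

Setting the properties after construction is redundant if the argument order is as assumed, but it keeps the sketch correct either way.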

Properties

Name Description
AllowedCharacters { get; set; } The set of allowed characters. Determines which types of characters may appear in the recognition result.
AutoContrast { get; set; } Allows using an additional contrast correction algorithm for the image before recognition.
AutoDenoising { get; set; } Enables an additional neural network that improves the image by reducing noise. Useful for images with scan artifacts, distortion, spots, flares, gradients, or foreign elements.
AutoSkew { set; } Gets or sets a flag indicating whether automatic image skew correction should be enabled. Enabled (true) by default.
DetectAreas { set; } Gets or sets a flag indicating whether automatic text area detection should be enabled. Enables the Document Structure Recognition module. This takes more time and memory to process an image, but provides more accurate results in complex cases. Disable (set to false) for faster image processing or for images with a simple structure.
DetectAreasMode { get; set; } Selects the area detection mode best suited to the document type: document, photo, plain text, column, image.
IgnoredCharacters { get; set; } Sets a blacklist of characters to exclude from recognition.
Language { set; } Gets or sets the language used for OCR. Determines the alphabet used during recognition. Multi-language by default.
LinesFiltration { get; set; } Enables recognition of text in tables (regions surrounded by lines).
PagesNumber { get; set; } Sets the number of pages to recognize in a multipage PDF file.
PreprocessingFilters { get; set; } Prepares the image for OCR by adjusting pre-processing methods.
SkewAngle { set; } Gets or sets the angle, in degrees, for image rotation. Setting this value disables the AutoSkew property, so automatic skew correction is not applied. Zero by default.
StartPage { get; set; } Sets the first page for recognition.
ThreadsCount { set; } Gets or sets the number of threads for processing. The default value, 0, means that the image will be processed with a number of threads equal to the number of processors. ThreadsCount = 1 means that the image will be processed in the main thread.
ThresholdValue { set; } Gets or sets a custom threshold value for image binarization. Range from 1 to 255.
UpscaleSmallFont { get; set; } Enables additional algorithms designed for small font recognition. Useful for images with small characters.
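
A few of the rules in this table can be sketched in code: the meaning of ThreadsCount = 0, the 1 to 255 range of ThresholdValue, and turning off DetectAreas for simple pages. This is only an illustration, not the library's implementation. RecognitionSetup and its methods are hypothetical, the Aspose.OCR namespace is assumed, and the property types (bool for the flag, int for the counts) are inferred from the descriptions.

```csharp
using System;
using Aspose.OCR;

// Hypothetical helper class, not part of the library.
public static class RecognitionSetup
{
    // Documented rule: 0 means one thread per processor;
    // any positive value is used as given.
    public static int EffectiveThreads(int threadsCount)
    {
        if (threadsCount < 0)
            throw new ArgumentOutOfRangeException(nameof(threadsCount));
        return threadsCount == 0 ? Environment.ProcessorCount : threadsCount;
    }

    // Documented range for ThresholdValue is 1 to 255.
    public static bool IsValidThreshold(int value) => value >= 1 && value <= 255;

    // Applies a faster profile for pages with a simple structure.
    public static void ApplyFastProfile(DocumentRecognitionSettings settings, int threshold)
    {
        if (!IsValidThreshold(threshold))
            throw new ArgumentOutOfRangeException(nameof(threshold), "Expected 1 to 255.");

        settings.DetectAreas = false;       // skip Document Structure Recognition
        settings.ThreadsCount = 1;          // process in the main thread
        settings.ThresholdValue = threshold;
    }
}
```

The helper takes an existing settings object so that it does not depend on either constructor signature.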

See Also