DocumentRecognitionSettings
DocumentRecognitionSettings class
Settings for PDF recognition. Contains properties that customize the recognition process.
public class DocumentRecognitionSettings : BaseRecognitionSettings
Constructors
Name | Description |
---|---|
DocumentRecognitionSettings(int, int) | Initializes a new instance of the DocumentRecognitionSettings class with a short set of properties. |
DocumentRecognitionSettings(int, int, Language, bool, bool, int) | Initializes a new instance of the DocumentRecognitionSettings class with the full set of properties. |
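
The table above does not name the constructor parameters. A minimal sketch of the short constructor follows, assuming the two `int` arguments are the first page and the number of pages, in that order. That assumption mirrors the `StartPage` and `PagesNumber` properties and is not confirmed by this page. Whether page numbering starts at 0 or 1 is also not stated here. The variable names `firstPage` and `pageCount` are illustrative only.

```csharp
using Aspose.OCR;

// Assumption: argument order is (first page, number of pages).
// Whether the first page is numbered 0 or 1 is not documented in this table.
int firstPage = 0;
int pageCount = 3;

// Short constructor: every other setting keeps its default value.
var settings = new DocumentRecognitionSettings(firstPage, pageCount);
```

The example requires a reference to the Aspose.OCR assembly.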
Properties
Name | Description |
---|---|
AllowedCharacters { get; set; } | Set of allowed characters. Determines which types of characters may appear in the recognition result. |
AutoContrast { get; set; } | Enables an additional contrast correction algorithm that is applied to the image before recognition. |
AutoDenoising { get; set; } | Enables an additional neural network that reduces image noise. Useful for images with scan artifacts, distortion, spots, flares, gradients, or foreign elements. |
AutoSkew { set; } | Gets or sets a flag indicating whether automatic image skew correction should be enabled. Enabled (true) by default. |
DetectAreasMode { get; set; } | Selects the area detection mode best suited to the document type: document, photo, plain text, column, image. |
IgnoredCharacters { get; set; } | Sets a blacklist of characters to exclude from recognition. |
Language { set; } | Gets or sets the language used for OCR. Determines the alphabet used during recognition. Multi-language by default. |
LinesFiltration { get; set; } | Enables recognition of text in tables (regions surrounded by lines). |
PagesNumber { get; set; } | Sets the number of pages to recognize in a multipage PDF file. |
PreprocessingFilters { get; set; } | Configures the pre-processing filters that prepare the image for OCR. |
SkewAngle { set; } | Gets or sets the angle, in degrees, for image rotation. Setting this value disables the AutoSkew property, so automatic skew correction is not applied. Zero by default. |
StartPage { get; set; } | Sets the first page for recognition. |
ThreadsCount { set; } | Gets or sets the number of threads for processing. The default value, 0, means that the image is processed with as many threads as there are processors. ThreadsCount = 1 means that the image is processed in the main thread. |
ThresholdValue { set; } | Gets or sets a custom threshold value for image binarization. Valid range: 1 to 255. |
UpscaleSmallFont { get; set; } | Allows you to use additional algorithms specifically for small font recognition. Useful for images with small size characters. |
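
As a hedged illustration of how several of these properties interact, the sketch below sets a few of them on an existing instance. The property types are not listed in the table, so the code assumes that `ThreadsCount` and `ThresholdValue` accept integer values and that `SkewAngle` accepts a numeric value. The meaning of the constructor arguments (first page, number of pages) is likewise an assumption.

```csharp
using Aspose.OCR;

// Assumption: the (int, int) constructor takes the first page and the page count.
var settings = new DocumentRecognitionSettings(0, 3);

// Process in the main thread only (0 would use one thread per processor).
settings.ThreadsCount = 1;

// Custom binarization threshold; the documented range is 1 to 255.
settings.ThresholdValue = 128;

// Fixed rotation in degrees. Per the table, setting this disables AutoSkew,
// so automatic skew correction is no longer applied.
settings.SkewAngle = 5;
```

Because setting `SkewAngle` turns off automatic skew correction, assign it only when the rotation of the scanned pages is known in advance.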
See Also
- class BaseRecognitionSettings
- namespace Aspose.OCR
- assembly Aspose.OCR