DocumentRecognitionSettings
Contents
[
Hide
]DocumentRecognitionSettings class
Settings for the pdf recognition. Contains elements that allow customizing the recognition process.
public class DocumentRecognitionSettings : BaseRecognitionSettings
Constructors
| Name | Description |
|---|---|
| DocumentRecognitionSettings(int, int) | Initializes a new instance of the DocumentRecognitionSettings class with short set of properties. |
| DocumentRecognitionSettings(int, int, Language, bool, bool, int) | Initializes a new instance of the DocumentRecognitionSettings class with full set of properties. |
Properties
| Name | Description |
|---|---|
| AllowedCharacters { get; set; } | Allowed characters set. Determines the type of characters allowed for recognition result. |
| AutoContrast { get; set; } | Allows using an additional contrast correction algorithm for the image before recognition. |
| AutoDenoising { get; set; } | Enables the use of an additional neural network to improve the image - reduce noise. Useful for images with scan artifacts, distortion, spots, flares, gradients, foreign elements. |
| AutoSkew { set; } | Gets or sets a flag indicating whether automatic image skew correction should be enabled. Enabled (true) by default. |
| DetectAreasMode { get; set; } | Allows to select the optimal mode for document type areas: document, photo, plain text, column, image. |
| IgnoredCharacters { get; set; } | Sets blacklist for recognition symbols. |
| Language { set; } | Gets or sets the language used for OCR. Determines the alphabet used during recognition. Multi-language by default. |
| LinesFiltration { get; set; } | Allows to recognize text in the tables (regions surrounded lines). |
| PagesNumber { get; set; } | Set the number of pages for recognition multipage pdf file. |
| PreprocessingFilters { get; set; } | Allows to prepare the image for OCR by adjusting pre-processing methods. |
| SkewAngle { set; } | Gets or sets angle in degrees for image rotation. Setting this value will disable the AutoSkew property, so that auto skew correction is not applied. Zero by default. |
| StartPage { get; set; } | Set the first page for recognition. |
| ThreadsCount { set; } | Gets or sets the number fo threads for processing. By default, 0 means that the image will be processed with the number of threads equal to your number of processors. ThreadsCount = 1 means that the image will be processed in the main thread. |
| ThresholdValue { set; } | Gets or sets custom threshold value for image binarization. Range from 1 to 255. |
| UpscaleSmallFont { get; set; } | Allows you to use additional algorithms specifically for small font recognition. Useful for images with small size characters. |
See Also
- class BaseRecognitionSettings
- namespace Aspose.OCR
- assembly Aspose.OCR