DocumentRecognitionSettings
Contents
[
Hide
]DocumentRecognitionSettings class
Settings for the pdf recognition.
Contains elements that allow customizing the recognition process.
The DocumentRecognitionSettings type exposes the following members:
Constructors
| Name | Description |
|---|---|
| DocumentRecognitionSettings(start_page, pages_number) | Initializes a new instance of the DocumentRecognitionSettings class |
| DocumentRecognitionSettings(start_page, pages_number, language, detect_areas, auto_skew, threshold) | Initializes a new instance of the DocumentRecognitionSettings class |
Properties
| Name | Description |
|---|---|
| ignored_symbols | Sets blacklist for recognition symbols. |
| ignored_characters | Sets blacklist for recognition symbols. |
| allowed_symbols | Set the allowed characters with alphabet property. |
| lines_filtration | Allows to recognize text in the tables (regions surrounded lines). |
| preprocessing_filters | Allows to prepare the image for OCR by adjusting pre-processing methods. |
| auto_contrast | Allows using an additional contrast correction algorithm for the image before recognition. |
| allowed_characters | Allowed characters set. Determines the type of characters allowed for recognition result. |
| detect_areas_mode | Allows to select the optimal mode for document type areas: document, photo, plain text, column, image. |
| auto_denoising | Enables the use of an additional neural network to improve the image - reduce noise. Useful for images with scan artifacts, distortion, spots, flares, gradients, foreign elements. |
| upscale_small_font | Allows you to use additional algorithms specifically for small font recognition. Useful for images with small size characters. |
| start_page | Set the first page for recognition. |
| pages_number | Set the number of pages for recognition multipage pdf file. |
See Also
- namespace aspose.ocr
- assembly Aspose.ocr