Aspose::Pdf::Text::ParagraphAbsorber class

ParagraphAbsorber class

Represents an absorber object of page structure objects such as sections and paragraphs. Performs search for sections and paragraphs of text and provides access for rectangles and polydons that describes it in text coordinate space. Also performs text segments search and provides access to search results via TextFragments collections grouped by structure elements.

class ParagraphAbsorber : public System::Object

Methods

MethodDescription
get_IsMulticolumnParagraphsAllowed() constGets value that indicates whether starting text lines of a next section may be treated as continuation of the last paragraph of a previous section.
get_PageMarkups() constGets collection of PageMarkup that were absorbed.
get_ParagraphAbsorberOptions() constGets the ParagraphAbsorberOptions.
get_SectionsSearchDepth() constGets value that instructs how many times sequential searches for more fine elements of structure will be performed. Default search depth is 3. It means three searches for horizontally divided sections (headers, paragraphs etc) and three searches for vertically divided ones (columns).
get_TextReplaceOptions() constGets the TextReplaceOptions.
ParagraphAbsorber()Initializes a new instance of the ParagraphAbsorber that performs search for sections/paragraphs of the document or page.
ParagraphAbsorber(int32_t)Initializes a new instance of the ParagraphAbsorber that performs search for sections/paragraphs of the document or page.
ParagraphAbsorber(System::SharedPtr<Aspose::Pdf::Text::ParagraphAbsorberOptions>)Initializes a new instance of the ParagraphAbsorber that performs search for sections/paragraphs of the document or page with the specified parameters.
ParagraphAbsorber(int32_t, System::SharedPtr<Aspose::Pdf::Text::ParagraphAbsorberOptions>)Initializes a new instance of the ParagraphAbsorber that performs search for sections/paragraphs of the document or page with the specified parameters.
set_IsMulticolumnParagraphsAllowed(bool)Sets value that indicates whether starting text lines of a next section may be treated as continuation of the last paragraph of a previous section.
set_ParagraphAbsorberOptions(System::SharedPtr<Aspose::Pdf::Text::ParagraphAbsorberOptions>)Sets the ParagraphAbsorberOptions.
set_SectionsSearchDepth(int32_t)Sets value that instructs how many times sequential searches for more fine elements of structure will be performed. Default search depth is 3. It means three searches for horizontally divided sections (headers, paragraphs etc) and three searches for vertically divided ones (columns).
set_TextReplaceOptions(System::SharedPtr<Aspose::Pdf::Text::TextReplaceOptions>)Sets the TextReplaceOptions.
Visit(System::SharedPtr<Document>)Performs search for sections and paragraphs on the specified Document.
Visit(System::SharedPtr<Page>)Performs search on the specified Page.

Remarks

When the search is completed the ParagraphAbsorber::PageMarkups collection will contains PageMarkup objects that represents page structure by collections of MarkupSection and MarkupParagraph. The TextFragment object provides access to the search occurrence text, text properties, and allows to edit text and change the text state (font, font size, color etc).

See Also