ParagraphAbsorber

ParagraphAbsorber class

Represents an absorber object of page structure objects such as sections and paragraphs.
Performs search for sections and paragraphs of text and provides access for rectangles and polydons that describes it in text coordinate space.
Also performs text segments search and provides access to search results via TextFragments collections grouped by structure elements.

The ParagraphAbsorber type exposes the following members:

Constructors

NameDescription
ParagraphAbsorber()Initializes a new instance of the ParagraphAbsorber that performs search for sections/paragraphs of the document or page.
ParagraphAbsorber(sections_search_depth)Initializes a new instance of the ParagraphAbsorber class

Properties

NameDescription
page_markupsGets collection of PageMarkup that were absorbed.
sections_search_depthGets or sets value that instructs how many times sequential searches for more fine elements of structure will be performed.
Default search depth is 3.
It means three searches for horizontally divided sections (headers, paragraphs etc) and three searches for vertically divided ones (columns).
is_multicolumn_paragraphs_allowedGets or sets value that indicates whether starting text lines of a next section may be treated as continuation of the last paragraph of a previous section.

Methods

NameDescription
visit(doc)Performs search for sections and paragraphs on the specified Document.
visit(page)Performs search on the specified Page.

See Also