public static final class PdfFormatConversionOptions.PuaProcessingStrategy
extends com.aspose.ms.System.Enum
Some PDF documents have special unicode symbols, which are belonged to Private Use Area (PUA), see description at https://en.wikipedia.org/wiki/Private_Use_Areas. This symbols cause an PDF/A compliant errors like "Text is mapped to Unicode Private Use Area but no ActualText entry is present". This enumeration declares a strategies which can be used to handle PUA symbols.
Modifier and Type | Field and Description |
---|---|
static int |
None
Disable PUA symbol processing.
|
static int |
SubstitutePuaSymbols
This strategy works slower than 'SurroundPuaTextWithEmptyActualText' but it can remove
PUA compliant errors for documents which can't be handled properly by
SurroundPuaTextWithEmptyActualText.
|
static int |
SurroundPuaTextWithEmptyActualText
Inserts marked content block with ActualText entry which contains empty text.
|
public static final int None
Disable PUA symbol processing. This strategy used by default for PDF/A documents with Level B conformance.
public static final int SurroundPuaTextWithEmptyActualText
Inserts marked content block with ActualText entry which contains empty text. This strategy gives good results for documents without marked content blocks. Used by default for PDF/A documents with Level A conformance.
public static final int SubstitutePuaSymbols
This strategy works slower than 'SurroundPuaTextWithEmptyActualText' but it can remove PUA compliant errors for documents which can't be handled properly by SurroundPuaTextWithEmptyActualText. PUA symbols are substituted on symbol 'space' or special unicode (some PUA symbols have unicode analogs). Substitution applied not to document's text but to font's internal data ToUnicode so it doesn't affects symbol's vision but it affects symbol's presentation in copy/paste operation system buffer.