Method GetText

Namespace
BitMiracle.Docotic.Pdf
Assembly
BitMiracle.Docotic.Pdf.dll

GetText()

Gets text associated with this chunk.

public string GetText()

Returns

string

Text associated with this chunk, converted according to the default PdfTextConversionOptions.

Remarks

PDF documents draw and store text in visual order. To extract right-to-left and bidirectional text correctly, the characters must be reordered into logical order.

This method uses the inverse Bidi algorithm with the AutoLtr reading direction to reorder text according to the logical order.

After reordering, code points from the Arabic and Hebrew presentation forms are normalized using Unicode Normalization Form KC (NFKC).
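The normalization step can be illustrated without the library. The sketch below uses only the .NET base class library (`string.Normalize` with `NormalizationForm.FormKC`) to show what NFKC does to two presentation-form ligatures. It is not the library's internal implementation. The class and method names are hypothetical, and it assumes the runtime has full Unicode normalization data available (that is, it is not running in invariant globalization mode).

```csharp
using System;
using System.Text;

public static class PresentationFormDemo
{
    // Applies Unicode Normalization Form KC, which replaces compatibility
    // characters such as Arabic and Hebrew presentation forms with their
    // ordinary letters.
    public static string NormalizeNfkc(string text) =>
        text.Normalize(NormalizationForm.FormKC);

    // Formats each UTF-16 code unit as U+XXXX. Every character used in this
    // demo is in the Basic Multilingual Plane, so code units equal code points.
    public static string ToCodePoints(string text)
    {
        var sb = new StringBuilder();
        foreach (char c in text)
        {
            if (sb.Length > 0) sb.Append(' ');
            sb.Append("U+").Append(((int)c).ToString("X4"));
        }
        return sb.ToString();
    }

    public static void Main()
    {
        // U+FEFB ARABIC LIGATURE LAM WITH ALEF ISOLATED FORM
        string lamAlef = "\uFEFB";
        // U+FB4F HEBREW LIGATURE ALEF LAMED
        string alefLamed = "\uFB4F";

        Console.WriteLine(ToCodePoints(NormalizeNfkc(lamAlef)));   // U+0644 U+0627
        Console.WriteLine(ToCodePoints(NormalizeNfkc(alefLamed))); // U+05D0 U+05DC
    }
}
```

Each ligature becomes two ordinary letters. This is why text extracted with the default options can be searched and compared against text typed on a keyboard.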

GetText(PdfTextConversionOptions)

Gets text associated with this chunk according to the specified options.

public string GetText(PdfTextConversionOptions options)

Parameters

options PdfTextConversionOptions

The text conversion options.

Returns

string

Text from this chunk converted according to the specified options.
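A minimal usage sketch comparing the two overloads follows. It makes several assumptions that this page does not state: that the chunks are `PdfTextData` objects obtained from `PdfPage.GetWords()`, that the document has at least one page, and that the caller supplies an already configured `PdfTextConversionOptions` instance (its members are not listed here, so none are set). The class name, method name, and `path` argument are hypothetical.

```csharp
using System;
using BitMiracle.Docotic.Pdf;

public static class ChunkTextDump
{
    // Prints every word on the first page twice: once with the default
    // conversion options and once with the options supplied by the caller.
    public static void PrintWords(string path, PdfTextConversionOptions options)
    {
        using (var pdf = new PdfDocument(path))
        {
            // Assumption: GetWords() yields the text chunks this page refers to.
            foreach (PdfTextData word in pdf.Pages[0].GetWords())
            {
                Console.WriteLine("default: " + word.GetText());
                Console.WriteLine("custom:  " + word.GetText(options));
            }
        }
    }
}
```

For left-to-right text without presentation forms, both lines would be expected to match. Differences should appear mainly for right-to-left or bidirectional content when the supplied options differ from the defaults.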