PdfTextData.GetText Method

Gets text associated with this chunk.

Namespace:  BitMiracle.Docotic.Pdf
Assembly:  BitMiracle.Docotic.Pdf (in BitMiracle.Docotic.Pdf.dll)


C#: public string GetText()
VB: Public Function GetText As String

Return Value

Type: String
Text associated with this chunk, converted using the default PdfTextConversionOptions.
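A minimal usage sketch follows. It assumes the chunks come from PdfPage.GetWords() and that "input.pdf" exists; both the file name and the choice of GetWords() as the source of PdfTextData objects are illustrative assumptions, not part of this method's contract.

```csharp
using System;
using BitMiracle.Docotic.Pdf;

public static class GetTextExample
{
    public static void Main()
    {
        // "input.pdf" is a hypothetical path; substitute a real document.
        using (var pdf = new PdfDocument("input.pdf"))
        {
            PdfPage page = pdf.Pages[0];

            // Assumption: GetWords() yields the PdfTextData chunks of the page.
            foreach (PdfTextData word in page.GetWords())
            {
                // GetText() applies the default conversion options.
                Console.WriteLine(word.GetText());
            }
        }
    }
}
```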


PDF documents draw and store text in visual order, that is, the order in which glyphs appear on the page. To extract right-to-left and bidirectional text correctly, characters must be reordered into logical (reading) order.

This method uses the inverse Bidi algorithm with the AutoLtr reading direction to reorder text according to the logical order.

After reordering, code points from Arabic and Hebrew presentation forms in the reordered text are normalized according to Unicode Normalization Form KC (NFKC).
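To illustrate what NFKC normalization does to presentation forms, the sketch below uses the standard .NET string.Normalize method rather than the Docotic.Pdf API. The class and method names are hypothetical, and the code shows only the normalization step, not the library's internal implementation or its Bidi reordering.

```csharp
using System;
using System.Text;

public static class PresentationFormDemo
{
    // Normalizes text to Unicode Normalization Form KC using the .NET runtime.
    public static string NormalizeKC(string text)
    {
        return text.Normalize(NormalizationForm.FormKC);
    }

    public static void Main()
    {
        // U+FEFB ARABIC LIGATURE LAM WITH ALEF ISOLATED FORM
        // decomposes to U+0644 (lam) followed by U+0627 (alef).
        string arabic = NormalizeKC("\uFEFB");

        // U+FB4F HEBREW LIGATURE ALEF LAMED
        // decomposes to U+05D0 (alef) followed by U+05DC (lamed).
        string hebrew = NormalizeKC("\uFB4F");

        Console.WriteLine(arabic == "\u0644\u0627"); // True
        Console.WriteLine(hebrew == "\u05D0\u05DC"); // True
    }
}
```

After normalization, a search for ordinary Arabic or Hebrew letters can match text that the PDF stored as ligature or positional-form code points.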

See Also