Skip to content

PDFTextWord

Namespace: O2S.Components.PDF4NET.Content

Defines a word returned by a text extraction operation.

public class PDFTextWord

Inheritance ObjectPDFTextWord

Properties

Text

Gets the result text.

public string Text { get; }

Property Value

String
The text associated with the word.


TextRuns

Gets the text fragments.

public PDFTextRunCollection TextRuns { get; internal set; }

Property Value

PDFTextRunCollection
The collection of text fragments that compose the word.


TypographicWordCorners

Gets the typographic coordinates of the word corners.

public PDFPoint[] TypographicWordCorners { get; }

Property Value

PDFPoint[]
The typographic coordinates on the page of the word's corners.

Remarks

The order of the corners in the array is the following: lower left, lower right, upper right, upper left. The typographic corners consider the font's Descent for the lower corners. The PDFTextWord.WordCorners property considers the text baseline for the lower corners.


WordCorners

Gets the visual coordinates of the word corners.

public PDFPoint[] WordCorners { get; }

Property Value

PDFPoint[]
The coordinates on the page of the word's corners.

Remarks

The order of the corners in the array is the following: lower left, lower right, upper right, upper left. These corners match the text baseline for the lower corners. The PDFTextWord.TypographicWordCorners property considers the font's Descent for the lower corners.


Methods

ToString()

Converts the object to string.

public string ToString()

Returns

String
This object as string.