Uses of Class
org.faceless.pdf2.PageExtractor.Text
-
Packages that use PageExtractor.Text Package Description org.faceless.pdf2 Contains the classes used to create a PDF document -
-
Uses of PageExtractor.Text in org.faceless.pdf2
Fields in org.faceless.pdf2 with type parameters of type PageExtractor.Text Modifier and Type Field Description static java.util.Comparator<PageExtractor.Text>
PageExtractor. DISPLAYORDER
A Comparator which can be used to sortPageExtractor.Text
objects into their "display" order - the order which they visibly appear on the page, and the order that is returned byPageExtractor.getTextInDisplayOrder()
static java.util.Comparator<PageExtractor.Text>
PageExtractor. NATURALORDER
A Comparator which can be used to sortPageExtractor.Text
objects into their "natural" order - the order which they occur in the PDF page stream, and the order that is returned byPageExtractor.getTextUnordered()
Methods in org.faceless.pdf2 that return PageExtractor.Text Modifier and Type Method Description abstract PageExtractor.Text
PageExtractor.Text. getPrimaryText()
If this text is a subtext or collection of Text object, return the primary text it starts with.abstract PageExtractor.Text
PageExtractor.Text. getRowNext()
Return the next Text item in this row, ornull
if there are noneabstract PageExtractor.Text
PageExtractor.Text. getRowPrevious()
Return the next Text item in this row, ornull
if there are noneabstract PageExtractor.Text
PageExtractor.Text. getSubText(int off, int len)
Return a substring of this Text object as another Text objectMethods in org.faceless.pdf2 that return types with arguments of type PageExtractor.Text Modifier and Type Method Description static java.util.Collection<PageExtractor.Text>
PageExtractor. cropText(java.util.Collection<PageExtractor.Text> all, java.awt.Shape shape)
Given a Collection ofPageExtractor.Text
items, as returned bygetMatchingText()
,PageExtractor.getTextUnordered()
orPageExtractor.getTextInDisplayOrder()
, return a new Collection which contains only Text that falls completely inside the specifiedShape
.java.util.Collection<PageExtractor.Text>
PageExtractor. getMatchingNormalizedText(java.lang.String[] queries, boolean caseinsensitive)
Returns a Collection ofPageExtractor.Text
objects on this page that match any of the specified substrings, based on normalized text.java.util.Collection<PageExtractor.Text>
PageExtractor. getMatchingNormalizedText(java.util.regex.Pattern pattern)
Returns a Collection ofPageExtractor.Text
objects on this page that match the specified regular expression, based on normalized text.java.util.Collection<PageExtractor.Text>
PageExtractor. getMatchingText(java.lang.String query)
Return a Collection ofPageExtractor.Text
items on this page that are equal to the specified substring.java.util.Collection<PageExtractor.Text>
PageExtractor. getMatchingText(java.lang.String[] queries)
Return a Collection ofPageExtractor.Text
items on this page that are equals to one of the specified substrings.java.util.Collection<PageExtractor.Text>
PageExtractor. getMatchingText(java.lang.String[] queries, boolean caseinsensitive)
Return a Collection ofPageExtractor.Text
items on this page that are equals to one of the specified substrings.java.util.Collection<PageExtractor.Text>
PageExtractor. getMatchingText(java.util.regex.Pattern pattern)
Return a Collection ofPageExtractor.Text
items on this page that match the specified Regular Expression.java.util.Collection<PageExtractor.Text>
PageExtractor. getText(java.util.Comparator<PageExtractor.Text> comp)
Return everyPageExtractor.Text
item on the page, in the specified order.java.util.Collection<PageExtractor.Text>
PageExtractor. getTextInDisplayOrder()
Return everyPageExtractor.Text
item on the page, in the order they are displayed on the screen - so the first item in the returned collection will nearest to the top left of the page.java.util.Collection<PageExtractor.Text>
PageExtractor. getTextUnordered()
Return everyPageExtractor.Text
item on the page, in the order they were added to the page.Methods in org.faceless.pdf2 with parameters of type PageExtractor.Text Modifier and Type Method Description java.text.AttributedString
PageExtractor. getStyledText(PageExtractor.Text first, int firstchar, PageExtractor.Text last, int lastchar, boolean displayorder)
Deprecated.java.text.AttributedString
PageExtractor. getStyledText(PageExtractor.Text first, int firstchar, PageExtractor.Text last, int lastchar, java.util.Comparator<PageExtractor.Text> order)
Return an AttributedString containing a contiguous range of text from this PageExtractor.java.lang.StringBuffer
PageExtractor. getText(PageExtractor.Text first, int firstchar, PageExtractor.Text last, int lastchar, boolean displayorder)
Deprecated.java.lang.StringBuffer
PageExtractor. getText(PageExtractor.Text first, int firstchar, PageExtractor.Text last, int lastchar, java.util.Comparator<PageExtractor.Text> order)
Return a StringBuffer containing a contiguous range of text from this PageExtractor.Method parameters in org.faceless.pdf2 with type arguments of type PageExtractor.Text Modifier and Type Method Description static java.util.Collection<PageExtractor.Text>
PageExtractor. cropText(java.util.Collection<PageExtractor.Text> all, java.awt.Shape shape)
Given a Collection ofPageExtractor.Text
items, as returned bygetMatchingText()
,PageExtractor.getTextUnordered()
orPageExtractor.getTextInDisplayOrder()
, return a new Collection which contains only Text that falls completely inside the specifiedShape
.java.text.AttributedString
PageExtractor. getStyledText(PageExtractor.Text first, int firstchar, PageExtractor.Text last, int lastchar, java.util.Comparator<PageExtractor.Text> order)
Return an AttributedString containing a contiguous range of text from this PageExtractor.java.util.Collection<PageExtractor.Text>
PageExtractor. getText(java.util.Comparator<PageExtractor.Text> comp)
Return everyPageExtractor.Text
item on the page, in the specified order.java.lang.StringBuffer
PageExtractor. getText(PageExtractor.Text first, int firstchar, PageExtractor.Text last, int lastchar, java.util.Comparator<PageExtractor.Text> order)
Return a StringBuffer containing a contiguous range of text from this PageExtractor.
-