Class PDFTextExtractor


  • public class PDFTextExtractor
    extends Object
    Extracts raw text from a PDF.
    Since:
    8.10
    • Constructor Detail

      • PDFTextExtractor

        public PDFTextExtractor​(Blob inBlob)
      • PDFTextExtractor

        public PDFTextExtractor​(DocumentModel inDoc,
                                String inXPath)
        Constructor with a DocumentModel. The default value for inXPath (if passed null or "") is file:content.
        Parameters:
        inDoc - Input DocumentModel.
        inXPath - Input XPath.