Class PDFTextExtractor

java.lang.Object
org.nuxeo.ecm.platform.pdf.PDFTextExtractor

public class PDFTextExtractor extends Object
Extracts raw text from a PDF.
Since:
8.10
  • Constructor Details

    • PDFTextExtractor

      public PDFTextExtractor(Blob inBlob)
    • PDFTextExtractor

      public PDFTextExtractor(DocumentModel inDoc, String inXPath)
      Creates an extractor from a DocumentModel. If inXPath is null or empty (""), it defaults to file:content.
      Parameters:
      inDoc - Input DocumentModel.
      inXPath - XPath of the property to read in inDoc, for example file:content.
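
The defaulting rule described for inXPath can be illustrated with a minimal, standalone sketch. It is not the Nuxeo implementation: the class XPathDefaultSketch and the helper resolveXPath are hypothetical names introduced only for illustration, and the sample XPath files:files/0/file is an assumed value. Only the default file:content and the null-or-empty condition come from the documentation above.

```java
// Standalone sketch of the documented defaulting rule; no Nuxeo classes are used.
final class XPathDefaultSketch {

    // Default stated in the constructor documentation.
    static final String DEFAULT_XPATH = "file:content";

    // Hypothetical helper (not part of the Nuxeo API): returns the default
    // when the caller passes null or "", otherwise returns the value unchanged.
    public static String resolveXPath(String inXPath) {
        return (inXPath == null || inXPath.isEmpty()) ? DEFAULT_XPATH : inXPath;
    }

    public static void main(String[] args) {
        System.out.println(resolveXPath(null));                 // file:content
        System.out.println(resolveXPath(""));                   // file:content
        System.out.println(resolveXPath("files:files/0/file")); // files:files/0/file
    }
}
```

Under this reading, new PDFTextExtractor(inDoc, null), new PDFTextExtractor(inDoc, "") and new PDFTextExtractor(inDoc, "file:content") would all read the same property.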
  • Method Details