Package org.nuxeo.ecm.platform.pdf
Class PDFTextExtractor
java.lang.Object
org.nuxeo.ecm.platform.pdf.PDFTextExtractor
Extracts raw text from a PDF.
- Since:
- 8.10
-
Constructor Summary
ConstructorsConstructorDescriptionPDFTextExtractor(Blob inBlob) PDFTextExtractor(DocumentModel inDoc, String inXPath) Constructor with aDocumentModel. -
Method Summary
Modifier and TypeMethodDescriptionextractLastPartOfLine(String string) extractLineOf(String inString) voidsetPassword(String password)
-
Constructor Details
-
PDFTextExtractor
-
PDFTextExtractor
Constructor with aDocumentModel. The default value forinXPath(if passednullor "") isfile:content.- Parameters:
inDoc- Input DocumentModel.inXPath- Input XPath.
-
-
Method Details
-
getAllExtractedLines
- Throws:
NuxeoException
-
extractLineOf
- Throws:
IOException
-
extractLastPartOfLine
- Throws:
IOException
-
setPassword
-