Package org.nuxeo.ecm.platform.pdf
Class PDFTextExtractor
- java.lang.Object
-
- org.nuxeo.ecm.platform.pdf.PDFTextExtractor
-
public class PDFTextExtractor extends Object
Extracts raw text from a PDF.- Since:
- 8.10
-
-
Constructor Summary
Constructors Constructor Description PDFTextExtractor(Blob inBlob)PDFTextExtractor(DocumentModel inDoc, String inXPath)Constructor with aDocumentModel.
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description StringextractLastPartOfLine(String string)StringextractLineOf(String inString)StringgetAllExtractedLines()voidsetPassword(String password)
-
-
-
Constructor Detail
-
PDFTextExtractor
public PDFTextExtractor(Blob inBlob)
-
PDFTextExtractor
public PDFTextExtractor(DocumentModel inDoc, String inXPath)
Constructor with aDocumentModel. The default value forinXPath(if passednullor "") isfile:content.- Parameters:
inDoc- Input DocumentModel.inXPath- Input XPath.
-
-
Method Detail
-
getAllExtractedLines
public String getAllExtractedLines() throws NuxeoException
- Throws:
NuxeoException
-
extractLineOf
public String extractLineOf(String inString) throws IOException
- Throws:
IOException
-
extractLastPartOfLine
public String extractLastPartOfLine(String string) throws IOException
- Throws:
IOException
-
setPassword
public void setPassword(String password)
-
-