Class PdfTextExtractor

java.lang.Object
com.lowagie.text.pdf.parser.PdfTextExtractor

public class PdfTextExtractor extends Object
Extracts text from a PDF file.
Since:
2.1.4
  • Constructor Details

    • PdfTextExtractor

      public PdfTextExtractor(PdfReader reader)
      Creates a new Text Extractor object.
      Parameters:
      reader - the PdfReader for the PDF document to extract text from
  • Method Details

    • getTextFromPage

      public String getTextFromPage(int page) throws IOException
      Gets the text from a page.
      Parameters:
      page - the number of the page to extract text from (1-based)
      Returns:
      a String with the content as plain text (without PDF syntax)
      Throws:
      IOException - if the page content cannot be read
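
A minimal sketch of collecting the text of every page by calling getTextFromPage once per page. It is not part of the library: PageTextJoiner, PageTextSource and extractAll are hypothetical names introduced here, and the loop assumes page numbers start at 1. The interface mirrors the signature of getTextFromPage(int), so the sketch compiles and runs without the PDF library on the classpath.

```java
import java.io.IOException;

/** Hypothetical helper (not part of the library): joins the text of all pages. */
class PageTextJoiner {

    /** Same shape as PdfTextExtractor.getTextFromPage(int). */
    @FunctionalInterface
    interface PageTextSource {
        String getTextFromPage(int page) throws IOException;
    }

    /** Concatenates the text of pages 1..pageCount, one page per line block. */
    static String extractAll(PageTextSource source, int pageCount) throws IOException {
        StringBuilder text = new StringBuilder();
        // Assumption: page numbers are 1-based.
        for (int page = 1; page <= pageCount; page++) {
            text.append(source.getTextFromPage(page)).append('\n');
        }
        return text.toString();
    }

    public static void main(String[] args) throws IOException {
        // Stub source so the sketch runs without a PDF file at hand.
        String all = extractAll(page -> "text of page " + page, 2);
        System.out.print(all);
    }
}
```

With the library available, a PdfTextExtractor built from a PdfReader could presumably be plugged in as extractAll(extractor::getTextFromPage, reader.getNumberOfPages()), since the method reference matches the signature documented above.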