java.lang.Object
org.hippoecm.frontend.editor.plugins.resource.PdfParser

public class PdfParser extends Object
  • Method Details

    • parse

      public static String parse(InputStream inputStream)
      Parameters:
      inputStream - the PDF input stream to parse; this method closes the stream when parsing finishes
      Returns:
      the first 100*1000 (100,000) characters of text extracted from the PDF, as a String. See also Tika.setMaxStringLength(int)
      Throws:
      IllegalStateException - if a TikaException or IOException occurs while parsing
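
The contract described above (the stream is closed by the callee, the result is capped at 100*1000 characters, and checked exceptions surface as IllegalStateException) can be illustrated with a small stand-in. This is only a sketch under stated assumptions: `ParseContractSketch` and `MAX_CHARS` are hypothetical names, the code decodes plain UTF-8 text instead of extracting text from a PDF, and it does not use Tika, so it is not the actual PdfParser implementation.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;
import java.nio.charset.StandardCharsets;

// Hypothetical stand-in that mirrors the documented contract of PdfParser.parse.
// Assumption: it reads UTF-8 text, not PDF content, and does not call Tika.
final class ParseContractSketch {

    // Assumed cap, taken from the "100*1000 chars" stated in the Returns section.
    static final int MAX_CHARS = 100 * 1000;

    static String parse(InputStream inputStream) {
        // try-with-resources closes the reader, which also closes inputStream,
        // so the caller must not reuse the stream afterwards.
        try (Reader reader = new InputStreamReader(inputStream, StandardCharsets.UTF_8)) {
            StringBuilder text = new StringBuilder();
            char[] buffer = new char[8192];
            while (text.length() < MAX_CHARS) {
                // Never request more characters than the cap still allows.
                int wanted = Math.min(buffer.length, MAX_CHARS - text.length());
                int read = reader.read(buffer, 0, wanted);
                if (read == -1) {
                    break; // end of stream
                }
                text.append(buffer, 0, read);
            }
            return text.toString();
        } catch (IOException e) {
            // Checked exception is rethrown unchecked, as the Throws section describes.
            throw new IllegalStateException(e);
        }
    }
}
```

A caller of a method with this contract does not need its own try-with-resources around the stream, but it should be prepared to catch IllegalStateException if a parsing failure must not abort the surrounding operation.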