Package net.webpdf.wsclient.openapi
Class OperationExtractionParagraphs
- java.lang.Object
-
- net.webpdf.wsclient.openapi.OperationExtractionParagraphs
-
public class OperationExtractionParagraphs extends Object
Extract text content of the PDF document, separated by paragraphs Generates an ASCII text, XML, or JSON file that will be returned as a result when the web service is called and that will contain all texts in the PDF document separated by paragraphs. In order for this to work, the paragraphs must be found in the PDF as elements. A purely visual separation will not work!
-
-
Nested Class Summary
Nested Classes Modifier and Type Class Description static classOperationExtractionParagraphs.FileFormatEnumUsed to define the output format for the PDF document text contents being extracted
-
Field Summary
Fields Modifier and Type Field Description static StringJSON_PROPERTY_FILE_FORMATstatic StringJSON_PROPERTY_PAGES
-
Constructor Summary
Constructors Constructor Description OperationExtractionParagraphs()
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description booleanequals(Object o)OperationExtractionParagraphsfileFormat(OperationExtractionParagraphs.FileFormatEnum fileFormat)@Nullable OperationExtractionParagraphs.FileFormatEnumgetFileFormat()Used to define the output format for the PDF document text contents being extracted@Nullable StringgetPages()Used to define which page(s) should be used for the extraction mode.inthashCode()OperationExtractionParagraphspages(String pages)voidsetFileFormat(OperationExtractionParagraphs.FileFormatEnum fileFormat)voidsetPages(String pages)StringtoString()
-
-
-
Field Detail
-
JSON_PROPERTY_FILE_FORMAT
public static final String JSON_PROPERTY_FILE_FORMAT
- See Also:
- Constant Field Values
-
JSON_PROPERTY_PAGES
public static final String JSON_PROPERTY_PAGES
- See Also:
- Constant Field Values
-
-
Method Detail
-
fileFormat
public OperationExtractionParagraphs fileFormat(OperationExtractionParagraphs.FileFormatEnum fileFormat)
-
getFileFormat
@Nullable public @Nullable OperationExtractionParagraphs.FileFormatEnum getFileFormat()
Used to define the output format for the PDF document text contents being extracted. * text = Text document * xml = XML document * json = JSON data structure- Returns:
- fileFormat
-
setFileFormat
public void setFileFormat(OperationExtractionParagraphs.FileFormatEnum fileFormat)
-
pages
public OperationExtractionParagraphs pages(String pages)
-
getPages
@Nullable public @Nullable String getPages()
Used to define which page(s) should be used for the extraction mode. The page number can be either an individual page, a page range, or a list (separated with commas) (e.g., \"1,5-6,9\"). A blank value or \"\\*\" selects all pages of the PDF document.- Returns:
- pages
-
setPages
public void setPages(String pages)
-
-