Class BaseExtractionType

  • Direct Known Subclasses:
    ExtractionInfoType, ExtractionLinksType, ExtractionParagraphsType, ExtractionTextType, ExtractionWordsType

    public class BaseExtractionType
    extends Object
     <?xml version="1.0" encoding="UTF-8"?><p xmlns:tns="http://schema.webpdf.de/1.0/operation" xmlns:xs="http://www.w3.org/2001/XMLSchema">General attributes for the content elements.</p>
     

    Java class for BaseExtractionType complex type

    .

    The following schema fragment specifies the expected content contained within this class.

    
     <complexType name="BaseExtractionType">
       <complexContent>
         <restriction base="{http://www.w3.org/2001/XMLSchema}anyType">
           <attribute name="fileFormat" default="xml">
             <simpleType>
               <restriction base="{http://schema.webpdf.de/1.0/operation}ExtractionFileFormatType">
                 <enumeration value="json"/>
                 <enumeration value="text"/>
                 <enumeration value="xml"/>
               </restriction>
             </simpleType>
           </attribute>
           <attribute name="pages" type="{http://www.w3.org/2001/XMLSchema}string" default="" />
         </restriction>
       </complexContent>
     </complexType>
     
    • Field Detail

      • fileFormat

        protected ExtractionFileFormatType fileFormat
         <?xml version="1.0" encoding="UTF-8"?><p xmlns:tns="http://schema.webpdf.de/1.0/operation" xmlns:xs="http://www.w3.org/2001/XMLSchema">Used to define the output format for the PDF document text contents being extracted.
                                 <ul><li>text = Text document</li><li>xml = XML document</li><li>json = JSON data structure</li></ul></p>
         
      • pages

        protected String pages
         <?xml version="1.0" encoding="UTF-8"?><p xmlns:tns="http://schema.webpdf.de/1.0/operation" xmlns:xs="http://www.w3.org/2001/XMLSchema">Used to define which page(s) should be used for the extraction mode. The page number can be either an individual page, a page range, or a list (separated with commas) (e.g., "1,5-6,9"). A blank value or "*" selects all pages of the PDF document.</p>
         
    • Constructor Detail

      • BaseExtractionType

        public BaseExtractionType()
    • Method Detail

      • getFileFormat

        public ExtractionFileFormatType getFileFormat()
         <?xml version="1.0" encoding="UTF-8"?><p xmlns:tns="http://schema.webpdf.de/1.0/operation" xmlns:xs="http://www.w3.org/2001/XMLSchema">Used to define the output format for the PDF document text contents being extracted.
                                 <ul><li>text = Text document</li><li>xml = XML document</li><li>json = JSON data structure</li></ul></p>
         
        Returns:
        possible object is ExtractionFileFormatType
      • isSetFileFormat

        public boolean isSetFileFormat()
      • getPages

        public String getPages()
         <?xml version="1.0" encoding="UTF-8"?><p xmlns:tns="http://schema.webpdf.de/1.0/operation" xmlns:xs="http://www.w3.org/2001/XMLSchema">Used to define which page(s) should be used for the extraction mode. The page number can be either an individual page, a page range, or a list (separated with commas) (e.g., "1,5-6,9"). A blank value or "*" selects all pages of the PDF document.</p>
         
        Returns:
        possible object is String
      • setPages

        public void setPages​(String value)
        Sets the value of the pages property.
        Parameters:
        value - allowed object is String
        See Also:
        getPages()
      • isSetPages

        public boolean isSetPages()