Package org.icij.datashare.text
Class Document
java.lang.Object
    org.icij.datashare.text.Document
All Implemented Interfaces:
    Serializable, Entity
public class Document extends Object implements Entity
See Also:
    Serialized Form
-
Nested Class Summary
Nested Classes
Modifier and Type    Class
static class         Document.Status
-
Constructor Summary
Constructors
Document(String id, Project project, Path filePath, String content, List<Map<String,String>> content_translated, Language language, Charset charset, String mimetype, Map<String,Object> metadata, Document.Status status, Set<Pipeline.Type> nerTags, Long contentLength)
Document(String id, Project project, Path filePath, String content, Language language, Charset charset, String mimetype, Map<String,Object> metadata, Document.Status status, Set<Pipeline.Type> nerTags, Long contentLength)
Document(Project project, String id, Path filePath, String content, Language language, Charset charset, String mimetype, Map<String,Object> metadata, Document.Status status, Set<Pipeline.Type> nerTags, Date extractionDate, String parentDocument, String rootDocument, Short extractionLevel, Long contentLength)
Document(Project project, Path filePath, String content, Language language, Charset charset, String mimetype, Map<String,Object> metadata, Document.Status status, Long contentLength)
Document(Project project, Path filePath, String content, Language language, Charset charset, String mimetype, Map<String,Object> metadata, Document.Status status, HashSet<Pipeline.Type> nerTags, Document parentDocument, Long contentLength)
Document(Project project, Path filePath, String content, Language language, Charset charset, String mimetype, Map<String,Object> metadata, Document.Status status, Set<Pipeline.Type> nerTags, Long contentLength)
-
Method Summary
All Methods (static and instance; all concrete)
Modifier and Type            Method
boolean                      equals(Object o)
static Set<Pipeline.Type>    fromNerMask(int mask)
String                       getContent()
Charset                      getContentEncoding()
Long                         getContentLength()
int                          getContentTextLength()
String                       getContentType()
Date                         getCreationDate()
Path                         getDirname()
Date                         getExtractionDate()
short                        getExtractionLevel()
String                       getId()
Language                     getLanguage()
Map<String,Object>           getMetadata()
String                       getName()
short                        getNerMask()
Set<Pipeline.Type>           getNerTags()
String                       getParentDocument()
Path                         getPath()
Project                      getProject()
String                       getProjectId()
String                       getRootDocument()
Document.Status              getStatus()
Set<Tag>                     getTags()
int                          hashCode()
boolean                      isRootDocument()
String                       toString()
-
Constructor Detail
-
Document
public Document(Project project, Path filePath, String content, Language language, Charset charset, String mimetype, Map<String,Object> metadata, Document.Status status, Long contentLength)
-
Document
public Document(Project project, String id, Path filePath, String content, Language language, Charset charset, String mimetype, Map<String,Object> metadata, Document.Status status, Set<Pipeline.Type> nerTags, Date extractionDate, String parentDocument, String rootDocument, Short extractionLevel, Long contentLength)
-
Document
public Document(Project project, Path filePath, String content, Language language, Charset charset, String mimetype, Map<String,Object> metadata, Document.Status status, Set<Pipeline.Type> nerTags, Long contentLength)
-
Document
public Document(String id, Project project, Path filePath, String content, Language language, Charset charset, String mimetype, Map<String,Object> metadata, Document.Status status, Set<Pipeline.Type> nerTags, Long contentLength)
-
Document
public Document(Project project, Path filePath, String content, Language language, Charset charset, String mimetype, Map<String,Object> metadata, Document.Status status, HashSet<Pipeline.Type> nerTags, Document parentDocument, Long contentLength)
-
Method Detail
-
getProject
public Project getProject()
-
getProjectId
public String getProjectId()
-
getContent
public String getContent()
-
getContentTextLength
public int getContentTextLength()
-
getPath
public Path getPath()
-
getDirname
public Path getDirname()
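The page gives no descriptions for getPath() and getDirname(), so the following is only a sketch of one plausible relationship between them, using plain java.nio.file calls. It assumes getDirname() is the parent directory of the file path and getName() is its last element; the class PathAccessorsSketch and its helpers dirname and name are hypothetical and not part of the documented API.

```java
import java.nio.file.Path;
import java.nio.file.Paths;

public class PathAccessorsSketch {
    // Hypothetical stand-in for getDirname(): the parent directory of the file path (assumption).
    static Path dirname(Path filePath) {
        return filePath.getParent();
    }

    // Hypothetical stand-in for getName(): the last path element as a string (assumption).
    static String name(Path filePath) {
        return filePath.getFileName().toString();
    }

    public static void main(String[] args) {
        Path filePath = Paths.get("docs", "reports", "report.pdf");
        System.out.println(dirname(filePath)); // the "docs/reports" directory, printed with the platform separator
        System.out.println(name(filePath));    // prints report.pdf
    }
}
```

Note that Path.getParent() returns null for a single-element relative path, so a caller relying on this reading would need to handle that case.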
-
getExtractionDate
public Date getExtractionDate()
-
getContentEncoding
public Charset getContentEncoding()
-
getContentLength
public Long getContentLength()
-
getContentType
public String getContentType()
-
getLanguage
public Language getLanguage()
-
getExtractionLevel
public short getExtractionLevel()
-
getRootDocument
public String getRootDocument()
-
isRootDocument
public boolean isRootDocument()
-
getParentDocument
public String getParentDocument()
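The accessors getRootDocument(), isRootDocument(), getParentDocument() and getExtractionLevel() suggest a hierarchy of embedded documents addressed by String ids. The sketch below models that idea under stated assumptions: a top-level file is its own root, has no parent, and sits at level 0, while each embedded document is one level deeper than its container. The class DocumentHierarchySketch, its Node type and the embed helper are hypothetical; the actual behavior of Document is not described on this page.

```java
import java.util.Objects;

public class DocumentHierarchySketch {
    // Hypothetical minimal model holding ids only, mirroring the String-typed accessors.
    static final class Node {
        final String id;
        final String parentDocument; // id of the direct container; null for a top-level file (assumption)
        final String rootDocument;   // id of the top-level file (assumption)
        final short extractionLevel; // 0 for a top-level file (assumption)

        Node(String id, String parentDocument, String rootDocument, short extractionLevel) {
            this.id = id;
            this.parentDocument = parentDocument;
            this.rootDocument = rootDocument;
            this.extractionLevel = extractionLevel;
        }

        // Assumed rule: a document is a root when it is its own root.
        boolean isRootDocument() {
            return Objects.equals(rootDocument, id);
        }
    }

    // Creates a top-level document that is its own root.
    static Node root(String id) {
        return new Node(id, null, id, (short) 0);
    }

    // Creates a document embedded in the given container, one level deeper.
    static Node embed(Node container, String id) {
        return new Node(id, container.id, container.rootDocument,
                (short) (container.extractionLevel + 1));
    }

    public static void main(String[] args) {
        Node archive = root("archive");
        Node email = embed(archive, "email");
        Node attachment = embed(email, "attachment");
        System.out.println(archive.isRootDocument());    // prints true
        System.out.println(attachment.isRootDocument()); // prints false
        System.out.println(attachment.rootDocument);     // prints archive
        System.out.println(attachment.extractionLevel);  // prints 2
    }
}
```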
-
getStatus
public Document.Status getStatus()
-
getNerTags
public Set<Pipeline.Type> getNerTags()
-
getCreationDate
public Date getCreationDate()
-
getNerMask
public short getNerMask()
-
getName
public String getName()
-
fromNerMask
public static Set<Pipeline.Type> fromNerMask(int mask)
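The pairing of getNerMask() (a short) with fromNerMask(int mask) (a Set<Pipeline.Type>) suggests that the set of NER pipeline types is encoded as a bitmask. The following sketch illustrates that general technique with a hypothetical enum NerType standing in for Pipeline.Type; the enum constants, their bit values and the helpers toMask and fromMask are assumptions for illustration, not the library's actual values or implementation.

```java
import java.util.EnumSet;
import java.util.Set;

public class NerMaskSketch {
    // Hypothetical stand-in for Pipeline.Type; names and bit values are illustrative only.
    enum NerType {
        ALPHA(1), BETA(2), GAMMA(4);

        final int mask;

        NerType(int mask) {
            this.mask = mask;
        }
    }

    // Encodes a set of types as a bitmask by OR-ing each type's bit.
    static short toMask(Set<NerType> tags) {
        int mask = 0;
        for (NerType tag : tags) {
            mask |= tag.mask;
        }
        return (short) mask;
    }

    // Decodes a bitmask back into the set of types whose bit is set.
    static Set<NerType> fromMask(int mask) {
        Set<NerType> tags = EnumSet.noneOf(NerType.class);
        for (NerType tag : NerType.values()) {
            if ((mask & tag.mask) == tag.mask) {
                tags.add(tag);
            }
        }
        return tags;
    }

    public static void main(String[] args) {
        Set<NerType> tags = EnumSet.of(NerType.ALPHA, NerType.GAMMA);
        short mask = toMask(tags);
        System.out.println(mask);           // prints 5
        System.out.println(fromMask(mask)); // prints [ALPHA, GAMMA]
    }
}
```

Under this encoding the mask round-trips: decoding the value produced by toMask yields the original set, which is why a compact short can be stored in place of the full set.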
-