Class RddRdfWriterSettings<SELF extends RddRdfWriterSettings>
java.lang.Object
net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings<SELF>
- Direct Known Subclasses:
RddRdfWriter,RddRdfWriterFactory
-
Field Summary
FieldsModifier and TypeFieldDescriptionprotected booleanprotected Supplier<OutputStream>protected longOnly for console output: Instead of writing tuples out immediatly, collect up to this number of tuples in order to derive the used prefixes.protected booleanprotected org.apache.jena.shared.PrefixMappingprotected booleanWhether to convert quads to triples if a triple-based output format is requestedprotected org.apache.jena.riot.RDFFormatprotected org.apache.hadoop.fs.Pathprotected booleanprotected RdfPostProcessingSettingsMutableprotected org.apache.hadoop.fs.Pathprotected booleanprotected boolean -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionconfigureFrom(RddRdfWriterSettings<?> other) org.apache.jena.riot.RDFFormatorg.apache.jena.shared.PrefixMappingorg.apache.jena.riot.RDFFormatorg.apache.hadoop.fs.Pathorg.apache.hadoop.fs.PathbooleanbooleanIf neither partition folder nor targe file is set the output goes to the consolebooleanbooleanbooleanbooleanbooleanprotected SELFself()setAllowOverwriteFiles(boolean allowOverwriteFiles) setConsoleOutSupplier(Supplier<OutputStream> consoleOutSupplier) setDeferOutputForUsedPrefixes(long deferOutputForUsedPrefixes) setDeletePartitionFolderAfterMerge(boolean deletePartitionFolderAfterMerge) setGlobalPrefixMapping(Map<String, String> globalPrefixMap) setGlobalPrefixMapping(org.apache.jena.shared.PrefixMapping globalPrefixMapping) Set a prefix mapping to be used "globally" across all partitions.setMapQuadsToTriplesForTripleLangs(boolean mapQuadsToTriplesForTripleLangs) Whether to convert quads to triples if a triple-based output format is requested Jena by default discards any quad outside of the default graph when writing to a triple format.setOutputFormat(String formatName) Raises an exception if the format is not foundsetOutputFormat(org.apache.jena.riot.RDFFormat format) setPartitionFolder(String partitionFolder) setPartitionFolder(org.apache.hadoop.fs.Path partitionFolder) setPartitionsAsIndependentFiles(boolean partitionsAsIndependentFiles) setPostProcessingSettings(RdfPostProcessingSettingsMutable postProcessingSettings) setTargetFile(String targetFile) setTargetFile(org.apache.hadoop.fs.Path targetFile) voidsetUseCoalesceOne(boolean useCoalesceOne) setUseElephas(boolean useElephas)
-
Field Details
-
partitionFolder
protected org.apache.hadoop.fs.Path partitionFolder -
targetFile
protected org.apache.hadoop.fs.Path targetFile -
useCoalesceOne
protected boolean useCoalesceOne -
deletePartitionFolderAfterMerge
protected boolean deletePartitionFolderAfterMerge -
globalPrefixMapping
protected org.apache.jena.shared.PrefixMapping globalPrefixMapping -
outputFormat
protected org.apache.jena.riot.RDFFormat outputFormat -
allowOverwriteFiles
protected boolean allowOverwriteFiles -
useElephas
protected boolean useElephas -
partitionsAsIndependentFiles
protected boolean partitionsAsIndependentFiles -
mapQuadsToTriplesForTripleLangs
protected boolean mapQuadsToTriplesForTripleLangsWhether to convert quads to triples if a triple-based output format is requested -
postProcessingSettings
-
deferOutputForUsedPrefixes
protected long deferOutputForUsedPrefixesOnly for console output: Instead of writing tuples out immediatly, collect up to this number of tuples in order to derive the used prefixes. Upon reaching this threshold, print out all seen prefixes and emit the held-back data as well as any further data immediately -
consoleOutSupplier
-
-
Constructor Details
-
RddRdfWriterSettings
public RddRdfWriterSettings()
-
-
Method Details
-
isMapQuadsToTriplesForTripleLangs
public boolean isMapQuadsToTriplesForTripleLangs() -
self
-
configureFrom
-
setMapQuadsToTriplesForTripleLangs
Whether to convert quads to triples if a triple-based output format is requested Jena by default discards any quad outside of the default graph when writing to a triple format. Setting this flag to true will map each quad in a named graph to the default graph. -
isUseCoalesceOne
public boolean isUseCoalesceOne() -
setUseCoalesceOne
public void setUseCoalesceOne(boolean useCoalesceOne) -
isDeletePartitionFolderAfterMerge
public boolean isDeletePartitionFolderAfterMerge() -
setDeletePartitionFolderAfterMerge
-
getGlobalPrefixMapping
public org.apache.jena.shared.PrefixMapping getGlobalPrefixMapping() -
getPartitionFolder
public org.apache.hadoop.fs.Path getPartitionFolder() -
setPartitionFolder
-
setPartitionFolder
-
getTargetFile
public org.apache.hadoop.fs.Path getTargetFile() -
setTargetFile
-
setTargetFile
-
setGlobalPrefixMapping
-
getOutputFormat
public org.apache.jena.riot.RDFFormat getOutputFormat() -
setOutputFormat
-
setOutputFormat
Raises an exception if the format is not found -
getFallbackOutputFormat
public org.apache.jena.riot.RDFFormat getFallbackOutputFormat() -
isAllowOverwriteFiles
public boolean isAllowOverwriteFiles() -
setAllowOverwriteFiles
-
isUseElephas
public boolean isUseElephas() -
setUseElephas
-
isPartitionsAsIndependentFiles
public boolean isPartitionsAsIndependentFiles() -
setPartitionsAsIndependentFiles
-
setDeferOutputForUsedPrefixes
-
isConsoleOutput
public boolean isConsoleOutput()If neither partition folder nor targe file is set the output goes to the console -
setConsoleOutput
-
setConsoleOutSupplier
-
getConsoleOutSupplier
-
getPostProcessingSettings
-
setPostProcessingSettings
-