Class RddRdfWriter2
java.lang.Object
net.sansa_stack.spark.io.rdf.output.RddRdfWriter2
Core class for configuration and execution of writing RDDs of RDF out using Hadaop.
An RDD's number splits into the hadoop conf and the OutputFormat uses it to decide
whether to output header / footer data blocks on the first / last partitions.
-
Field Summary
FieldsModifier and TypeFieldDescriptionprotected booleanprotected org.apache.jena.shared.PrefixMappingprotected org.apache.jena.riot.RDFFormat -
Constructor Summary
ConstructorsConstructorDescriptionRddRdfWriter2(org.apache.jena.riot.RDFFormat rdfFormat, boolean mapQuadsToTriplesForTripleLangs, org.apache.jena.shared.PrefixMapping prefixes) -
Method Summary
Modifier and TypeMethodDescriptionprotected org.apache.hadoop.conf.ConfigurationbuildConfiguration(org.apache.spark.rdd.RDD<?> rdd) protected voidconfigure(org.apache.hadoop.conf.Configuration conf) static <T> org.apache.spark.api.java.JavaPairRDD<Long,T> toPairRdd(org.apache.spark.api.java.JavaRDD<T> rdd) voidwriteQuads(org.apache.spark.rdd.RDD<org.apache.jena.sparql.core.Quad> rdd, org.apache.hadoop.fs.Path path) voidwriteTriples(org.apache.spark.rdd.RDD<org.apache.jena.graph.Triple> rdd, org.apache.hadoop.fs.Path path)
-
Field Details
-
rdfFormat
protected org.apache.jena.riot.RDFFormat rdfFormat -
mapQuadsToTriplesForTripleLangs
protected boolean mapQuadsToTriplesForTripleLangs -
prefixes
protected org.apache.jena.shared.PrefixMapping prefixes
-
-
Constructor Details
-
Method Details
-
writeTriples
public void writeTriples(org.apache.spark.rdd.RDD<org.apache.jena.graph.Triple> rdd, org.apache.hadoop.fs.Path path) -
writeQuads
public void writeQuads(org.apache.spark.rdd.RDD<org.apache.jena.sparql.core.Quad> rdd, org.apache.hadoop.fs.Path path) -
buildConfiguration
protected org.apache.hadoop.conf.Configuration buildConfiguration(org.apache.spark.rdd.RDD<?> rdd) -
configure
protected void configure(org.apache.hadoop.conf.Configuration conf) -
toPairRdd
public static <T> org.apache.spark.api.java.JavaPairRDD<Long,T> toPairRdd(org.apache.spark.api.java.JavaRDD<T> rdd)
-