-
filenamePrefix
String filenamePrefix
The prefix of the files to create inside the HDFS directory, i.e. a value of "topicA"
will result in files named topicA-, topicA-, etc being produced
-
fileExtension
String fileExtension
The extension to add to the files written to HDFS, e.g. '.txt', '.seq', etc.
-
separator
char separator
The character to use to separate records in a text file. If no value is provided
then the content from all of the records will be concatenated together in one continuous
byte array.
-
syncInterval
long syncInterval
The interval (in milliseconds) between calls to flush data to HDFS disk.
-
maxPendingRecords
int maxPendingRecords
The maximum number of records that we hold in memory before acking. Default is Integer.MAX_VALUE.
Setting this value to one, results in every record being sent to disk before the record is acked,
while setting it to a higher values allows us to buffer records before flushing them all to disk.