Encapsulates a DataFrame created after stratified sampling.
Encapsulates an RDD over all the cached samples for a sampled table.
Encapsulates an RDD over all the cached samples for a sampled table. Parallelizes execution using the hashmap segment configuration on the nodes with each partition handling one or more segments of the hashmap on a node.
Snappy SQL extensions.
Snappy SQL extensions. Includes:
Stratified sample tables: 1) ERROR ESTIMATE AVG: error estimate for mean of a column/expression 2) ERROR ESTIMATE SUM: error estimate for sum of a column/expression