|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectorg.apache.hadoop.hive.ql.exec.tez.DagUtils
public class DagUtils
DagUtils. DagUtils is a collection of helper methods to convert map and reduce work to tez vertices and edges. It handles configuration objects, file localization and vertex/edge creation.
| Method Summary | |
|---|---|
void |
addCredentials(BaseWork work,
org.apache.tez.dag.api.DAG dag)
Set up credentials for the base work on secure clusters |
org.apache.hadoop.mapred.JobConf |
createConfiguration(HiveConf hiveConf)
Creates and initializes a JobConf object that can be used to execute the DAG. |
org.apache.tez.dag.api.Edge |
createEdge(org.apache.hadoop.mapred.JobConf vConf,
org.apache.tez.dag.api.Vertex v,
org.apache.hadoop.mapred.JobConf wConf,
org.apache.tez.dag.api.Vertex w,
TezEdgeProperty edgeProp)
Given two vertices and their respective configuration objects createEdge will create an Edge object that connects the two. |
org.apache.tez.dag.api.GroupInputEdge |
createEdge(org.apache.tez.dag.api.VertexGroup group,
org.apache.hadoop.mapred.JobConf wConf,
org.apache.tez.dag.api.Vertex w,
TezEdgeProperty edgeProp)
Given a Vertex group and a vertex createEdge will create an Edge between them. |
org.apache.tez.client.PreWarmContext |
createPreWarmContext(org.apache.tez.client.TezSessionConfiguration sessionConfig,
int numContainers,
Map<String,org.apache.hadoop.yarn.api.records.LocalResource> localResources)
|
org.apache.hadoop.fs.Path |
createTezDir(org.apache.hadoop.fs.Path scratchDir,
org.apache.hadoop.conf.Configuration conf)
createTezDir creates a temporary directory in the scratchDir folder to be used with Tez. |
org.apache.tez.dag.api.Vertex |
createVertex(org.apache.hadoop.mapred.JobConf conf,
BaseWork work,
org.apache.hadoop.fs.Path scratchDir,
org.apache.hadoop.yarn.api.records.LocalResource appJarLr,
List<org.apache.hadoop.yarn.api.records.LocalResource> additionalLr,
org.apache.hadoop.fs.FileSystem fileSystem,
Context ctx,
boolean hasChildren,
TezWork tezWork)
Create a vertex from a given work object. |
String |
getBaseName(org.apache.hadoop.yarn.api.records.LocalResource lr)
|
org.apache.hadoop.fs.Path |
getDefaultDestDir(org.apache.hadoop.conf.Configuration conf)
|
String |
getExecJarPathLocal()
|
org.apache.hadoop.fs.FileStatus |
getHiveJarDirectory(org.apache.hadoop.conf.Configuration conf)
|
static DagUtils |
getInstance()
Singleton |
String |
getResourceBaseName(org.apache.hadoop.fs.Path path)
|
static String[] |
getTempFilesFromConf(org.apache.hadoop.conf.Configuration conf)
|
org.apache.hadoop.fs.Path |
getTezDir(org.apache.hadoop.fs.Path scratchDir)
Gets the tez dir that belongs to the hive scratch dir |
org.apache.hadoop.mapred.JobConf |
initializeVertexConf(org.apache.hadoop.mapred.JobConf conf,
BaseWork work)
Creates and initializes the JobConf object for a given BaseWork object. |
org.apache.hadoop.yarn.api.records.LocalResource |
localizeResource(org.apache.hadoop.fs.Path src,
org.apache.hadoop.fs.Path dest,
org.apache.hadoop.conf.Configuration conf)
|
List<org.apache.hadoop.yarn.api.records.LocalResource> |
localizeTempFiles(String hdfsDirPathStr,
org.apache.hadoop.conf.Configuration conf,
String[] inputOutputJars)
Localizes files, archives and jars from a provided array of names. |
List<org.apache.hadoop.yarn.api.records.LocalResource> |
localizeTempFilesFromConf(String hdfsDirPathStr,
org.apache.hadoop.conf.Configuration conf)
Localizes files, archives and jars the user has instructed us to provide on the cluster as resources for execution. |
void |
updateConfigurationForEdge(org.apache.hadoop.mapred.JobConf vConf,
org.apache.tez.dag.api.Vertex v,
org.apache.hadoop.mapred.JobConf wConf,
org.apache.tez.dag.api.Vertex w)
Given two vertices a, b update their configurations to be used in an Edge a-b |
static org.apache.hadoop.fs.FileStatus |
validateTargetDir(org.apache.hadoop.fs.Path path,
org.apache.hadoop.conf.Configuration conf)
|
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Method Detail |
|---|
public org.apache.tez.dag.api.GroupInputEdge createEdge(org.apache.tez.dag.api.VertexGroup group,
org.apache.hadoop.mapred.JobConf wConf,
org.apache.tez.dag.api.Vertex w,
TezEdgeProperty edgeProp)
throws IOException
group - The parent VertexGroupwConf - The job conf of the child vertexw - The child vertexedgeProp - the edge property of connection between the two
endpoints.
IOException
public void updateConfigurationForEdge(org.apache.hadoop.mapred.JobConf vConf,
org.apache.tez.dag.api.Vertex v,
org.apache.hadoop.mapred.JobConf wConf,
org.apache.tez.dag.api.Vertex w)
throws IOException
IOException
public org.apache.tez.dag.api.Edge createEdge(org.apache.hadoop.mapred.JobConf vConf,
org.apache.tez.dag.api.Vertex v,
org.apache.hadoop.mapred.JobConf wConf,
org.apache.tez.dag.api.Vertex w,
TezEdgeProperty edgeProp)
throws IOException
vConf - JobConf of the first vertexv - The first vertex (source)wConf - JobConf of the second vertexw - The second vertex (sink)
IOException
public org.apache.tez.client.PreWarmContext createPreWarmContext(org.apache.tez.client.TezSessionConfiguration sessionConfig,
int numContainers,
Map<String,org.apache.hadoop.yarn.api.records.LocalResource> localResources)
throws IOException,
org.apache.tez.dag.api.TezException
sessionConfig - session configurationnumContainers - number of containers to pre-warmlocalResources - additional resources to pre-warm with
IOException
org.apache.tez.dag.api.TezException
public org.apache.hadoop.fs.Path getDefaultDestDir(org.apache.hadoop.conf.Configuration conf)
throws LoginException,
IOException
conf -
LoginException - if we are unable to figure user information
IOException - when any dfs operation fails.
public List<org.apache.hadoop.yarn.api.records.LocalResource> localizeTempFilesFromConf(String hdfsDirPathStr,
org.apache.hadoop.conf.Configuration conf)
throws IOException,
LoginException
conf -
IOException - when hdfs operation fails
LoginException - when getDefaultDestDir fails with the same exceptionpublic static String[] getTempFilesFromConf(org.apache.hadoop.conf.Configuration conf)
public List<org.apache.hadoop.yarn.api.records.LocalResource> localizeTempFiles(String hdfsDirPathStr,
org.apache.hadoop.conf.Configuration conf,
String[] inputOutputJars)
throws IOException,
LoginException
hdfsDirPathStr - Destination directoty in HDFS.conf - Configuration.inputOutputJars - The file names to localize.
IOException - when hdfs operation fails.
LoginException - when getDefaultDestDir fails with the same exception
public org.apache.hadoop.fs.FileStatus getHiveJarDirectory(org.apache.hadoop.conf.Configuration conf)
throws IOException,
LoginException
IOException
LoginException
public static org.apache.hadoop.fs.FileStatus validateTargetDir(org.apache.hadoop.fs.Path path,
org.apache.hadoop.conf.Configuration conf)
throws IOException
IOException
public String getExecJarPathLocal()
throws URISyntaxException
URISyntaxExceptionpublic String getBaseName(org.apache.hadoop.yarn.api.records.LocalResource lr)
public String getResourceBaseName(org.apache.hadoop.fs.Path path)
pathStr - - the string from which we try to determine the resource base name
public org.apache.hadoop.yarn.api.records.LocalResource localizeResource(org.apache.hadoop.fs.Path src,
org.apache.hadoop.fs.Path dest,
org.apache.hadoop.conf.Configuration conf)
throws IOException
src - path to the source for the resourcedest - path in hdfs for the resourceconf -
IOException - when any file system related calls fails.
public org.apache.hadoop.mapred.JobConf createConfiguration(HiveConf hiveConf)
throws IOException
hiveConf - Current hiveConf for the execution
IOException
public org.apache.hadoop.mapred.JobConf initializeVertexConf(org.apache.hadoop.mapred.JobConf conf,
BaseWork work)
conf - Any configurations in conf will be copied to the resulting new JobConf object.work - BaseWork will be used to populate the configuration object.
public org.apache.tez.dag.api.Vertex createVertex(org.apache.hadoop.mapred.JobConf conf,
BaseWork work,
org.apache.hadoop.fs.Path scratchDir,
org.apache.hadoop.yarn.api.records.LocalResource appJarLr,
List<org.apache.hadoop.yarn.api.records.LocalResource> additionalLr,
org.apache.hadoop.fs.FileSystem fileSystem,
Context ctx,
boolean hasChildren,
TezWork tezWork)
throws Exception
conf - JobConf to be used to this execution unitwork - The instance of BaseWork representing the actual work to be performed
by this vertex.scratchDir - HDFS scratch dir for this execution unit.list - appJarLr - Local resource for hive-exec.additionalLr - fileSystem - FS corresponding to scratchDir and LocalResourcesctx - This query's context
Exception
public void addCredentials(BaseWork work,
org.apache.tez.dag.api.DAG dag)
public org.apache.hadoop.fs.Path createTezDir(org.apache.hadoop.fs.Path scratchDir,
org.apache.hadoop.conf.Configuration conf)
throws IOException
IOExceptionpublic org.apache.hadoop.fs.Path getTezDir(org.apache.hadoop.fs.Path scratchDir)
public static DagUtils getInstance()
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||