|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object cascading.flow.BaseFlow<JobConf> cascading.flow.hadoop.HadoopFlow
public class HadoopFlow
Class HadoopFlow is the Apache Hadoop specific implementation of a Flow
.
HadoopFlowConnector
instance.
If classpath paths are provided on the FlowDef
, the Hadoop distributed cache mechanism will be used
to augment the remote classpath.
Any path elements that are relative will be uploaded to HDFS, and the HDFS URI will be used on the JobConf. Note
all paths are added as "files" to the JobConf, not archives, so they aren't needlessly uncompressed cluster side.
HadoopFlowConnector
Nested Class Summary |
---|
Nested classes/interfaces inherited from class cascading.flow.BaseFlow |
---|
cascading.flow.BaseFlow.FlowHolder |
Field Summary |
---|
Fields inherited from class cascading.flow.BaseFlow |
---|
flowStats, sinks, sources, stop, stopJobsOnExit, thread |
Fields inherited from interface cascading.flow.Flow |
---|
CASCADING_FLOW_ID |
Constructor Summary | |
---|---|
protected |
HadoopFlow()
|
|
HadoopFlow(cascading.flow.planner.PlatformInfo platformInfo,
Map<Object,Object> properties,
JobConf jobConf,
cascading.flow.FlowDef flowDef)
|
protected |
HadoopFlow(cascading.flow.planner.PlatformInfo platformInfo,
Map<Object,Object> properties,
JobConf jobConf,
String name)
|
Method Summary | |
---|---|
JobConf |
getConfig()
|
Map<Object,Object> |
getConfigAsProperties()
|
JobConf |
getConfigCopy()
|
cascading.flow.FlowProcess<JobConf> |
getFlowProcess()
|
protected int |
getMaxNumParallelSteps()
|
String |
getProperty(String key)
Method getProperty returns the value associated with the given key from the underlying properties system. |
protected void |
initConfig(Map<Object,Object> properties,
JobConf parentConfig)
|
protected void |
initFromProperties(Map<Object,Object> properties)
|
protected void |
internalClean(boolean stop)
|
protected void |
internalShutdown()
|
protected void |
internalStart()
|
boolean |
isPreserveTemporaryFiles()
Method isPreserveTemporaryFiles returns false if temporary files will be cleaned when this Flow completes. |
protected JobConf |
newConfig(JobConf defaultConfig)
|
protected void |
setConfigProperty(JobConf config,
Object key,
Object value)
|
boolean |
stepsAreLocal()
|
Methods inherited from class cascading.flow.BaseFlow |
---|
addListener, addStepListener, areSinksStale, areSourcesNewer, cleanup, complete, createConfig, createFlowThread, deleteCheckpointsIfNotUpdate, deleteCheckpointsIfReplace, deleteSinks, deleteSinksIfNotUpdate, deleteSinksIfReplace, deleteTrapsIfNotUpdate, deleteTrapsIfReplace, fireOnCompleted, fireOnStarting, fireOnStopping, fireOnThrowable, getCascadeID, getCascadingServices, getCheckpointNames, getCheckpoints, getCheckpointsCollection, getClassPath, getFieldsFor, getFlowSession, getFlowSkipStrategy, getFlowStats, getFlowSteps, getFlowStepStrategy, getHolder, getID, getName, getPlatformInfo, getRunID, getSink, getSink, getSinkModified, getSinkNames, getSinks, getSinksCollection, getSource, getSourceNames, getSources, getSourcesCollection, getSpawnStrategy, getStats, getSubmitPriority, getTags, getTrapNames, getTraps, getTrapsCollection, handleExecutorShutdown, hasListeners, hasStepListeners, initialize, initializeNewJobsMap, initSteps, internalStopAllJobs, isSkipFlow, isStopJobsOnExit, logInfo, openSink, openSink, openSource, openSource, openTapForRead, openTapForWrite, openTrap, openTrap, prepare, presentSinkFields, presentSourceFields, registerShutdownHook, removeListener, removeStepListener, resourceExists, retrieveSinkFields, retrieveSourceFields, setCascade, setCheckpoints, setFlowSkipStrategy, setFlowStepGraph, setFlowStepStrategy, setName, setSinks, setSources, setSpawnStrategy, setSubmitPriority, setTraps, start, stop, toString, updateSchemes, writeDOT, writeStepsDOT |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait |
Constructor Detail |
---|
protected HadoopFlow()
protected HadoopFlow(cascading.flow.planner.PlatformInfo platformInfo, Map<Object,Object> properties, JobConf jobConf, String name)
public HadoopFlow(cascading.flow.planner.PlatformInfo platformInfo, Map<Object,Object> properties, JobConf jobConf, cascading.flow.FlowDef flowDef)
Method Detail |
---|
protected void initFromProperties(Map<Object,Object> properties)
initFromProperties
in class cascading.flow.BaseFlow<JobConf>
protected void initConfig(Map<Object,Object> properties, JobConf parentConfig)
initConfig
in class cascading.flow.BaseFlow<JobConf>
protected void setConfigProperty(JobConf config, Object key, Object value)
setConfigProperty
in class cascading.flow.BaseFlow<JobConf>
protected JobConf newConfig(JobConf defaultConfig)
newConfig
in class cascading.flow.BaseFlow<JobConf>
public JobConf getConfig()
public JobConf getConfigCopy()
public Map<Object,Object> getConfigAsProperties()
public String getProperty(String key)
key
- of type String
public cascading.flow.FlowProcess<JobConf> getFlowProcess()
public boolean isPreserveTemporaryFiles()
protected void internalStart()
internalStart
in class cascading.flow.BaseFlow<JobConf>
public boolean stepsAreLocal()
protected void internalClean(boolean stop)
internalClean
in class cascading.flow.BaseFlow<JobConf>
protected void internalShutdown()
internalShutdown
in class cascading.flow.BaseFlow<JobConf>
protected int getMaxNumParallelSteps()
getMaxNumParallelSteps
in class cascading.flow.BaseFlow<JobConf>
|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |