|
|||||||||
PREV NEXT | FRAMES NO FRAMES |
Serialization.accept(Class)
.
Tuple
to this list.
TupleEntry
into the outgoing stream.
Tuple
into the outgoing stream.
Tap
for use in the resulting Flow
.
Tap
named after the given Checkpoint
for use in the resulting Flow
.
Tap
pairs.
Flow
instance that is intended to participate in a Cascade
.
Flow
instances intended to participate in a Cascade
.
Flow
instances intended to participate in a Cascade
.
Flow
{2link FlowStats} instance.
Tap
for use in the resulting Flow
.
Tap
named after the given Pipe
for use in the resulting Flow
.
Tap
pairs.
Tap
for use in the resulting Flow
.
Tap
named after the given Pipe
for use in the resulting Flow
.
Tap
pairs.
Pipe
to this FlowDef that represents a tail in a pipe assembly.
Pipe
and sink Tap
to this FlowDef.
Operator
s to dynamically load classes and resources from a ClassLoader
.
Tap
for use in the resulting Flow
.
Tap
named after the given Pipe
for use in the resulting Flow
.
Tap
pairs.
TupleEntry
value in the current grouping.
TupleEntry
value in the current grouping.
SubAssembly
that serves two roles for handling aggregate operations.Function
.AggregateBy.CompositeFunction
class.Aggregator
invocation arguments.Filter
class that will logically 'and' the results of the constructor provided Filter
instances.Type
with a given field name or position.
Type
with a given field name or position
as declared in the given Fields parameter.
Flow
may or may not be required to have set.Class
if an instance of Class
or to Object
if not.
Iterable
instance that will coerce all Tuple elements
into the given type
parameter.
Flow
.Tuple
and each constructor value is .equals()
to its corresponding argument value.Tuple
is equal to the value
provided on the constructor.Tuple
values.GroupAssertion
that asserts the number of items in the current group
is equal the given size.GroupAssertion
that asserts the number of items in the current group
is less than the given size.GroupAssertion
that asserts the number of items in the current group
is more than the given size.Assertion
fails.Assertion
instance.Tuple
by joining each individual element of the Tuple with a tab character (\t).Tuple
element individually.Tuple
and each constructor value is not
.equals()
to its corresponding argument value.Tuple
is not a null value.Tuple
is a null value.Tuple
in the stream is exactly the given size.Tuple
in the stream has a size less than (<) the given size.Tuple
in the stream has a size more than (>) the given size."int"
or "java.lang.String"
to its corresponding Class
or instance of CoercibleType.
Aggregator
that returns the average of all numeric values in the current group.AggregateBy.Functor
that is used to count and sum observed duplicates from the tuple stream.Assertion
implementations.Tuple
streams via
the Each
or Every
Pipe
.Fields.UNKNOWN
Tuple
and accepts any number of arguments.
Serialization
interface for use
by BigDecimal
instances.Aggregator
by the fact that it operates on unique groups of values.Buffer
invocation arguments.Properties
instance with all property values for this type.
Properties
instance with all property values for this type
using the given Map of property values as defaults.
Properties
instance with all property values for this type
using the given IterableProperties
instance with all property values for this type
using the given Properties instance of property values as defaults.
Serialization
interface for use
by byte
arrays (byte[]
).Flow
instances that share or depend on equivalent Tap
instances and are executed as
a single group.Cascade
instance from a collection of Flow
instance.Cascade
.Cascade
class.Cascade
execution.Cascade
level properties passed
through a CascadeConnector
.Cascade
and Flow
instances via the CascadeConnector
and FlowConnector
, respectively.Cascade
specific statistics.FactoryLoader
utility.ServiceLoader
utility.split
array
will be updated in place.
SubAssembly
that will coerce all incoming Tuple
values to
the given types.CoercibleType
mapping available.
CoercibleType
.
CoercibleType
instances based on the
given field type information.
CoercibleType
instances based on the
given type array.
Fields
instances to be extended with custom
type information.CoercibleType
for the given Type
instance.
Joiner
.Pipe
instance.
Pipe
instance.
Pipe
instance.
Pipe
instance.
Pipe
instance.
Pipe
instance.
Pipe
instance.
Pipe
instance.
RawComparator
.Partitioner
.Tap
this instance encapsulates.
Comparable.compareTo(Object)
.
Comparable.compareTo(Object)
method.
Comparator
implementations when required internally.Cascade.start()
was not previously called.
TupleEntry
has been passed to the
Aggregator.aggregate(cascading.flow.FlowProcess, AggregatorCall)
method.
Tap
classes as having child Tap instances.FunctionCall
, FilterCall
,
AggregatorCall
, ValueAssertionCall
, and GroupAssertionCall
.SourceCall
and SinkCall
.Flow
objects, it will connect them and return a new Cascade
instance.
Flow
objects, it will connect them and return a new Cascade
instance.
Flow
objects, it will connect them and return a new Cascade
instance.
Flow
objects, it will connect them and return a new Cascade
instance.
Aggregator
that calculates the number of items in the current group.AggregateBy.Functor
that is used to count observed duplicates from the tuple stream.Filter
that increments a given Enum
counter by 1 or by the given increment
value.Pattern
for removing quote characters from a String.
Pattern
cleaning quote escapes from a String.
value
with the given splitPattern
.
Pattern
for splitting a line of text into its component
parts using the given delimiter and quote Strings.
SimpleDateFormat
syntax.DateFormatter
and DateParser
.SimpleDateFormat
syntax.Calendar
field
values.
SimpleDateFormat
instance.
Calendar
field
values.
CoercibleType
.Filter
that will never remove an item from a stream, but will print the Tuple to either
stdout or stderr.Debug
instance.SinkMode.UPDATE
.
SinkMode.UPDATE
flag.
Tap
class that provides access to the Hadoop Distributed File System.SubAssembly
that will retain all incoming fields except those given on the constructor.DocumentService
instance.Function
or a Filter
to each entry in the Tuple
stream.Aggregator
or Buffer
to every grouping.Tuple
values.Tuple
values.ExpressionFunction
, ExpressionFilter
,
AssertExpression
.Comparable
types and the Comparable.compareTo(Object)
result is use for max/min
comparison.ProviderLoader
and is used to load
CascadingFactory
instances used by internal frameworks.Tuple
.Tap
sub-class that allows for direct local file access.Scheme
and file path
.
Scheme
,
file path
, and SinkMode
.
Tap
classes as representing a file like interface.Operation
as a filter, as opposed to being a Function
.Filter
invocation arguments.Tuple
is a null value.Tuple
is not a null value.Aggregator
that returns the first Tuple
encountered in a grouping.AggregateBy.Functor
that is used to count observed duplicates from the tuple stream.Pipe
instances connected to source
and sink Tap
instances.Flow
instance.
FlowConnector
specific
properties through the FlowConnector
constructor.Flow
.Flow
execution.Flow
specific properties through
a FlowConnector
.FlowSkipStrategy
implementation that returns
true
if the Flow sink exists, regardless if it is stale or not.FlowSkipStrategy
implementation that returns
true
if the Flow sink exists and is not stale in relation to the Flow sources.Flow
should
be skipped when executed in a Cascade
.Flow
specific statistics.FlowStep
specific statistics.FlowStep
configuration values
before they are submitted to the underlying platform.Formatter
class for formatting this tuples values into a new string.
Collections.frequency(java.util.Collection, Object)
.
Collections.frequency(java.util.Collection, Object)
.
FSInputStream
implementation that can verify a
MessageDigest
and will count the number of bytes read for use in progress status.Operation
as a function, as opposed to being a Filter
.Function
invocation arguments.Fields
of the expected arguments TupleEntry
.
TupleEntry
of argument values.
TupleEntry
of argument values.
TupleEntry
of argument values.
TupleEntry
of argument values.
TupleEntry
of argument values.
Iterator
of TupleEntry
instances representing the arguments for the called
Buffer.operate(cascading.flow.FlowProcess, BufferCall)
method.
blocksize
specified by the underlying file system for this resource.
Collection
of checkpoint Tap
s for this Flow object.
Map
of
key value pairs.
ConfigDef
instance that allows for local properties to be set and made available via
a resulting FlowProcess
instance when the pipe is invoked.
ConfigDef
instance that allows for local properties to be set and made available via
a resulting FlowProcess
instance when the tap is invoked.
Fields
declared by the current Operation
.
Fields
declared by the current Operation
.
Fields
declared by the current Operation
.
Serialization.getDeserializer(Class)
.
Flow
s executed by the Cascade.
Flow
instance.
FlowSkipStrategy
used by this Flow.
FlowSkipStrategy
used by this Flow.
FlowStepStrategy
instance.
TupleEntry
.
TupleEntry
.
TupleEntry
.
GroupBy
instance so that any custom properties
can be set on it via Pipe.getStepConfigDef()
.
Flow
ID value.
TupleEntry
to be populated
with the input values from SourceCall.getInput()
.
JobClient
managing this Hadoop job.
Each
or
Every
Operator.
TupleEntry
to be passed to the
SinkCall.getOutput()
output handler.
TupleEntryCollector
used to emit result values.
TupleEntryCollector
used to emit result values.
TupleEntryCollector
used to emit result values.
Formatter
format String of this TemplateTap object.
PlatformInfo
for the underlying platform.
replication
specified by the underlying file system for
this resource.
RunningJob
managing this Hadoop job.
Serialization.getSerializer(Class)
.
SinkMode
}of this Tap object.
Tap
managed by this Flow instance.
Collection
of sink Tap
s for this Flow object.
Collection
of source Tap
s for this Flow object.
CascadingStats.Status
of this CascadingStats object.
ConfigDef
instance that allows for process level properties to be set and made available via
a resulting FlowProcess
instance when the pipe is invoked.
ConfigDef
instance that allows for process level properties to be set and made available via
a resulting FlowProcess
instance when the tap is invoked.
HfsProps
Collection
of trap Tap
s for this Flow object.
MultiSourceTap
that accepts Hadoop style 'file globing' expressions so
multiple files that match the given pattern may be used as the input sources for a given Flow
.Assertion
used with the Every
pipe Operator.GroupAssertion
invocation arguments.Tuple
stream by the given groupFields.Fields.ALL
fields.
RawComparator
.Partitioner
.RawComparator
.Flow
.Tap
instances with an assembly of Pipe
instances into
an executable HadoopFlow
for execution on an Apache Hadoop cluster.Properties
instance as
default value for the underlying jobs.
FlowProcess
for Hadoop.Iterable
object that can store an unlimited number of Tuple
instances by spilling
excess to a temporary disk file.TupleEntryCollector
that writes tuples to the resource managed by
a particular Tap
instance.true
if there are properties in the configDef instance.
true
if there are properties in the configDef instance.
Comparator
implementation to also be delegated to during hashCode generation
during grouping partitioning.Joiner
when
all but one tuple stream is considered small enough to fit into memory.FlowListener
instances have been registered.
Tuple
instances available.
PlannedOperation
object with an associated
PlannerLevel
level.
true
if there are properties in the processConfigDef instance.
true
if there are properties in the processConfigDef instance.
Flow
may or may not be required to have set.FileSystem
for accessing remote HTTP and HTTPS data.InnerJoin.JoinIterator
that will iterate over a given Joiner
and return tuples that represent
and inner join of the CoGrouper internal grouped tuple collections.Aggregator
operation.
Fields.ALL
field set.
Fields.ALL
, Fields.GROUP
, or
Fields.VALUES
.
Fields.ARGS
field set.
Buffer
operation.
Fields.ALL
or Fields.UNKNOWN
.
true
if there are no properties.
CascadingStats.isStarted()
, CascadingStats.isSubmitted()
, or CascadingStats.isRunning()
returns true;
CascadingStats.isSkipped()
, CascadingStats.isSuccessful()
, CascadingStats.isFailed()
, or CascadingStats.isStopped()
returns true.
Fields.GROUP
field set.
Fields.NONE
field set.
FlowPlanner
to
remove all instances from the plan.
Fields.ALL
or Fields.RESULTS
.
Fields.REPLACE
field set.
Fields.RESULTS
field set.
true
if non-grouping fields will not be nulled after the argumentsIterator is completed.
true
if this Operation instance can safely execute on the same 'record' multiple
times, false
otherwise.
Cascade
should skip this Flow instance.
PlannerLevel
is less strict than this instance.
Fields.SWAP
field set.
true
if the sink fields equal the source fields.
Fields.UNKNOWN
field set.
Fields.VALUES
field set.
Iterator
over this Tuple instances values.
ObjectSerializer
.CoGroup
.Aggregator
that returns the last Tuple
encountered.Tuple.remove(int[])
.
LeftJoin.JoinIterator
that will iterate over a given Joiner
and return tuples that represent
a left inner, right outer join of the CoGrouper internal grouped tuple collections.Tap
class that provides access to the Local File System via Hadoop.Filter
that will limit the number of Tuple
instances that it will
allow to pass.Flow
.Tap
instances with an assembly of Pipe
instances into
an executable LocalFlow
for execution in local memory.FlowProcess
.FlowStep
.Filter
operations.HadoopFlow
subclass that supports custom MapReduce jobs
pre-configured via the JobConf
object.JobConf
and allows it to be executed as a Flow
.CascadingStats.Status.FAILED
.
CascadingStats.Status.PENDING
.
CascadingStats.Status.RUNNING
.
CascadingStats.Status.SKIPPED
.
CascadingStats.Status.STARTED
.
CascadingStats.Status.STARTED
then CascadingStats.Status.RUNNING
and forces the start and running time to be equals.
CascadingStats.Status.STOPPED
.
CascadingStats.Status.SUBMITTED
.
CascadingStats.Status.SUCCESSFUL
.
Aggregator
that returns the maximum value encountered in the
current group.Aggregator
that returns the minimum value encountered in the current group.Iterator
that will iterate over a given
Joiner
and return tuples that represent a join as defined by the given boolean array.CloseableIterator
.CompositeTap
and SinkTap
that can write to
multiple child Tap
instances simultaneously.Tap
instances into a single resource.Tuple
.
Filter
class that will logically 'not' (negation) the results of the constructor provided Filter
instance.null
.
Scheme
that neither reads or writes any values.Flow
throws a Throwable type.
FlowStep
throws a Throwable type.
TupleEntryIterator
for the given Tap instance.
TupleEntryIterator
for the given Tap instance.
Tuple
streams.BaseOperation
classes.Pipe
.Filter
class that will logically 'or' the results of the constructor provided Filter
instances.OuterJoin.JoinIterator
that will iterate over a given Joiner
and return tuples that represent
and outer join of the CoGrouper internal grouped tuple collections.Each
, Every
, GroupBy
,
CoGroup
, Merge
, HashJoin
, and SubAssembly
.Operation
implementations
that use a PlannerLevel
value to inform the FlowPlanner
how to treat the operation
during job planning.DebugLevel
and AssertionLevel
enum types.Cascade
to notify the given Flow it should initialize or clear any resources
necessary for Flow.start()
to be called successfully.
Flow
subclass that supports custom Riffle jobs.RecordReader
instance
via the Iterator
.groups
are stored in the appropriate number of new fields.
groups
are stored in the named fieldDeclarations.
SubAssembly
that will rename the fromFields to the names in toFields.SubAssembly
that will discard all incoming fields except those given on the constructor.RawComparator
.RightJoin.JoinIterator
that will iterate over a given Joiner
and return tuples that represent
a left outer, right inner join of the CoGrouper internal grouped tuple collections.Filter
that only allows the given fraction of Tuple
instances to pass.Tap
instance by declaring the Tuple
field names, and alternately parsing or rendering the incoming or outgoing Tuple
stream, respectively.Tuple
values.Tuple
values.Tuple
values.Scheme
, which is a flat file consisting of
binary key/value pairs.TupleInputStream
and TupleOutputStream
to substitute Integer values for a class name when writing out nested objects inside a Tuple
.ProviderLoader
and is used to load
CascadingService
instances used by internal frameworks.AssertionLevel
.
Comparator
with a given field name or position.
DebugLevel
.
Comparator
to be used if no Comparator can be found for the class via the
Comparison
interface.
Parser.setFeature(java.lang.String, boolean)
.
FlowSkipStrategy
, the current strategy, if any, is returned.
FlowSkipStrategy
, the current strategy is returned.
FlowStepStrategy
instance.
ConfigDef.Mode.REPLACE
mode.
ConfigDef.Mode
value.
false
if at the end of all values iterated over in the argumentsIterator, the last seen argument tuple
values should not be nulled out.
Tap
instance for the give process incoming and outgoing dependencies.
HfsProps
CombineFileInputFormat
for the input format.
CombineFileInputFormat
for the input format.
Function
that allows for a Tuple value to be returned based on the outcome
of a given Filter
operation.Tuple
found on SinkCall.getOutgoingEntry()
to
the SinkCall.getOutput()
.
Scheme.sink(cascading.flow.FlowProcess, SinkCall)
invocation
arguments.Scheme.sinkPrepare(cascading.flow.FlowProcess, SinkCall)
.
Scheme.sink(cascading.flow.FlowProcess, SinkCall)
.
SourceCall.getInput()
and populate
the available Tuple
via SourceCall.getIncomingEntry()
and return true
on success or false
if no more values available.
Scheme.source(cascading.flow.FlowProcess, SourceCall)
invocation
arguments.Scheme.sourcePrepare(cascading.flow.FlowProcess, SourceCall)
.
Scheme.source(cascading.flow.FlowProcess, SourceCall)
.
FlowConnector
before creating new Flow
instances.threshold
is met.SpillableTupleList
instance.GroupBy
, CoGroup
, Merge
, and HashJoin
.Fields.ALL
fields.
Filter
that sets the current FlowProcess
'status' on
the first Tuple
it sees.stderr
stream.stdin
stream.stdout
stream.InputStream
instances to be compared, typically bit-wise.FileSystem
implementations that manage remote resources.Pipe
s so they my be reused in the same manner
a Pipe is used.Aggregator
that returns the sum of all numeric values in the current group.AggregateBy.Functor
that is used to sum observed duplicates from the tuple stream.Flow
.Tap
and Scheme
subclasses.Tap
instance for use internally.Tuple
instance.Hfs
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
Hfs
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
Hfs
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
Hfs
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
Hfs
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
Hfs
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
Hfs
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
Hfs
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
Hfs
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
Hfs
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
Tuple
instance.FileTap
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
FileTap
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
FileTap
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
FileTap
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
FileTap
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
FileTap
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
FileTap
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
FileTap
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
FileTap
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
FileTap
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
HfsProps.TEMPORARY_DIRECTORY
TextLine
.Fields.UNKNOWN
, sinking
Fields.ALL
and using TAB as the default delimiter.
Fields.UNKNOWN
, sinking
Fields.ALL
and using TAB as the default delimiter.
Fields.UNKNOWN
, sinking
Fields.ALL
and using TAB as the default delimiter.
Fields.UNKNOWN
, sinking
Fields.ALL
and using the given delimitedParser instance for parsing.
Fields.UNKNOWN
, sinking
Fields.ALL
and using the given delimitedParser instance for parsing.
Fields.UNKNOWN
, sinking
Fields.ALL
and using the given delimitedParser instance for parsing.
Fields.UNKNOWN
, sinking
Fields.ALL
and using the given delimitedParser instance for parsing.
Fields.UNKNOWN
, sinking
Fields.ALL
and using TAB as the default delimiter.
Fields.UNKNOWN
, sinking
Fields.ALL
and using TAB as the default delimiter.
Fields.UNKNOWN
, sinking
Fields.ALL
and using TAB as the default delimiter.
Fields.UNKNOWN
, sinking
Fields.ALL
and using TAB as the default delimiter.
Fields.UNKNOWN
, sinking
Fields.ALL
and using the given delimitedParser instance for parsing.
Fields.UNKNOWN
, sinking
Fields.ALL
and using the given delimitedParser instance for parsing.
Scheme
for plain text files.Scheme
for plain text files.AssertionException
.
AssertionException
.
Tuple
instance and its declaring Fields
instance to be used as a single object.BaseOperation
instances to emit result Tuple
values.TupleEntry
elements in an
underlying Tuple
collection.Scheme
instance, calling
Scheme.sink(cascading.flow.FlowProcess, cascading.scheme.SinkCall)
on every call to TupleEntrySchemeCollector.add(TupleEntry)
or TupleEntrySchemeCollector.add(Tuple)
.Scheme
instance, calling
Scheme.source(cascading.flow.FlowProcess, cascading.scheme.SourceCall)
on every call to
TupleEntrySchemeIterator.next()
.Tuple
, Fields
and TupleEntry
classes.Tuple
and TupleEntry
instances.Serialization
interface.FlowConnector
before creating new Flow
instances.Function
that will 'un-group' data from a given dataset.SubAssembly
is used to filter all duplicates out of a tuple stream.Filter
that is used to remove observed duplicates from the tuple stream.Flow
and Cascade
allowing them to be used interchangeably.Executors.newFixedThreadPool(int)
ExecutorService
to spawn threads.UnitOfWork
class.Assertion
used with the Each
pipe Operator.ValueAssertion
invocation arguments.SequenceFile
that reads and writes values of the given
writableType
Class
, instead of Tuple
instances used by default in SequenceFile.Filter
class that will logically 'xor' (exclusive or) the results of the
constructor provided Filter instances.
|
|||||||||
PREV NEXT | FRAMES NO FRAMES |