|
|||||||||
PREV NEXT | FRAMES NO FRAMES |
Serialization.accept(Class)
.
Tuple
to this list.
TupleEntry
to this list.
TupleEntry
into the outgoing stream.
Tuple
into the outgoing stream.
Flow
{2link FlowStats} instance.
TupleEntry
value in the current grouping.
TupleEntry
value in the current grouping.
SubAssembly
that serves two roles for handling aggregate operations.Function
.AggregateBy.CompositeFunction
class.Aggregator
invocation arguments.Filter
class that will logically 'and' the results of the constructor provided Filter
instances.Tuple
instance into an Object[].
Tuple
instance into an Object[].
Tuple
and each constructor value is .equals()
to its corresponding argument value.Tuple
is equal to the value
provided on the constructor.Tuple
values.GroupAssertion
that asserts the number of items in the current group
is equal the given size.GroupAssertion
that asserts the number of items in the current group
is less than the given size.GroupAssertion
that asserts the number of items in the current group
is more than the given size.Assertion
fails.Assertion
instance.Tuple
by joining each individual element of the Tuple with a tab character (\t).Tuple
element individually.Tuple
and each constructor value is not
.equals()
to its corresponding argument value.Tuple
is not a null value.Tuple
is a null value.Tuple
in the stream is exactly the given size.Tuple
in the stream has a size less than (<) the given size.Tuple
in the stream has a size more than (>) the given size.Aggregator
that returns the average of all numeric values in the current group.AggregateBy.Functor
that is used to count and sum observed duplicates from the tuple stream.Assertion
implementations.Tuple
streams via
the Each
or Every
Pipe
.Fields.UNKNOWN
Tuple
and accepts any number of arguments.
Aggregator
by the fact that it operates on unique groups of values.Buffer
invocation arguments.Serialization
interface for use
by byte
arrays (byte[]
).Flow
instances that share or depend on equivalent Tap
instances and are executed as
a single group.Cascade
instance from a collection of Flow
instance.Cascade
class.Cascade
and Flow
instances via the CascadeConnector
and FlowConnector
, respectively.Cascade
specific statistics.split
array
will be updated in place.
SubAssembly
that will coerce all incoming Tuple
values to
the given types.Pipe
instance.
Pipe
instance.
Pipe
instance.
Pipe
instance.
Pipe
instance.
Pipe
instance.
Pipe
instance.
Pipe
instance.
RawComparator
.Partitioner
.Tap
this instance encapsulates.
Comparable.compareTo(Object)
.
Comparable.compareTo(Object)
method.
Comparator
implementations when required internally.Cascade.start()
was not previously called.
TupleEntry
has been passed to the
Aggregator.aggregate(cascading.flow.FlowProcess, AggregatorCall)
method.
FunctionCall
, FilterCall
,
AggregatorCall
, ValueAssertionCall
, and GroupAssertionCall
.Flow
objects, it will connect them and return a new Cascade
instance.
Flow
objects, it will connect them and return a new Cascade
instance.
Flow
objects, it will connect them and return a new Cascade
instance.
Flow
objects, it will connect them and return a new Cascade
instance.
Aggregator
that calculates the number of items in the current group.AggregateBy.Functor
that is used to count observed duplicates from the tuple stream.Filter
that increments a given Enum
counter by 1 or by the given increment
value.Pattern
for removing quote characters from a String.
Pattern
cleaning quote escapes from a String.
value
with the given splitPattern
.
Pattern
for splitting a line of text into its component
parts using the given delimiter and quote Strings.
SimpleDateFormat
syntax.DateFormatter
and DateParser
.SimpleDateFormat
syntax.Calendar
field
values.
SimpleDateFormat
instance.
Calendar
field
values.
Filter
that will never remove an item from a stream, but will print the Tuple to either
stdout or stderr.Debug
instance.SinkMode.UPDATE
.
SinkMode.UPDATE
flag.
Tap
class that provides access to the Hadoop Distributed File System.Function
or a Filter
to each entry in the Tuple
stream.Aggregator
or Buffer
to every grouping.Tuple
values.Tuple
values.ExpressionFunction
, ExpressionFilter
,
AssertExpression
.Tuple
.Operation
as a filter, as opposed to being a Function
.Filter
invocation arguments.Tuple
is a null value.Tuple
is not a null value.Aggregator
that returns the first Tuple
encountered.Pipe
assembly is connected to the necessary number of Tap
sinks and
sources into a Flow.Tuple
instances
between operations.Tap
instances with an assembly of Pipe
instances into
an executable Flow
.Properties
instance as
default value for the underlying jobs.
Flow
instance.
Flow
execution.FlowSkipStrategy
implementation that returns
true
if the Flow sink exists, regardless if it is stale or not.FlowSkipStrategy
implementation that returns
true
if the Flow sink exists and is stale in relation to the Flow sources.Flow
should
be skipped when executed in a Cascade
.Flow
specific statistics.Formatter
class for formatting this tuples values into a new string.
FSInputStream
implementation that can verify a
MessageDigest
and will count the number of bytes read for use in progress status.BaseOperation
as a function, as opposed to being a Filter
.Function
invocation arguments.Fields
of the expected arguments TupleEntry
.
TupleEntry
of argument values.
TupleEntry
of argument values.
TupleEntry
of argument values.
TupleEntry
of argument values.
TupleEntry
of argument values.
TupleEntry
for the declared arguments of this scope.
Iterator
of TupleEntry
instances representing the arguments for the called
Buffer.operate(cascading.flow.FlowProcess, BufferCall)
method.
AssertionLevel
.
DebugLevel
.
Serialization.getDeserializer(Class)
.
Flow
s executed by the Cascade.
Flow
instance.
FlowSkipStrategy
used by this Flow.
FlowSkipStrategy
used by this Flow.
TupleEntry
.
TupleEntry
.
TupleEntry
.
Every.EveryHandler
for this instnce.
Each
or
Every
Operator.
TupleEntryCollector
used to emit result values.
TupleEntryCollector
used to emit result values.
TupleEntryCollector
used to emit result values.
Formatter
format String of this TemplateTap object.
Serialization.getSerializer(Class)
.
SinkMode
}of this Tap object.
Tap
managed by this Flow instance.
Collection
of sink Tap
s for this Flow object.
Collection
of source Tap
s for this Flow object.
ZipInputFormat.listPathsInternal(JobConf)
.
Collection
of trap Tap
s for this Flow object.
MultiSourceTap
that accepts Hadoop style 'file globbing' expressions so
multiple files that match the given pattern may be used as the input sources for a given Flow
.GroupBy
and CoGroup
.Fields.ALL
fields.
Assertion
used with the Every
pipe Operator.GroupAssertion
invocation arguments.Tuple
stream by the given groupFields.Fields.ALL
fields.
RawComparator
.Partitioner
.RawComparator
.FlowProcess
for Hadoop.FlowListener
instances have been registered.
Tuple
instances available.
PlannedOperation
object with an associated
PlannerLevel
level.
true
if there are properties associated with this FlowStep.
FileSystem
for accessing remote HTTP and HTTPS data.InnerJoin.JoinIterator
that will iterate over a given Joiner
and return tuples that represent
and inner join of the CoGrouper internal grouped tuple collections.Aggregator
operation.
Fields.ALL
field set.
Fields.ALL
, Fields.GROUP
, or
Fields.VALUES
.
Fields.ARGS
field set.
Buffer
operation.
Fields.ALL
or Fields.UNKNOWN
.
Fields.GROUP
field set.
FlowPlanner
to
remove all instances from the plan.
Fields.ALL
or Fields.RESULTS
.
Fields.REPLACE
field set.
Fields.RESULTS
field set.
true
if this Operation instance can safely execute on the same 'record' multiple
times, false
otherwise.
Cascade
should skip this Flow instance.
PlannerLevel
is less strict than this instance.
Fields.SWAP
field set.
true
if the sink fields equal the source fields.
Fields.UNKNOWN
field set.
Fields.VALUES
field set.
Tap
instances TupleEntryCollector
should be used to sink values.
TupleEntryCollector
should be used to sink values.
Tuple
instance.
Iterator
over this Tuple instances values.
CoGroupClosure
.Aggregator
that returns the last Tuple
encountered.Tuple.remove(int[])
.
LeftJoin.JoinIterator
that will iterate over a given Joiner
and return tuples that represent
a left inner, right outer join of the CoGrouper internal grouped tuple collections.Tap
class that provides access to the Local File System via Hadoop.Filter
that will limit the number of Tuple
instances that it will
allow to pass.Filter
operations.Flow
subclass that supports custom MapReduce jobs pre-configured via the JobConf
object.JobConf
and allows it to be executed as a Flow
.Aggregator
that returns the maximum value encountered in the current group.Aggregator
that returns the minimum value encountered in the current group.Iterator
that will iterate over a given
Joiner
and return tuples that represent a join as defined by the given boolean array.CompositeTap
and SinkTap
that can write to multiple child Tap
instances simultaneously.Tap
instances into a single resource.Tuple
.
Filter
class that will logically 'not' (negation) the results of the constructor provided Filter
instance.FlowStep
throws a Throwable type.
TupleIterator
for the given Tap instance.
TupleIterator
for the given Tap instance.
Tuple
streams.BaseOperation
classes.Pipe
.Filter
class that will logically 'or' the results of the constructor provided Filter
instances.OuterJoin.JoinIterator
that will iterate over a given Joiner
and return tuples that represent
and outer join of the CoGrouper internal grouped tuple collections.Each
, Every
, GroupBy
, and
CoGroup
.Operation
implementations
that use a PlannerLevel
value to inform the FlowPlanner
how to treat the operation
during job planning.DebugLevel
and AssertionLevel
enum types.Cascade
to notify the given Flow it should initialize or clear any resources
necessary for Flow.start()
to be called successfully.
Flow
subclass that supports custom Riffle jobs.groups
are stored in the appropriate number of new fields.
groups
are stored in the named fieldDeclarations.
Pipe
or SubAssembly
.
SubAssembly
that will rename the fromFields to the names in toFields.RawComparator
.RightJoin.JoinIterator
that will iterate over a given Joiner
and return tuples that represent
a left outer, right inner join of the CoGrouper internal grouped tuple collections.Filter
that only allows the given fraction of Tuple
instances to pass.Tap
instance by declaring the Tuple
field names, and alternately parsing or rendering the incoming or outgoing Tuple
stream, respectively.Scheme
, which is a flat file consisting of
binary key/value pairs.TupleInputStream
and TupleOutputStream
to substitute Integer values for a class name when writing out nested objects inside a Tuple
.AssertionLevel
.
Comparator
with a given field name or position.
DebugLevel
.
Parser.setFeature(java.lang.String, boolean)
.
FlowSkipStrategy
, the current strategy, if any, is returned.
FlowSkipStrategy
, the current strategy is returned.
Tap
instance for the give process incoming and outgoing dependencies.
Function
that allows for a Tuple value to be returned based on the outcome
of a given Filter
operation.TupleEntryCollector
should be used to sink values.
SubAssembly
that will modify the input Tuple stream to match the given fields.Tuple
instance to the outputCollector.
TemplateTap
.Tuple
instance.
Tuple
MultiSourceTap
.Iterable
object that can store an unlimited number of Tuple
instances by spilling
excess to a temporary disk file.Filter
that sets the current FlowProcess
'status' on
the first Tuple
it sees.FlowStep
instances.FlowStep
specific statistics.InputStream
instances to be compared, typically bit-wise.FileSystem
implementations that manage remote resources.Pipe
s so they my be reused in the same manner
a Pipe is used.Aggregator
that returns the sum of all numeric values in the current group.AggregateBy.Functor
that is used to sum observed duplicates from the tuple stream.Flow
.TupleEntryCollector
that writes tuples to the resource managed by
a particular Tap
instance.Tap
subclasses.TupleIterator
.Tap
instance for use internally.Tuple
instance.Hfs
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
Hfs
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
Hfs
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
Hfs
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
Hfs
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
Hfs
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
Hfs
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
Hfs
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
Hfs
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
Hfs
Tap as the
base path and default Scheme
, and the pathTemplate as the Formatter
format String.
TextLine
.Scheme
for plain text files.AssertionException
.
AssertionException
.
Object.toString()
of the given object, or an empty String if the object
is null.
Tuple
instance and its declarating Fields
instance to be used as a single object.BaseOperation
instances to emit result Tuple
values.TupleEntry
elements in an
underlying Tuple
collection.Tuple
, Fields
and TupleEntry
classes.Tuple
instances.Tuple
and TupleEntry
instances.Serialization
interface.Function
that will 'un-group' data from a given dataset.SubAssembly
is used to filter all duplicates out of a tuple stream.Filter
that is used to remove observed duplicates from the tuple stream.Assertion
used with the Each
pipe Operator.ValueAssertion
invocation arguments.SequenceFile
that reads and writes values of the given
writableType
Class
, instead of Tuple
instances used by default in SequenceFile.Filter
class that will logically 'xor' (exclusive or) the results of the
constructor provided Filter instances.InputFormat
for zip files.
|
|||||||||
PREV NEXT | FRAMES NO FRAMES |