When joining two streams via a CoGroup Pipe, try to pass the larger of
the two streams as the leftmost argument to the CoGroup. Joining
multiple streams requires accumulating values before the join operator
can begin, but the leftmost stream is not accumulated. Keeping the
largest stream on the left therefore reduces the amount of data that
must be held, which should improve the performance of most joins.
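
The effect of argument order can be illustrated with a small plain-Java
model. This is only a sketch and not Cascading's implementation: the
class JoinOrderSketch, its Row and JoinResult types, and the rows helper
are all hypothetical names introduced for illustration. The model
assumes an inner join in which every right-hand row is accumulated in
memory while the left-hand rows are iterated once and never stored, and
it counts how many rows had to be accumulated.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Illustrative model only: this is NOT how Cascading implements CoGroup.
final class JoinOrderSketch {

    /** A (key, value) record standing in for a tuple. */
    static final class Row {
        final String key;
        final String value;

        Row(String key, String value) {
            this.key = key;
            this.value = value;
        }
    }

    /** The joined rows plus the number of rows that had to be accumulated. */
    static final class JoinResult {
        final List<String> joined = new ArrayList<>();
        int accumulated;
    }

    /**
     * Inner join on key. Right-hand rows are accumulated per key before
     * joining starts; left-hand rows are streamed and never stored.
     */
    static JoinResult innerJoin(List<Row> lhs, List<Row> rhs) {
        JoinResult result = new JoinResult();
        Map<String, List<String>> buffered = new HashMap<>();
        for (Row r : rhs) {
            buffered.computeIfAbsent(r.key, k -> new ArrayList<>()).add(r.value);
            result.accumulated++;
        }
        for (Row l : lhs) {
            for (String rv : buffered.getOrDefault(l.key, new ArrayList<>())) {
                result.joined.add(l.key + ":" + l.value + "," + rv);
            }
        }
        return result;
    }

    /** Builds `count` rows whose keys cycle through k0..k9 (made-up sample data). */
    static List<Row> rows(String prefix, int count) {
        List<Row> out = new ArrayList<>();
        for (int i = 0; i < count; i++) {
            out.add(new Row("k" + (i % 10), prefix + i));
        }
        return out;
    }

    public static void main(String[] args) {
        List<Row> large = rows("big", 1000);
        List<Row> small = rows("small", 10);

        JoinResult largeOnLeft = innerJoin(large, small);
        JoinResult smallOnLeft = innerJoin(small, large);

        // Both orderings produce the same number of joined rows;
        // only the number of accumulated rows differs.
        System.out.println("large on left: joined=" + largeOnLeft.joined.size()
                + " accumulated=" + largeOnLeft.accumulated);
        System.out.println("small on left: joined=" + smallOnLeft.joined.size()
                + " accumulated=" + smallOnLeft.accumulated);
    }
}
```

In this model both orderings yield the same number of joined rows, but
placing the large stream on the left means only the small stream's rows
are accumulated. In an actual Cascading assembly the corresponding
choice is simply which Pipe is given as the leftmost argument to the
CoGroup.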
Copyright © 2007-2012 Concurrent, Inc. All Rights Reserved.