public class MapsideJoin extends Object
Utility for doing map side joins on a common key between two PTables.
A map side join is an optimized join which doesn't use a reducer; instead, the right side of the join is loaded into memory and the join is performed in a mapper. This style of join has the important implication that the output of the join is not sorted, unlike the output of a conventional (reducer-based) join.
Note: This utility is only supported when running with an MRPipeline as the pipeline.
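The idea described above (load the right side into memory, then stream the left side past it) can be illustrated with a small, library-free sketch. This is not Crunch's implementation and uses no Crunch types: the class name `MapsideJoinSketch`, the helper `entry`, and the use of `Map.Entry` in place of `Pair` are all hypothetical, and the sketch assumes inner-join semantics, which the description above does not state.

```java
import java.util.AbstractMap.SimpleImmutableEntry;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Conceptual sketch of a map-side (in-memory hash) join; not Crunch's code. */
final class MapsideJoinSketch {

    /** Hypothetical helper that builds an immutable key/value entry. */
    static <A, B> Map.Entry<A, B> entry(A key, B value) {
        return new SimpleImmutableEntry<>(key, value);
    }

    /**
     * Joins two lists of key/value entries on their keys. The right side is
     * loaded fully into an in-memory index and the left side is streamed
     * past it, so no sort or shuffle step is involved.
     */
    static <K, U, V> List<Map.Entry<K, Map.Entry<U, V>>> join(
            List<Map.Entry<K, U>> left, List<Map.Entry<K, V>> right) {
        // Build phase: index every right-side value by its key.
        Map<K, List<V>> index = new HashMap<>();
        for (Map.Entry<K, V> r : right) {
            index.computeIfAbsent(r.getKey(), k -> new ArrayList<>()).add(r.getValue());
        }
        // Probe phase: emit one pair per matching right-side value.
        List<Map.Entry<K, Map.Entry<U, V>>> out = new ArrayList<>();
        for (Map.Entry<K, U> l : left) {
            List<V> matches = index.get(l.getKey());
            if (matches == null) {
                continue; // assumed inner join: unmatched left rows are dropped
            }
            for (V v : matches) {
                out.add(entry(l.getKey(), entry(l.getValue(), v)));
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<Map.Entry<String, Integer>> left =
                Arrays.asList(entry("b", 2), entry("a", 1), entry("c", 3));
        List<Map.Entry<String, String>> right =
                Arrays.asList(entry("a", "x"), entry("b", "y"));
        // Results follow the left side's input order ("b" before "a"), not key order.
        System.out.println(join(left, right));
    }
}
```

Because the probe phase emits results in whatever order the left side arrives, the output is not sorted by key. The memory cost is driven entirely by the right side, so in this sketch the smaller input should be passed as `right`.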
Constructor and Description |
---|
MapsideJoin() |
public static <K,U,V> PTable<K,Pair<U,V>> join(PTable<K,U> left, PTable<K,V> right)
left - The left-side table of the join
right - The right-side table of the join, whose contents will be fully read into memory
Copyright © 2013 The Apache Software Foundation. All Rights Reserved.