java.lang.Object
  org.apache.crunch.DoFn<Pair<Pair<K,Integer>,Iterable<Pair<U,V>>>,Pair<K,Pair<U,V>>>
    org.apache.crunch.lib.join.JoinFn<K,U,V>

K - Type of the keys.
U - Type of the first PTable's values.
V - Type of the second PTable's values.

public abstract class JoinFn<K,U,V>

Represents a DoFn for performing joins.
Constructor Summary | |
---|---|
JoinFn(PType<K> keyType, PType<U> leftValueType) | Instantiate with the PType of the value of the left side of the join (used for creating deep copies of values). |
Method Summary | |
---|---|
abstract String | getJoinType() |
void | initialize() - Initialize this DoFn. |
abstract void | join(K key, int id, Iterable<Pair<U,V>> pairs, Emitter<Pair<K,Pair<U,V>>> emitter) - Performs the actual joining. |
void | process(Pair<Pair<K,Integer>,Iterable<Pair<U,V>>> input, Emitter<Pair<K,Pair<U,V>>> emitter) - Split up the input record to make coding a bit more manageable. |
Methods inherited from class org.apache.crunch.DoFn |
---|
cleanup, configure, disableDeepCopy, scaleFactor, setConfiguration, setContext |
Methods inherited from class java.lang.Object |
---|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public JoinFn(PType<K> keyType, PType<U> leftValueType)
keyType - The PType of the value used as the key of the join.
leftValueType - The PType of the value type of the left side of the join.
Method Detail |
---|
public void initialize()
Initialize this DoFn. Initialization happens before DoFn.process(Object, Emitter) is triggered. Subclasses may override this method to do appropriate initialization.
Called during the setup of the job instance this DoFn is associated with.
initialize in class DoFn<Pair<Pair<K,Integer>,Iterable<Pair<U,V>>>,Pair<K,Pair<U,V>>>
public abstract String getJoinType()
public abstract void join(K key, int id, Iterable<Pair<U,V>> pairs, Emitter<Pair<K,Pair<U,V>>> emitter)
key - The key for this grouping of values.
id - The side that this group of values is from (0 -> left, 1 -> right).
pairs - The group of values associated with this key and id pair.
emitter - The emitter to send the output to.
public void process(Pair<Pair<K,Integer>,Iterable<Pair<U,V>>> input, Emitter<Pair<K,Pair<U,V>>> emitter)
process in class DoFn<Pair<Pair<K,Integer>,Iterable<Pair<U,V>>>,Pair<K,Pair<U,V>>>
input - The input record.
emitter - The emitter to send the output to.
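
The following is a minimal sketch of how a subclass might honor the join(key, id, pairs, emitter) contract for an inner join, and of how process could split its input record before delegating to join. It is not the Crunch implementation. Pair and Emitter here are hypothetical stand-ins so the example compiles without Crunch on the classpath, and InnerJoinSketch, JoinFnSketch, and demo are illustrative names only. The sketch assumes that left-side groups (id 0) arrive before right-side groups (id 1) for the same key, that left values are carried in the first slot of each pair and right values in the second, and it skips the deep copy that the constructor's leftValueType exists to support.

```java
import java.util.ArrayList;
import java.util.List;

public class JoinFnSketch {

    /** Hypothetical stand-in for org.apache.crunch.Pair; only what the sketch needs. */
    static final class Pair<A, B> {
        private final A first;
        private final B second;
        private Pair(A first, B second) { this.first = first; this.second = second; }
        static <A, B> Pair<A, B> of(A first, B second) { return new Pair<>(first, second); }
        A first() { return first; }
        B second() { return second; }
        @Override public String toString() { return "[" + first + "," + second + "]"; }
    }

    /** Hypothetical stand-in for org.apache.crunch.Emitter. */
    interface Emitter<T> { void emit(T value); }

    /** Illustrative inner join; a real subclass would extend JoinFn<K,U,V> instead. */
    static final class InnerJoinSketch<K, U, V> {
        private K lastKey;
        private final List<U> leftValues = new ArrayList<>();

        // Mirrors process(...): unpack ((key, id), pairs) and delegate to join.
        void process(Pair<Pair<K, Integer>, Iterable<Pair<U, V>>> input,
                     Emitter<Pair<K, Pair<U, V>>> emitter) {
            join(input.first().first(), input.first().second(), input.second(), emitter);
        }

        void join(K key, int id, Iterable<Pair<U, V>> pairs,
                  Emitter<Pair<K, Pair<U, V>>> emitter) {
            if (!key.equals(lastKey)) {
                // New key: left values buffered for the previous key no longer apply.
                lastKey = key;
                leftValues.clear();
            }
            if (id == 0) {
                // Left side: buffer values (assumption: no deep copy needed here,
                // because the demo values are immutable).
                for (Pair<U, V> p : pairs) {
                    leftValues.add(p.first());
                }
            } else {
                // Right side: emit one record per (buffered left, right) combination.
                for (Pair<U, V> p : pairs) {
                    for (U left : leftValues) {
                        emitter.emit(Pair.of(key, Pair.of(left, p.second())));
                    }
                }
            }
        }
    }

    /** Feeds three hand-built groups through the sketch and returns what was emitted. */
    static List<String> demo() {
        InnerJoinSketch<String, String, Integer> fn = new InnerJoinSketch<>();
        List<String> out = new ArrayList<>();
        Emitter<Pair<String, Pair<String, Integer>>> emitter = r -> out.add(r.toString());

        List<Pair<String, Integer>> left = new ArrayList<>();
        left.add(Pair.of("alice", null));
        left.add(Pair.of("bob", null));
        List<Pair<String, Integer>> right = new ArrayList<>();
        right.add(Pair.of(null, 7));

        fn.process(Pair.of(Pair.of("k1", 0), left), emitter);   // left group for k1
        fn.process(Pair.of(Pair.of("k1", 1), right), emitter);  // right group for k1
        fn.process(Pair.of(Pair.of("k2", 1), right), emitter);  // right only: no match
        return out;
    }

    public static void main(String[] args) {
        demo().forEach(System.out::println);
    }
}
```

In this sketch the k2 group emits nothing because no left values were buffered for that key, which is the behavior expected of an inner join. A left or full outer variant would also need to emit records when one side is missing.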