public static class Aggregate.TopKCombineFn<K,V> extends CombineFn<Integer,Pair<K,V>>
Constructor and Description |
---|
TopKCombineFn(int limit, boolean maximize, PType<Pair<K,V>> pairType) |
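
The constructor arguments suggest a bounded selection: keep at most `limit` pairs, taking the largest when `maximize` is true and the smallest otherwise. The following is a minimal plain-Java sketch of that idea, not Crunch's implementation. It assumes pairs are ranked by their value. `TopKSketch`, `topK`, and the use of `Map.Entry` in place of `Pair<K,V>` are hypothetical choices for illustration, and the `pairType` argument is omitted.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.Map;
import java.util.PriorityQueue;

class TopKSketch {
    // Hypothetical helper: returns at most `limit` entries, best entry first.
    // Assumption: entries are ranked by value, not by key.
    static <K, V extends Comparable<V>> List<Map.Entry<K, V>> topK(
            Iterable<Map.Entry<K, V>> pairs, int limit, boolean maximize) {
        List<Map.Entry<K, V>> result = new ArrayList<>();
        if (limit <= 0) {
            return result;
        }
        Comparator<Map.Entry<K, V>> byValue = Map.Entry.comparingByValue();
        // Under this ordering the "best" entries compare greatest.
        Comparator<Map.Entry<K, V>> bestLast = maximize ? byValue : byValue.reversed();
        // The heap head is the worst entry retained so far, so it is evicted first.
        PriorityQueue<Map.Entry<K, V>> heap = new PriorityQueue<>(limit, bestLast);
        for (Map.Entry<K, V> pair : pairs) {
            heap.add(pair);
            if (heap.size() > limit) {
                heap.poll();
            }
        }
        result.addAll(heap);
        result.sort(bestLast.reversed());
        return result;
    }

    public static void main(String[] args) {
        List<Map.Entry<String, Integer>> input = List.of(
                Map.entry("a", 5), Map.entry("b", 9),
                Map.entry("c", 1), Map.entry("d", 7));
        System.out.println(topK(input, 2, true));   // [b=9, d=7]
        System.out.println(topK(input, 2, false));  // [c=1, a=5]
    }
}
```

Bounding the heap at `limit` entries keeps memory proportional to `limit` rather than to the input size, which is the usual reason to do this kind of selection in a combiner.
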
Modifier and Type | Method and Description |
---|---|
void | initialize() Initialize this DoFn. |
void | process(Pair<Integer,Iterable<Pair<K,V>>> input, Emitter<Pair<Integer,Pair<K,V>>> emitter) Processes the records from a PCollection. |
Methods inherited from class DoFn: cleanup, configure, disableDeepCopy, scaleFactor, setConfiguration, setContext
public void initialize()
Description copied from class: DoFn
Initialize this DoFn. This initialization happens before DoFn.process(Object, Emitter) is triggered. Subclasses may override this method to do appropriate initialization.
Called during the setup of the job instance this DoFn is associated with.
public void process(Pair<Integer,Iterable<Pair<K,V>>> input, Emitter<Pair<Integer,Pair<K,V>>> emitter)
Description copied from class: DoFn
Processes the records from a PCollection.
Note: Crunch may reuse a single input record object whose content changes on each DoFn.process(Object, Emitter) method call. This functionality is imposed by Hadoop's Reducer implementation: the framework reuses the key and value objects that are passed into the reduce, so the application should clone any objects it wants to keep a copy of.

Copyright © 2016 The Apache Software Foundation. All rights reserved.
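
The object-reuse note on process can be illustrated without Hadoop or Crunch. In this sketch a hypothetical `MutableRecord` stands in for a framework-owned value that is overwritten between calls. `ReuseSketch`, `MutableRecord`, `keepReferences`, and `keepCopies` are illustrative names, not Crunch APIs. Keeping references produces a list in which every element shows the last value written, while copying each record preserves what was actually seen.

```java
import java.util.ArrayList;
import java.util.List;

class ReuseSketch {
    // Hypothetical stand-in for a record object that a framework reuses.
    static final class MutableRecord {
        int value;

        MutableRecord copy() {
            MutableRecord c = new MutableRecord();
            c.value = value;
            return c;
        }
    }

    // Buggy pattern: stores the same instance on every iteration.
    static List<Integer> keepReferences(int[] inputs) {
        MutableRecord shared = new MutableRecord();
        List<MutableRecord> kept = new ArrayList<>();
        for (int v : inputs) {
            shared.value = v;   // simulates the framework overwriting the record
            kept.add(shared);   // every list slot aliases the one shared object
        }
        return values(kept);
    }

    // Safe pattern: clones the record before keeping it.
    static List<Integer> keepCopies(int[] inputs) {
        MutableRecord shared = new MutableRecord();
        List<MutableRecord> kept = new ArrayList<>();
        for (int v : inputs) {
            shared.value = v;
            kept.add(shared.copy());
        }
        return values(kept);
    }

    private static List<Integer> values(List<MutableRecord> records) {
        List<Integer> out = new ArrayList<>();
        for (MutableRecord r : records) {
            out.add(r.value);
        }
        return out;
    }

    public static void main(String[] args) {
        int[] inputs = {3, 1, 2};
        System.out.println(keepReferences(inputs)); // [2, 2, 2]
        System.out.println(keepCopies(inputs));     // [3, 1, 2]
    }
}
```
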