public final class Distinct extends Object
Functions for computing the distinct elements of a PCollection.

| Modifier and Type | Method and Description |
|---|---|
| static <S> PCollection<S> | distinct(PCollection<S> input): Construct a new PCollection that contains the unique elements of a given input PCollection. |
| static <S> PCollection<S> | distinct(PCollection<S> input, int flushEvery): A distinct operation that gives the client more control over how frequently elements are flushed to disk in order to allow control over performance or memory consumption. |
| static <K,V> PTable<K,V> | distinct(PTable<K,V> input): A PTable<K, V> analogue of the distinct function. |
| static <K,V> PTable<K,V> | distinct(PTable<K,V> input, int flushEvery): A PTable<K, V> analogue of the distinct function. |

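
The trade-off behind the flushEvery parameter can be illustrated without Crunch itself. The following is a minimal plain-Java sketch, not Crunch's implementation. The class DistinctSketch and its methods localDedup and distinct are hypothetical names introduced only for this example, and an in-memory list stands in for the output that would be flushed to disk. It assumes a two-stage approach: a bounded buffer removes duplicates locally and is emitted whenever it holds flushEvery unique values, and a later stage removes the duplicates that remain.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Hypothetical illustration only; this class is not part of the Crunch API.
final class DistinctSketch {

  // Stage 1 (assumed): buffer unique values and emit the buffer whenever it
  // holds flushEvery values. The "emitted" list stands in for data flushed
  // to disk. Duplicates that fall into different flushes survive this stage.
  static <S> List<S> localDedup(Iterable<S> input, int flushEvery) {
    if (flushEvery <= 0) {
      throw new IllegalArgumentException("flushEvery must be positive");
    }
    List<S> emitted = new ArrayList<>();
    Set<S> buffer = new LinkedHashSet<>();
    for (S value : input) {
      buffer.add(value); // a repeat of a buffered value is ignored here
      if (buffer.size() >= flushEvery) {
        emitted.addAll(buffer); // flush and release the memory
        buffer.clear();
      }
    }
    emitted.addAll(buffer); // flush whatever is left at the end
    return emitted;
  }

  // Stage 2 (assumed): remove the duplicates that survived stage 1,
  // keeping the first occurrence of each value.
  static <S> List<S> distinct(Iterable<S> input, int flushEvery) {
    return new ArrayList<>(new LinkedHashSet<>(localDedup(input, flushEvery)));
  }
}
```

In this sketch a small flushEvery keeps the buffer small but lets more duplicates through to the second stage, while a large flushEvery removes more duplicates early at the cost of holding more values in memory. The final result is the same in both cases.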
public static <S> PCollection<S> distinct(PCollection<S> input)
Construct a new PCollection that contains the unique elements of a given input PCollection.
Parameters:
input - The input PCollection
Returns:
A new PCollection that contains the unique elements of the input

public static <K,V> PTable<K,V> distinct(PTable<K,V> input)
A PTable<K, V> analogue of the distinct function.

public static <S> PCollection<S> distinct(PCollection<S> input, int flushEvery)
A distinct operation that gives the client more control over how frequently elements are flushed to disk in order to allow control over performance or memory consumption.
Parameters:
input - The input PCollection
flushEvery - Flush the elements to disk whenever we encounter this many unique values
Returns:
A new PCollection that contains the unique elements of the input

Copyright © 2015 The Apache Software Foundation. All Rights Reserved.