public final class Distinct extends Object
PCollection
.Modifier and Type | Method and Description |
---|---|
static <S> PCollection<S> |
distinct(PCollection<S> input)
Construct a new
PCollection that contains the unique elements of a
given input PCollection . |
static <S> PCollection<S> |
distinct(PCollection<S> input,
int flushEvery)
A
distinct operation that gives the client more control over how frequently
elements are flushed to disk in order to allow control over performance or
memory consumption. |
static <K,V> PTable<K,V> |
distinct(PTable<K,V> input)
A
PTable<K, V> analogue of the distinct function. |
static <K,V> PTable<K,V> |
distinct(PTable<K,V> input,
int flushEvery)
A
PTable<K, V> analogue of the distinct function. |
public static <S> PCollection<S> distinct(PCollection<S> input)
PCollection
that contains the unique elements of a
given input PCollection
.input
- The input PCollection
PCollection
that contains the unique elements of the inputpublic static <K,V> PTable<K,V> distinct(PTable<K,V> input)
PTable<K, V>
analogue of the distinct
function.public static <S> PCollection<S> distinct(PCollection<S> input, int flushEvery)
distinct
operation that gives the client more control over how frequently
elements are flushed to disk in order to allow control over performance or
memory consumption.input
- The input PCollection
flushEvery
- Flush the elements to disk whenever we encounter this many unique valuesPCollection
that contains the unique elements of the inputCopyright © 2016 The Apache Software Foundation. All rights reserved.