Package | Description |
---|---|
org.apache.crunch | Client-facing API and core abstractions. |
org.apache.crunch.contrib.bloomfilter | Support for creating Bloom Filters. |
org.apache.crunch.lib | Joining, sorting, aggregating, and other commonly used functionality. |

Modifier and Type | Method and Description |
---|---|
PObject<Collection<S>> | PCollection.asCollection() |
PObject<Map<K,V>> | PTable.asMap() |
PObject<Long> | PCollection.length(): Returns the number of elements represented by this PCollection. |
PObject<S> | PCollection.max(): Returns a PObject of the maximum element of this instance. |
PObject<S> | PCollection.min(): Returns a PObject of the minimum element of this instance. |
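
As a rough illustration of what the PObject-returning methods above compute, the following sketch uses only the Java standard library. `Deferred<T>` is a hypothetical stand-in for `PObject<T>`, and a plain `List` stands in for a `PCollection`; neither the class nor the helper methods are part of Crunch. The sketch assumes the result is computed only when it is first requested.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.function.Supplier;

// Hypothetical stand-in for illustration only; this is NOT the Crunch API.
class PObjectSketch {

    /** Stand-in for PObject<T>: the value is computed on the first getValue() call. */
    static final class Deferred<T> {
        private final Supplier<T> computation;
        private T value;
        private boolean computed;

        Deferred(Supplier<T> computation) {
            this.computation = computation;
        }

        T getValue() {
            if (!computed) {
                value = computation.get();
                computed = true;
            }
            return value;
        }
    }

    // Mirrors the shape of PCollection.length(): a deferred element count.
    static <S> Deferred<Long> length(List<S> elements) {
        return new Deferred<>(() -> (long) elements.size());
    }

    // Mirrors the shape of PCollection.max(): a deferred maximum element.
    static <S extends Comparable<S>> Deferred<S> max(List<S> elements) {
        return new Deferred<>(() -> Collections.max(elements));
    }

    // Mirrors the shape of PCollection.min(): a deferred minimum element.
    static <S extends Comparable<S>> Deferred<S> min(List<S> elements) {
        return new Deferred<>(() -> Collections.min(elements));
    }

    public static void main(String[] args) {
        List<Integer> data = Arrays.asList(3, 1, 4, 1, 5);
        Deferred<Long> count = length(data); // nothing has been computed yet
        System.out.println("length = " + count.getValue());
        System.out.println("max = " + max(data).getValue());
        System.out.println("min = " + min(data).getValue());
    }
}
```

In this sketch the handle can be passed around freely, and the underlying work runs once, on the first `getValue()` call.
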

Modifier and Type | Method and Description |
---|---|
static PObject<Map<String,org.apache.hadoop.util.bloom.BloomFilter>> | BloomFilterFactory.createFilter(org.apache.hadoop.fs.Path inputPath, BloomFilterFn<String> filterFn): Takes an input path and generates BloomFilters for all text files in that path. |
static <T> PObject<org.apache.hadoop.util.bloom.BloomFilter> | BloomFilterFactory.createFilter(PCollection<T> collection, BloomFilterFn<T> filterFn) |
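
For readers unfamiliar with the data structure these factory methods produce, here is a minimal sketch of a Bloom filter using only the Java standard library. `BloomFilterSketch`, its constructor parameters, and its hashing scheme are all hypothetical; this is not Hadoop's `org.apache.hadoop.util.bloom.BloomFilter`, and it does not show how `BloomFilterFactory` builds its filters.

```java
import java.nio.charset.StandardCharsets;
import java.util.BitSet;

// Hypothetical, simplified Bloom filter for illustration only.
// NOT Hadoop's BloomFilter and NOT part of Crunch.
class BloomFilterSketch {
    private final BitSet bits;
    private final int numBits;
    private final int numHashes;

    BloomFilterSketch(int numBits, int numHashes) {
        this.bits = new BitSet(numBits);
        this.numBits = numBits;
        this.numHashes = numHashes;
    }

    // FNV-1a style hash over the bytes, starting from the given seed.
    private static int hash(byte[] data, int seed) {
        int h = seed;
        for (byte b : data) {
            h ^= (b & 0xff);
            h *= 16777619;
        }
        return h;
    }

    // Double hashing: the i-th bit index is derived from two base hashes.
    private int index(byte[] data, int i) {
        int h1 = hash(data, 0x811C9DC5);
        int h2 = hash(data, 0x01000193) | 1; // force odd so the step is never zero
        return Math.floorMod(h1 + i * h2, numBits);
    }

    void add(String key) {
        byte[] data = key.getBytes(StandardCharsets.UTF_8);
        for (int i = 0; i < numHashes; i++) {
            bits.set(index(data, i));
        }
    }

    // False means definitely absent; true means possibly present.
    boolean mightContain(String key) {
        byte[] data = key.getBytes(StandardCharsets.UTF_8);
        for (int i = 0; i < numHashes; i++) {
            if (!bits.get(index(data, i))) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        BloomFilterSketch filter = new BloomFilterSketch(1024, 3);
        filter.add("apple");
        filter.add("banana");
        System.out.println(filter.mightContain("apple"));  // always true for added keys
        System.out.println(filter.mightContain("cherry")); // usually false, but may be a false positive
    }
}
```

The key property is asymmetric: an added key is never reported as absent, while a key that was never added may occasionally be reported as present.
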

Modifier and Type | Method and Description |
---|---|
static <S> PObject<Long> | Aggregate.length(PCollection<S> collect): Returns the number of elements in the provided PCollection. |
static <S> PObject<S> | Aggregate.max(PCollection<S> collect): Returns the largest numerical element from the input collection. |
static <S> PObject<S> | Aggregate.min(PCollection<S> collect): Returns the smallest numerical element from the input collection. |

Copyright © 2013 The Apache Software Foundation. All Rights Reserved.