Package | Description |
---|---|
org.apache.crunch.lambda |
Alternative Crunch API using Java 8 features to allow construction of pipelines using lambda functions and method
references.
|
org.apache.crunch.lib.join |
Inner and outer joins on collections.
|
Modifier and Type | Method and Description |
---|---|
default <U> LTable<K,Pair<V,U>> |
LTable.join(LTable<K,U> other,
JoinType joinType,
JoinStrategy<K,V,U> joinStrategy)
Join this table to another
LTable which has the same key type using the provided JoinType and
JoinStrategy |
Modifier and Type | Class and Description |
---|---|
class |
BloomFilterJoinStrategy<K,U,V>
Join strategy that uses a Bloom filter
that is trained on the keys of the left-side table to filter the key/value pairs of the right-side
table before sending through the shuffle and reduce phase.
|
class |
DefaultJoinStrategy<K,U,V>
Default join strategy that simply sends all data through the map, shuffle, and reduce phase.
|
class |
MapsideJoinStrategy<K,U,V>
Utility for doing map side joins on a common key between two
PTable s. |
class |
ShardedJoinStrategy<K,U,V>
JoinStrategy that splits the key space up into shards.
|
Constructor and Description |
---|
BloomFilterJoinStrategy(int numElements,
float falsePositiveRate,
JoinStrategy<K,U,V> delegateJoinStrategy)
Instantiate with the expected number of unique keys in the left table, and the acceptable
false positive rate for the Bloom filter, and an underlying join strategy to delegate to.
|
Copyright © 2016 The Apache Software Foundation. All rights reserved.