| 
 | |||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||
java.lang.Objectorg.apache.crunch.impl.mem.collect.MemCollection<Pair<K,V>>
org.apache.crunch.impl.mem.collect.MemTable<K,V>
public class MemTable<K,V>
| Constructor Summary | |
|---|---|
| MemTable(Iterable<Pair<K,V>> collect) | |
| MemTable(Iterable<Pair<K,V>> collect,
         PTableType<K,V> ptype,
         String name) | |
| Method Summary | ||
|---|---|---|
|  PObject<Map<K,V>> | asMap()Returns a PObjectencapsulating aMapmade up of the keys and values in thisPTable. | |
|  PTable<K,V> | bottom(int count)Returns a PTable made up of the pairs in this PTable with the smallest value field. | |
|  PTable<K,V> | cache()Marks this data as cached using the default CachingOptions. | |
|  PTable<K,V> | cache(CachingOptions options)Marks this data as cached using the given CachingOptions. | |
| 
 | cogroup(PTable<K,U> other)Co-group operation with the given table on common keys. | |
|  PTable<K,Collection<V>> | collectValues()Aggregate all of the values with the same key into a single key-value pair in the returned PTable. | |
|  PTable<K,V> | filter(FilterFn<Pair<K,V>> filterFn)Apply the given filter function to this instance and return the resulting PCollection. | |
|  PTable<K,V> | filter(String name,
       FilterFn<Pair<K,V>> filterFn)Apply the given filter function to this instance and return the resulting PCollection. | |
|  PType<K> | getKeyType()Returns the PTypeof the key. | |
|  PTableType<K,V> | getPTableType()Returns the PTableTypeof thisPTable. | |
|  PType<V> | getValueType()Returns the PTypeof the value. | |
|  PGroupedTable<K,V> | groupByKey()Performs a grouping operation on the keys of this table. | |
|  PGroupedTable<K,V> | groupByKey(GroupingOptions options)Performs a grouping operation on the keys of this table, using the additional GroupingOptionsto control how the grouping is executed. | |
|  PGroupedTable<K,V> | groupByKey(int numPartitions)Performs a grouping operation on the keys of this table, using the given number of partitions. | |
| 
 | join(PTable<K,U> other)Perform an inner join on this table and the one passed in as an argument on their common keys. | |
|  PCollection<K> | keys()Returns a PCollectionmade up of the keys in this PTable. | |
| 
 | mapKeys(MapFn<K,K2> mapFn,
        PType<K2> ptype)Returns a PTablethat has the same values as this instance, but
 uses the given function to map the keys. | |
| 
 | mapKeys(String name,
        MapFn<K,K2> mapFn,
        PType<K2> ptype)Returns a PTablethat has the same values as this instance, but
 uses the given function to map the keys. | |
| 
 | mapValues(MapFn<V,U> mapFn,
          PType<U> ptype)Returns a PTablethat has the same keys as this instance, but
 uses the given function to map the values. | |
| 
 | mapValues(String name,
          MapFn<V,U> mapFn,
          PType<U> ptype)Returns a PTablethat has the same keys as this instance, but
 uses the given function to map the values. | |
|  Map<K,V> | materializeToMap()Returns a Map | |
|  PTable<K,V> | top(int count)Returns a PTable made up of the pairs in this PTable with the largest value field. | |
|  PTable<K,V> | union(PTable<K,V>... others)Returns a PTableinstance that acts as the union of thisPTableand the inputPTables. | |
|  PTable<K,V> | union(PTable<K,V> other)Returns a PTableinstance that acts as the union of thisPTableand the otherPTables. | |
|  PCollection<V> | values()Returns a PCollectionmade up of the values in this PTable. | |
|  PTable<K,V> | write(Target target)Write the contents of this PCollectionto the givenTarget,
 using the storage format specified by the target. | |
|  PTable<K,V> | write(Target target,
      Target.WriteMode writeMode)Write the contents of this PCollectionto the givenTarget,
 using the givenTarget.WriteModeto handle existing
 targets. | |
| Methods inherited from class org.apache.crunch.impl.mem.collect.MemCollection | 
|---|
| asCollection, asReadable, by, by, count, getCollection, getName, getPipeline, getPType, getSize, getTypeFamily, length, materialize, max, min, parallelDo, parallelDo, parallelDo, parallelDo, parallelDo, parallelDo, toString, union, union | 
| Methods inherited from class java.lang.Object | 
|---|
| clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait | 
| Methods inherited from interface org.apache.crunch.PCollection | 
|---|
| asCollection, asReadable, by, by, count, getName, getPipeline, getPType, getSize, getTypeFamily, length, materialize, max, min, parallelDo, parallelDo, parallelDo, parallelDo, parallelDo, parallelDo, union, union | 
| Constructor Detail | 
|---|
public MemTable(Iterable<Pair<K,V>> collect)
public MemTable(Iterable<Pair<K,V>> collect,
                PTableType<K,V> ptype,
                String name)
| Method Detail | 
|---|
public PTable<K,V> union(PTable<K,V> other)
PTablePTable instance that acts as the union of this
 PTable and the other PTables.
union in interface PTable<K,V>public PTable<K,V> union(PTable<K,V>... others)
PTablePTable instance that acts as the union of this
 PTable and the input PTables.
union in interface PTable<K,V>public PGroupedTable<K,V> groupByKey()
PTable
groupByKey in interface PTable<K,V>PGroupedTable instance that represents the groupingpublic PGroupedTable<K,V> groupByKey(int numPartitions)
PTable
groupByKey in interface PTable<K,V>numPartitions - The number of partitions for the data.
PGroupedTable instance that represents this groupingpublic PGroupedTable<K,V> groupByKey(GroupingOptions options)
PTableGroupingOptions to control how the grouping is executed.
groupByKey in interface PTable<K,V>options - The grouping options to use
PGroupedTable instance that represents the groupingpublic PTable<K,V> write(Target target)
PCollectionPCollection to the given Target,
 using the storage format specified by the target.
write in interface PCollection<Pair<K,V>>write in interface PTable<K,V>write in class MemCollection<Pair<K,V>>target - The target to write to
public PTable<K,V> write(Target target,
                         Target.WriteMode writeMode)
PCollectionPCollection to the given Target,
 using the given Target.WriteMode to handle existing
 targets.
write in interface PCollection<Pair<K,V>>write in interface PTable<K,V>write in class MemCollection<Pair<K,V>>target - The targetwriteMode - The rule for handling existing outputs at the target locationpublic PTable<K,V> cache()
PCollectionCachingOptions. Cached PCollections will only
 be processed once, and then their contents will be saved so that downstream code can process them many times.
cache in interface PCollection<Pair<K,V>>cache in interface PTable<K,V>cache in class MemCollection<Pair<K,V>>PCollection instancepublic PTable<K,V> cache(CachingOptions options)
PCollectionCachingOptions. Cached PCollections will only
 be processed once and then their contents will be saved so that downstream code can process them many times.
cache in interface PCollection<Pair<K,V>>cache in interface PTable<K,V>cache in class MemCollection<Pair<K,V>>options - the options that control the cache settings for the data
PCollection instancepublic PTableType<K,V> getPTableType()
PTablePTableType of this PTable.
getPTableType in interface PTable<K,V>public PType<K> getKeyType()
PTablePType of the key.
getKeyType in interface PTable<K,V>public PType<V> getValueType()
PTablePType of the value.
getValueType in interface PTable<K,V>public PTable<K,V> filter(FilterFn<Pair<K,V>> filterFn)
PCollectionPCollection.
filter in interface PCollection<Pair<K,V>>filter in interface PTable<K,V>filter in class MemCollection<Pair<K,V>>
public PTable<K,V> filter(String name,
                          FilterFn<Pair<K,V>> filterFn)
PCollectionPCollection.
filter in interface PCollection<Pair<K,V>>filter in interface PTable<K,V>filter in class MemCollection<Pair<K,V>>name - An identifier for this processing stepfilterFn - The FilterFn to apply
public <U> PTable<K,U> mapValues(MapFn<V,U> mapFn,
                                 PType<U> ptype)
PTablePTable that has the same keys as this instance, but
 uses the given function to map the values.
mapValues in interface PTable<K,V>
public <U> PTable<K,U> mapValues(String name,
                                 MapFn<V,U> mapFn,
                                 PType<U> ptype)
PTablePTable that has the same keys as this instance, but
 uses the given function to map the values.
mapValues in interface PTable<K,V>
public <K2> PTable<K2,V> mapKeys(MapFn<K,K2> mapFn,
                                 PType<K2> ptype)
PTablePTable that has the same values as this instance, but
 uses the given function to map the keys.
mapKeys in interface PTable<K,V>
public <K2> PTable<K2,V> mapKeys(String name,
                                 MapFn<K,K2> mapFn,
                                 PType<K2> ptype)
PTablePTable that has the same values as this instance, but
 uses the given function to map the keys.
mapKeys in interface PTable<K,V>public PTable<K,V> top(int count)
PTable
top in interface PTable<K,V>count - The number of pairs to returnpublic PTable<K,V> bottom(int count)
PTable
bottom in interface PTable<K,V>count - The number of pairs to returnpublic PTable<K,Collection<V>> collectValues()
PTable
collectValues in interface PTable<K,V>public <U> PTable<K,Pair<V,U>> join(PTable<K,U> other)
PTable
join in interface PTable<K,V>public <U> PTable<K,Pair<Collection<V>,Collection<U>>> cogroup(PTable<K,U> other)
PTable
cogroup in interface PTable<K,V>public PCollection<K> keys()
PTablePCollection made up of the keys in this PTable.
keys in interface PTable<K,V>public PCollection<V> values()
PTablePCollection made up of the values in this PTable.
values in interface PTable<K,V>public Map<K,V> materializeToMap()
PTableNote: The contents of the returned map may not be exactly the same as this PTable, as a PTable is a multi-map (i.e. can contain multiple values for a single key).
materializeToMap in interface PTable<K,V>public PObject<Map<K,V>> asMap()
PTablePObject encapsulating a Map made up of the keys and values in this
 PTable.
 Note:The contents of the returned map may not be exactly the same as this PTable, as a PTable is a multi-map (i.e. can contain multiple values for a single key).
asMap in interface PTable<K,V>PObject encapsulating a Map made up of the keys and values in
 this PTable.| 
 | |||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||