Package org.openrefine.runners.local.pll
Class UnionPLL<T>
- java.lang.Object
-
- org.openrefine.runners.local.pll.PLL<T>
-
- org.openrefine.runners.local.pll.UnionPLL<T>
-
- Type Parameters:
T-
public class UnionPLL<T> extends PLL<T>
A PLL which represents the concatenation of a sequence of other PLLs- Author:
- Antonin Delpeuch
-
-
Nested Class Summary
Nested Classes Modifier and Type Class Description protected static classUnionPLL.UnionPartition-
Nested classes/interfaces inherited from class org.openrefine.runners.local.pll.PLL
PLL.LastFlush, PLL.PLLExecutionError
-
-
Field Summary
-
Fields inherited from class org.openrefine.runners.local.pll.PLL
cachedPartitions, context, id, name
-
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description protected CloseableIterator<T>compute(Partition partition)Iterate over the elements of the given partition.protected io.vavr.collection.Array<Long>computePartitionSizes()List<PLL<?>>getParents()Returns the PLLs that this PLL depends on, to compute its contents.io.vavr.collection.Array<? extends Partition>getPartitions()booleanhasCachedPartitionSizes()Is this PLL aware of the size of its partitions?-
Methods inherited from class org.openrefine.runners.local.pll.PLL
aggregate, batchPartitions, cacheAsync, collect, collectPartitionsAsync, concatenate, concatenate, count, dropFirstElements, dropLastElements, filter, flatMap, getContext, getId, getPartitionSizes, getQueryTree, isCached, isEmpty, iterate, iterateFromPartition, iterator, limitPartitions, map, mapPartitions, mapToPair, mapToPair, numPartitions, retainPartitions, runOnPartitions, runOnPartitions, runOnPartitionsAsync, runOnPartitionsAsync, runOnPartitionsWithoutInterruption, runOnPartitionsWithoutInterruption, saveAsTextFile, saveAsTextFileAsync, scanMap, scanMapStream, sort, take, toString, uncache, withCachedPartitionSizes, writeOriginalPartition, writePartition, writePlannedPartition, zipWithIndex
-
-
-
-
Method Detail
-
compute
protected CloseableIterator<T> compute(Partition partition)
Description copied from class:PLLIterate over the elements of the given partition. This is the method that should be implemented by subclasses. As this method forces computation, ignoring any caching, consumers should not call it directly but rather usePLL.iterate(Partition). Once the iterator is not needed anymore, it should be closed. This makes it possible to release the underlying resources supporting it, such as open files or sockets.
-
getPartitions
public io.vavr.collection.Array<? extends Partition> getPartitions()
- Specified by:
getPartitionsin classPLL<T>- Returns:
- the partitions in this list
-
computePartitionSizes
protected io.vavr.collection.Array<Long> computePartitionSizes()
- Overrides:
computePartitionSizesin classPLL<T>
-
hasCachedPartitionSizes
public boolean hasCachedPartitionSizes()
Description copied from class:PLLIs this PLL aware of the size of its partitions?- Overrides:
hasCachedPartitionSizesin classPLL<T>
-
getParents
public List<PLL<?>> getParents()
Description copied from class:PLLReturns the PLLs that this PLL depends on, to compute its contents. This is used for debugging purposes, to display the tree of dependencies of a given PLL.- Specified by:
getParentsin classPLL<T>- See Also:
PLL.getQueryTree()
-
-