- All Implemented Interfaces:
- Serializable, org.apache.spark.api.java.function.PairFlatMapFunction<Iterator<scala.Tuple2<Long,FrameBlock>>,Integer,Object>
- Enclosing class:
- MultiReturnParameterizedBuiltinSPInstruction
public static class MultiReturnParameterizedBuiltinSPInstruction.TransformEncodeBuildFunction
extends Object
implements org.apache.spark.api.java.function.PairFlatMapFunction<Iterator<scala.Tuple2<Long,FrameBlock>>,Integer,Object>
This function pre-aggregates distinct values of recoded columns per partition
(part of distributed recode map construction, used for recoding, binning and
dummy coding). We operate directly over schema-specific objects to avoid
unnecessary string conversion, as well as reduce memory overhead and shuffle.
- See Also:
- Serialized Form