org.apache.sysml.runtime.instructions.spark

Class MultiReturnParameterizedBuiltinSPInstruction.TransformEncodeBuildFunction

  • java.lang.Object
    • org.apache.sysml.runtime.instructions.spark.MultiReturnParameterizedBuiltinSPInstruction.TransformEncodeBuildFunction
  • All Implemented Interfaces:
    Serializable, org.apache.spark.api.java.function.PairFlatMapFunction<Iterator<scala.Tuple2<Long,FrameBlock>>,Integer,Object>
    Enclosing class:
    MultiReturnParameterizedBuiltinSPInstruction


    public static class MultiReturnParameterizedBuiltinSPInstruction.TransformEncodeBuildFunction
    extends Object
    implements org.apache.spark.api.java.function.PairFlatMapFunction<Iterator<scala.Tuple2<Long,FrameBlock>>,Integer,Object>
    This function pre-aggregates distinct values of recoded columns per partition (part of distributed recode map construction, used for recoding, binning and dummy coding). We operate directly over schema-specific objects to avoid unnecessary string conversion, as well as reduce memory overhead and shuffle.
    See Also:
    Serialized Form

Copyright © 2017 The Apache Software Foundation. All rights reserved.