【发布时间】:2019-04-15 13:58:10
【问题描述】:
我想将几个经过训练的管道连接到一个,这类似于 "Spark add new fitted stage to a exitsting PipelineModel without fitting again" 但是下面的解决方案适用于 PySpark。
> pipe_model_new = PipelineModel(stages = [pipe_model , pipe_model2])
> final_df = pipe_model_new.transform(df1)
在 Apache Spark 2.0 中,“PipelineModel”的构造函数被标记为私有,因此不能在外部调用。在“Pipeline”类中,只有“fit”方法创建“PipelineModel”
val pipelineModel = new PipelineModel("randomUID", trainedStages)
val df_final_full = pipelineModel.transform(df)
Error:(266, 26) constructor PipelineModel in class PipelineModel cannot be accessed in class Preprocessor val pipelineModel = new PipelineModel("randomUID", trainedStages)
【问题讨论】:
标签: apache-spark pipeline apache-spark-ml apache-spark-2.0