【发布时间】:2019-11-25 22:55:04
【问题描述】:
我正在尝试从 spark 数据框中选择特定的列。
具体的列列表是:
required_cols = ['123ABC.PM','456DEF.PM']
Spark_df 采用给定格式:
'123ABC.PM', '54SWC.PM', '456DEF.PM', '154AS.LB'
23.5 34.5 400.7 100.3
25.4 37.6 401 100
and so on
我已经试过了:
spark_df_new = spark_df.select(required_cols)
但我收到错误:
"cannot resolve '`123ABC.PM`' given input columns: [123ABC.PM,54SWC.PM, 456DEF.PM,154AS.LB]
``
【问题讨论】:
标签: apache-spark pyspark apache-spark-sql pyspark-sql