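【Posted】: 2020-12-03 14:10:43
【Question】:
I'm having a hard time joining these two DataFrame views in Spark Scala, because I can't work out how to modify the specific column values. I think I need some kind of transpose/join, but I can't figure it out.
Here is the first DataFrame: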
var sample_df = Seq(("john","morning","7am"),("john","night","10pm"),("bob","morning","8am"),("bob","night","11pm"),("phil","morning","9am"),("phil","night","10pm")).toDF("person","time_of_day","wake/sleep hour")
Here is the second DataFrame:
var sample_df2 = Seq(("john","6am","11pm"),("bob","7am","2am"),("phil","8am","1am")).toDF("person","morning_earliest","night_latest")
This is the resulting DataFrame I want to produce:
var resulting_df = Seq(("john","morning","7am","6am"),("john","night","10pm","11pm"),("bob","morning","8am","7am"),("bob","night","11pm","2am"),("phil","morning","9am","8am"),("phil","night","10pm","1am")).toDF("person","time_of_day","wake/sleep hour","earliest/latest")
Any help would be greatly appreciated. Thanks, and have a great day!
【Discussion】:
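One possible approach, offered as a minimal sketch rather than a definitive answer: join the two DataFrames on `person`, then use `when` to pick `morning_earliest` or `night_latest` depending on `time_of_day`, and drop the two source columns. The object name `WakeSleepJoin` and the helper `withLimits` are hypothetical names introduced for illustration; the column names come from the question. The sketch assumes a local `SparkSession` and that `time_of_day` only takes the values `morning` and `night` (any other value yields `null` in the new column).

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}
import org.apache.spark.sql.functions.{col, when}

object WakeSleepJoin {
  // Hypothetical helper: attaches the matching limit to every (person, time_of_day) row.
  def withLimits(hours: DataFrame, limits: DataFrame): DataFrame =
    hours
      .join(limits, Seq("person")) // inner join; "person" appears once in the output
      .withColumn(
        "earliest/latest",
        when(col("time_of_day") === "morning", col("morning_earliest"))
          .when(col("time_of_day") === "night", col("night_latest"))
        // no otherwise(): unexpected time_of_day values become null
      )
      .drop("morning_earliest", "night_latest")

  def main(args: Array[String]): Unit = {
    // Assumption: a local session, as in spark-shell or a small test program.
    val spark = SparkSession.builder().master("local[*]").appName("wake-sleep-join").getOrCreate()
    import spark.implicits._

    val sample_df = Seq(
      ("john", "morning", "7am"), ("john", "night", "10pm"),
      ("bob", "morning", "8am"), ("bob", "night", "11pm"),
      ("phil", "morning", "9am"), ("phil", "night", "10pm")
    ).toDF("person", "time_of_day", "wake/sleep hour")

    val sample_df2 = Seq(
      ("john", "6am", "11pm"), ("bob", "7am", "2am"), ("phil", "8am", "1am")
    ).toDF("person", "morning_earliest", "night_latest")

    withLimits(sample_df, sample_df2).show()
    spark.stop()
  }
}
```

The result should have the columns `person`, `time_of_day`, `wake/sleep hour`, `earliest/latest`. Row order after a join is not guaranteed, so add an explicit `orderBy` if the order matters. Unpivoting `sample_df2` first and joining on both `person` and `time_of_day` would be an alternative, but with only two fixed columns the conditional column seems simpler.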
Tags: scala apache-spark apache-spark-sql