【Posted】: 2022-01-05 09:32:39
【Question】:
What is the most efficient way to swap the names of two columns in a Delta Lake table? Suppose I have the following columns:
Address | Name
I want to swap the names so that I end up with:
Name | Address
First, I renamed both columns to temporary names:
# Pass 1: move both columns to temporary names and overwrite the table.
spark.read.table("table") \
    .withColumnRenamed("address", "name1") \
    .withColumnRenamed("name", "address1") \
    .write \
    .format("delta") \
    .mode("overwrite") \
    .option("overwriteSchema", "true") \
    .saveAsTable("table")
Then I renamed the temporary columns to their final names:
# Pass 2: rename the temporary columns to the final names and overwrite again.
spark.read.table("table") \
    .withColumnRenamed("name1", "name") \
    .withColumnRenamed("address1", "address") \
    .write \
    .format("delta") \
    .mode("overwrite") \
    .option("overwriteSchema", "true") \
    .saveAsTable("table")
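For comparison, the two full rewrites could probably be collapsed into a single pass by renaming every column positionally with `DataFrame.toDF`. The following is only a minimal sketch: the helper names `swapped_names` and `swap_and_overwrite` are hypothetical, the name matching is assumed to be case-insensitive (Spark's default), and the write options are copied unchanged from the snippets in this question.

```python
def swapped_names(columns, first, second):
    """Return a copy of `columns` with the names `first` and `second` exchanged.

    Matching is case-insensitive (an assumption that mirrors Spark's default
    column resolution); the original spelling of each name is preserved.
    """
    lowered = [c.lower() for c in columns]
    i, j = lowered.index(first.lower()), lowered.index(second.lower())
    result = list(columns)
    result[i], result[j] = columns[j], columns[i]
    return result


def swap_and_overwrite(spark, table_name, first, second):
    # Hypothetical wrapper: one read, one positional rename, one overwrite.
    # `spark` is expected to be an existing SparkSession.
    df = spark.read.table(table_name)
    (df.toDF(*swapped_names(df.columns, first, second))
       .write
       .format("delta")
       .mode("overwrite")
       .option("overwriteSchema", "true")
       .saveAsTable(table_name))


# Pure-Python check of the name mapping; no Spark session is needed for this part.
print(swapped_names(["Address", "Name"], "address", "name"))  # ['Name', 'Address']
```

With a live session this would be called as `swap_and_overwrite(spark, "table", "address", "name")`. It still rewrites the data once, so it halves the work of the two-step version rather than avoiding the rewrite entirely.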
【Discussion】:
Tags: apache-spark pyspark databricks delta-lake