【Posted】: 2017-12-20 04:17:32
【Question】:
I have a stream-sourced DataFrame df in Python (PySpark) that holds all the data I want to write to a Cassandra table using the spark-cassandra-connector. I have tried two approaches:
# Attempt 1: the batch writer
df.write \
    .format("org.apache.spark.sql.cassandra") \
    .mode('append') \
    .options(table="myTable", keyspace="myKeySpace") \
    .save()
# Attempt 2: the streaming writer
query = df.writeStream \
    .format("org.apache.spark.sql.cassandra") \
    .outputMode('append') \
    .options(table="myTable", keyspace="myKeySpace") \
    .start()
query.awaitTermination()
However, I keep getting these errors, respectively:
pyspark.sql.utils.AnalysisException: "'write' can not be called on streaming Dataset/DataFrame;
and
java.lang.UnsupportedOperationException: Data source org.apache.spark.sql.cassandra does not support streamed writing.
Is there any way to write my streaming DataFrame to my Cassandra table?
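One possible direction, sketched under assumptions the question does not state: on Spark 2.4 or later, `DataStreamWriter.foreachBatch` passes each micro-batch to a function as an ordinary (non-streaming) DataFrame, so the batch writer from the first attempt could be reused there. The helper names `write_batch_to_cassandra` and `start_cassandra_sink` are hypothetical; the table and keyspace are the ones used in the snippets above. This is not confirmed against the connector version in the question.

```python
# Sketch only. Assumes Spark 2.4+ (for DataStreamWriter.foreachBatch) and that
# the spark-cassandra-connector package is available to the Spark session.

def write_batch_to_cassandra(batch_df, batch_id):
    """Write one micro-batch using the connector's batch writer.

    Inside foreachBatch, batch_df is a regular (non-streaming) DataFrame,
    so calling .write on it is permitted. batch_id is unused here.
    """
    (batch_df.write
        .format("org.apache.spark.sql.cassandra")
        .mode("append")
        .options(table="myTable", keyspace="myKeySpace")
        .save())


def start_cassandra_sink(streaming_df):
    """Start a streaming query that hands every micro-batch to the writer above."""
    return (streaming_df.writeStream
        .outputMode("append")
        .foreachBatch(write_batch_to_cassandra)
        .start())
```

With the streaming DataFrame from the question, this would be used as `query = start_cassandra_sink(df)` followed by `query.awaitTermination()`. A long-running job would normally also set a `checkpointLocation` option on the writer.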
【Comments】:
Tags: apache-spark pyspark spark-cassandra-connector spark-structured-streaming