【发布时间】:2021-09-30 02:35:09
【问题描述】:
spark = SparkSession.builder.appName('QUEUEVQL').getOrCreate()
jsonStrings = {"Name": "SRIDHAR", "Author": "jangcy", "BlogEntries": 100, "Caller": "jangcy"}
dt = [jsonStrings]
dfs = spark.createDataFrame(dt).collect()
dfs2 = spark.sparkContext.parallelize(dfs).toDF()
dfs2.createOrReplaceTempView("QVQL")
resDf = spark.sql("select Name from QVQL")
resDfPandas = resDf.toPandas()
print(resDfPandas)
代码错误:
py4j.protocol.Py4JJavaError:调用 o490.collectToPython 时出错。 :org.apache.spark.SparkException:作业因阶段失败而中止:阶段 27.0 中的任务 0 失败 4 次,最近一次失败:阶段 27.0(TID 98)中丢失任务 0.3(172.17.7.28 执行程序 1):java.io .IOException: Cannot run program "python3.6": CreateProcess error=2, The system cannot find the file specified at java.lang.ProcessBuilder.start(ProcessBuilder.java:1048)
【问题讨论】:
标签: sql json apache-spark pyspark