【问题标题】:AWS Glue - User did not initialize spark contextAWS Glue - 用户未初始化火花上下文
【发布时间】:2020-07-27 18:06:39
【问题描述】:

我正在使用 AWS 胶水。我正在使用我的自定义 Spark 代码从 AWS 胶水中的 spark scala 作业中读取 AWS Glue 目录中基于配置单元元存储的表,请注意我正在编写自己的代码,这是我们的需要。代码按预期工作,它正在读取源表并加载到目标表,但每次工作仍然出错。

这是我的 sparksession

val spark = SparkSession.builder().appName("SPARK-Dev")
                            .master("local[*]")
                            .enableHiveSupport()
                            .getOrCreate

作业抛出此错误

 2020-03-27 17:07:53,282 ERROR [main] yarn.ApplicationMaster (Logging.scala:logError(91)) - Uncaught exception: 
java.lang.IllegalStateException: User did not initialize spark context!
    at org.apache.spark.deploy.yarn.ApplicationMaster.runDriver(ApplicationMaster.scala:485)
    at org.apache.spark.deploy.yarn.ApplicationMaster.org$apache$spark$deploy$yarn$ApplicationMaster$$runImpl(ApplicationMaster.scala:305)
    at org.apache.spark.deploy.yarn.ApplicationMaster$$anonfun$run$1.apply$mcV$sp(ApplicationMaster.scala:245)
    at org.apache.spark.deploy.yarn.ApplicationMaster$$anonfun$run$1.apply(ApplicationMaster.scala:245)
    at org.apache.spark.deploy.yarn.ApplicationMaster$$anonfun$run$1.apply(ApplicationMaster.scala:245)
    at org.apache.spark.deploy.yarn.ApplicationMaster$$anon$3.run(ApplicationMaster.scala:779)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:422)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1844)
    at org.apache.spark.deploy.yarn.ApplicationMaster.doAsUser(ApplicationMaster.scala:778)
    at org.apache.spark.deploy.yarn.ApplicationMaster.run(ApplicationMaster.scala:244)
    at org.apache.spark.deploy.yarn.ApplicationMaster$.main(ApplicationMaster.scala:803)
    at org.apache.spark.deploy.yarn.ApplicationMaster.main(ApplicationMaster.scala)

【问题讨论】:

  • 如果你给出完整的错误堆栈用户可以帮助你调试。
  • 完整的错误堆栈已更新
  • 我没有看到你创建了 sparkcontext。请参阅github.com/aws-samples/aws-glue-samples/blob/master/examples/… 并使用以下代码创建它: import com.amazonaws.services.glue.util.JsonOptions import com.amazonaws.services.glue.{DynamicFrame, GlueContext} import org.apache.spark.SparkContext object JoinAndRelationalize { def main(sysArgs: Array[String]): Unit = { val sc: SparkContext = new SparkContext() val glueContext: GlueContext = new GlueContext(sc)
  • 我没有使用 GlueContext 或 Glue 动态框架等,还需要初始化吗?

标签: apache-spark aws-glue


【解决方案1】:

您正在使用master("local[*]")。我不确定集群是否正确。尝试在没有master func的情况下使用它

【讨论】:

    猜你喜欢
    • 2019-01-26
    • 2018-07-10
    • 2020-06-25
    • 1970-01-01
    • 1970-01-01
    • 2019-05-03
    • 1970-01-01
    • 1970-01-01
    • 1970-01-01
    相关资源
    最近更新 更多