【发布时间】:2015-04-07 05:08:49
【问题描述】:
在 Google Compute Engine 上部署了一个 Hadoop(Yarn + Spark)集群,具有一个主服务器和两个从服务器。当我运行以下 shell 脚本时:
spark-submit --class org.apache.spark.examples.SparkPi --master yarn-cluster --num-executors 1 --driver-memory 1g --executor-memory 1g --executor-cores 1 /home/hadoop/spark-install/lib/spark-examples-1.1.0-hadoop2.4.0.jar 10
这项工作一直在运行,每秒钟我都会收到一条类似这样的消息:
15/02/06 22:47:12 INFO yarn.Client: Application report from ResourceManager:
application identifier: application_1423247324488_0008<br>
appId: 8<br>
clientToAMToken: null<br>
appDiagnostics:<br>
appMasterHost: hadoop-w-zrem.c.myapp.internal<br>
appQueue: default<br>
appMasterRpcPort: 0<br>
appStartTime: 1423261517468<br>
yarnAppState: RUNNING<br>
distributedFinalState: UNDEFINED<br>
appTrackingUrl: http://hadoop-m-xxxx:8088/proxy/application_1423247324488_0008/<br>
appUser: achitre
【问题讨论】:
标签: scala hadoop apache-spark google-compute-engine hadoop-yarn