[Posted]: 2016-01-25 17:20:42
[Question]:
I would like to use Apache Toree as a PySpark kernel for Jupyter:
https://github.com/apache/incubator-toree
However, it uses an older version of Spark (1.5.1, versus the current 1.6.0). I tried the method described at http://arnesund.com/2015/09/21/spark-cluster-on-openstack-with-multi-user-jupyter-notebook/ by creating kernel.js:
{
  "display_name": "PySpark",
  "language": "python",
  "argv": [
    "/usr/bin/python",
    "-m",
    "ipykernel",
    "-f",
    "{connection_file}"
  ],
  "env": {
    "SPARK_HOME": "/usr/local/Cellar/apache-spark/1.6.0/libexec",
    "PYTHONPATH": "/usr/local/Cellar/apache-spark/1.6.0/libexec/python/:/usr/local/Cellar/apache-spark/1.6.0/libexec/python/lib/py4j-0.9-src.zip",
    "PYTHONSTARTUP": "/usr/local/Cellar/apache-spark/1.6.0/libexec/python/pyspark/shell.py",
    "PYSPARK_SUBMIT_ARGS": "--master local[*] pyspark-shell"
  }
}
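However, I ran into some problems:
My Mac has no /jupyter/kernels path, so I ended up creating ~/.jupyter/kernels/pyspark. I am not sure whether this is the correct path.
Even after getting all the paths right, I still do not see PySpark listed as a kernel in Jupyter.
What am I missing?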
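One way to narrow this down is to check the kernel directory itself. Jupyter discovers a kernel by looking for a file named exactly kernel.json inside a kernel directory, and `jupyter kernelspec list` prints the kernels it has found. On macOS the per-user kernel location is typically ~/Library/Jupyter/kernels, although this may differ between installations. The sketch below is only a rough diagnostic, not part of Jupyter: the function name `check_kernel_dir` is hypothetical, the directory is the one mentioned in the question, and treating `argv`, `display_name`, and `language` as mandatory is an assumption of this sketch.

```python
import json
from pathlib import Path

# Keys this sketch expects in a kernel spec (treating all three as
# mandatory is an assumption made here for illustration).
EXPECTED_KEYS = ("argv", "display_name", "language")


def check_kernel_dir(kernel_dir):
    """Return a list of human-readable problems found in a kernel directory."""
    kernel_dir = Path(kernel_dir).expanduser()
    spec_file = kernel_dir / "kernel.json"  # Jupyter looks for this exact name

    if not spec_file.is_file():
        # Report what is actually there, e.g. a file named kernel.js.
        found = sorted(p.name for p in kernel_dir.glob("*")) if kernel_dir.is_dir() else []
        return [f"no kernel.json in {kernel_dir} (found: {found})"]

    try:
        spec = json.loads(spec_file.read_text())
    except json.JSONDecodeError as exc:
        return [f"kernel.json is not valid JSON: {exc}"]

    if not isinstance(spec, dict):
        return ["kernel.json does not contain a JSON object"]

    problems = [f"missing key: {key}" for key in EXPECTED_KEYS if key not in spec]
    if "argv" in spec and "{connection_file}" not in spec["argv"]:
        problems.append("argv lacks the {connection_file} placeholder")
    return problems


if __name__ == "__main__":
    # Directory taken from the question; adjust it to your own setup.
    for problem in check_kernel_dir("~/.jupyter/kernels/pyspark"):
        print(problem)
```

An empty result only means the spec file looks plausible; it does not prove that the directory is one Jupyter searches, which is what `jupyter kernelspec list` or `jupyter --paths` would show.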
[Discussion]:
Tags: apache-spark ipython pyspark jupyter