【Posted】: 2015-08-02 18:47:22
【Question】:
I am trying to run a MapReduce job with the following command:
hadoop jar /usr/lib/Hadoop/Hadoop-streaming-0.20.2-cdh3u2.jar –file mapper.py –mapper mapper.py –file reducer.py – reducer reducer.py –input /user/training/samplypy.txt –ouput /user/training/pythonMR/output
It fails with this exception:
Exception in thread "main" java.lang.ClassNotFoundException: –file
at java.net.URLClassLoader$1.run(URLClassLoader.java:366)
at java.net.URLClassLoader$1.run(URLClassLoader.java:355)
at java.security.AccessController.doPrivileged(Native Method)
at java.net.URLClassLoader.findClass(URLClassLoader.java:354)
at java.lang.ClassLoader.loadClass(ClassLoader.java:423)
at java.lang.ClassLoader.loadClass(ClassLoader.java:356)
at java.lang.Class.forName0(Native Method)
at java.lang.Class.forName(Class.java:264)
at org.apache.hadoop.util.RunJar.main(RunJar.java:149)
I am using Hadoop 1.0.3. I have tried several versions of the hadoop-streaming jar, for example:
hadoop-streaming-0.20.2-cdh3u2.jar
hadoop-streaming-1.2.0.jar
hadoop-streaming.jar
【Comments】:
-
Where is the documentation that told you to run these commands?
-
Is your $HADOOP_HOME environment variable set?
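-
One thing that may be worth checking: the exception names `–file` as the class it could not find, and the dash in front of `file` looks like an en dash (U+2013) rather than an ASCII hyphen. Hadoop streaming options are written with a plain hyphen (`-file`, `-mapper`, `-reducer`, `-input`, `-output`), so a command pasted from a word processor or web page may not be parsed as options at all. The sketch below is only a diagnostic aid, not part of Hadoop; the function names `find_suspect_dashes` and `normalize_dashes` are hypothetical helpers made up for this illustration, and the sample command is shortened.

```python
import unicodedata

# Dash-like characters that editors and web pages often substitute for "-".
SUSPECT_DASHES = {"\u2010", "\u2011", "\u2012", "\u2013", "\u2014", "\u2212"}


def find_suspect_dashes(command):
    """Return (position, Unicode name) for each non-ASCII dash in command."""
    return [
        (i, unicodedata.name(ch))
        for i, ch in enumerate(command)
        if ch in SUSPECT_DASHES
    ]


def normalize_dashes(command):
    """Replace every suspect dash with a plain ASCII hyphen."""
    return "".join("-" if ch in SUSPECT_DASHES else ch for ch in command)


if __name__ == "__main__":
    # Shortened, hypothetical command containing en dashes (U+2013).
    cmd = "hadoop jar streaming.jar \u2013file mapper.py \u2013mapper mapper.py"
    for position, name in find_suspect_dashes(cmd):
        print(f"column {position}: {name}")
    print(normalize_dashes(cmd))
```

If the check reports any hits, retyping the options by hand in the terminal is probably the simplest fix. It is also worth rereading the option names while doing so, since the command as posted has `–ouput` and a space in `– reducer`.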
Tags: python hadoop mapreduce hadoop-streaming