nutch 在hadoop运行时引用包不同所引发的问题
今天在部署Nutch的时候出现一个小问题,
Exception in thread "main" java.io.IOException: Call to /172.0.8.252:9000 failed on local exception: java.io.EOFExceptionat org.apache.hadoop.ipc.Client.wrapException(Client.java:1089)at org.apache.hadoop.ipc.Client.call(Client.java:1057)at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:226)at $Proxy0.getProtocolVersion(Unknown Source)at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:369)at org.apache.hadoop.hdfs.DFSClient.createRPCNamenode(DFSClient.java:111)at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:213)at org.apache.hadoop.hdfs.DFSClient.<init>(DFSClient.java:180)at org.apache.hadoop.hdfs.DistributedFileSystem.initialize(DistributedFileSystem.java:89)at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:1489)at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:66)at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:1523)at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:1505)at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:227)at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:110)at org.apache.nutch.crawl.Crawl.copyUrlFile(Crawl.java:129)at org.apache.nutch.crawl.Crawl.main(Crawl.java:505)Caused by: java.io.EOFExceptionat java.io.DataInputStream.readInt(DataInputStream.java:375)at org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:781)at org.apache.hadoop.ipc.Client$Connection.run(Client.java:689)
后来才发现是在nutch 中引用的是hadoop-core-0.20.3-CDH3-SNAPSHOT.jar 而运行的hadoop系统中是
hadoop-0.20.2-core.jar
hadoop-0.20.2-examples.jar
hadoop-0.20.2-fairscheduler.jar
hadoop-0.20.2-test.jar
因此造成了无法访问hdfs
1 楼 chenyuxxgl 2011-09-27 请问你的nutch是什么版本