读书人

Nutch运行后的有关问题

发布时间: 2012-03-23 12:06:21 作者: rapoo

Nutch运行后的问题
配置好,没有错误后运行
出现

crawl started in: crawl
rootUrlDir = urls
threads = 10
depth = 3
topN = 50
Injector: starting
Injector: crawlDb: crawl/crawldb
Injector: urlDir: urls
Injector: Converting injected urls to crawl db entries.
Exception in thread "main" java.io.IOException: Failed to get the current user's information.
at org.apache.hadoop.mapred.JobClient.getUGI(JobClient.java:717)
at org.apache.hadoop.mapred.JobClient.configureCommandLineOptions(JobClient.java:592)
at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:788)
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1142)
at org.apache.nutch.crawl.Injector.inject(Injector.java:160)
at org.apache.nutch.crawl.Crawl.main(Crawl.java:113)
Caused by: javax.security.auth.login.LoginException: Login failed: Cannot run program "whoami": CreateProcess error=2, ?????????
at org.apache.hadoop.security.UnixUserGroupInformation.login(UnixUserGroupInformation.java:250)
at org.apache.hadoop.security.UnixUserGroupInformation.login(UnixUserGroupInformation.java:275)
at org.apache.hadoop.mapred.JobClient.getUGI(JobClient.java:715)
... 5 more


请问这是怎么回事?
谢谢了啊~


[解决办法]
修改下conf文件夹下的nutch-site.xml文件:

XML code
    <property>      <name>http.agent.name</name>      <value>HD nutch agent</value>      <description></description>    </property>        <property>      <name>http.agent.description</name>      <value>hpjianhua</value>    <description></description>    </property>        <property>      <name>http.agent.url</name>      <value>http://www.163.com</value>      <description></description>    </property>        <property>      <name>http.agent.email</name>      <value>hpjianhua@163.com</value>      <description></description>    </property> 

读书人网 >网络基础

热点推荐