SolrIndexerJob:java.lang.RuntimeException:作业失败:
SolrIndexerJob: java.lang.RuntimeException: job failed:
Apache nutch \ "crawl" 脚本无法索引到 solr。对此有什么想法吗?
IndexingJob: starting
SolrIndexerJob: java.lang.RuntimeException: job failed: name=[myId_1]Indexer, jobid=job_local483340309_0001
at org.apache.nutch.util.NutchJob.waitForCompletion(NutchJob.java:120)
at org.apache.nutch.indexer.IndexingJob.run(IndexingJob.java:154)
at org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:176)
at org.apache.nutch.indexer.IndexingJob.run(IndexingJob.java:202)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
at org.apache.nutch.indexer.IndexingJob.main(IndexingJob.java:211)
此消息信息量不大,check/postSolr 记录任何相关错误。造成这种情况的常见原因之一是 Nutch 和 Solr 架构之间的不匹配。
1) 您必须配置 gora.properties,
2) Gora 背后的任何东西(Mongo、HBase、Cassandra 等)都没有响应,所以 nutch 需要 "waitForCompletion",所以请确保它已启动并且运行.
确保使用 kill -9 杀死旧的已停用进程,以及旧的 java nutch 进程,如果找不到它们则重新启动(希望它不会出现...)
Apache nutch \ "crawl" 脚本无法索引到 solr。对此有什么想法吗?
IndexingJob: starting
SolrIndexerJob: java.lang.RuntimeException: job failed: name=[myId_1]Indexer, jobid=job_local483340309_0001
at org.apache.nutch.util.NutchJob.waitForCompletion(NutchJob.java:120)
at org.apache.nutch.indexer.IndexingJob.run(IndexingJob.java:154)
at org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:176)
at org.apache.nutch.indexer.IndexingJob.run(IndexingJob.java:202)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
at org.apache.nutch.indexer.IndexingJob.main(IndexingJob.java:211)
此消息信息量不大,check/postSolr 记录任何相关错误。造成这种情况的常见原因之一是 Nutch 和 Solr 架构之间的不匹配。
1) 您必须配置 gora.properties,
2) Gora 背后的任何东西(Mongo、HBase、Cassandra 等)都没有响应,所以 nutch 需要 "waitForCompletion",所以请确保它已启动并且运行.
确保使用 kill -9 杀死旧的已停用进程,以及旧的 java nutch 进程,如果找不到它们则重新启动(希望它不会出现...)