Using Nutch solrindex to index to multiple cores?
Asked Answered
M

2

6

Is there parameter in the bin/nutch solrindex command to indicate which Solr core to index to?

Mooney answered 1/5, 2012 at 7:37 Comment(0)
P
8

I'm not aware of any core parameter. You should just include the name of the core in your solr url parameter like this: http://localhost:8983/solr/core0.

Pythagoreanism answered 1/5, 2012 at 10:3 Comment(4)
That makes perfect sense now.Mooney
This definitely works, don't be put off by the fact that you see a 404 when you access localhost:8983/solr/core_name, this is the way to choose a specific Solr core for your Apache Nutch spider...thanksBastardize
just add /admin to it to avoid getting 404. Nutch will use the select and update request handlers, adding their path to the configured solr core url.Pythagoreanism
Worked for me. I initially used host:8983/solr/#/core_name, but redirected me to index.html and failed with 405 method not supported.Eton
A
0

In nutch 1.4, this is what I use to index to different cores:

bin/nutch crawl urls/url1 -solr http://localhost:8983/solr/core1 -depth 10 -topN 10000

Adventure answered 10/5, 2012 at 15:15 Comment(0)

© 2022 - 2024 — McMap. All rights reserved.