ElasticSearch Indexing Confluence pages
Asked Answered
C

1

12

Can ElasticSearch index Confluence pages? There are a lot of river plugins but none for Confluence. http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/modules-plugins.html

Although there is a github project https://github.com/obazoud/elasticsearch-river-confluence but the last commit is a year ago, so I guess it's not up-to-date.

Conduce answered 10/10, 2013 at 13:26 Comment(5)
Here's an option if you're using confluence version 3 or 4: searchtechnologies.com/elasticsearch-connector-confluence.html but it doesn't appear to be freeCatima
Thanks but I'm looking for a free connector.Conduce
@maestr0, I guess it is too late for answer but you could try to read Confluence content right from the database (I guess BODYCONTENT table). I don't know if it is legal according to Atlassian license but it can be safe for sure as long as you only read from database. Unfortunatelly you must to write some converter yourself - for example to bulk insert data to ElasticSearch.Melgar
theres a chapter in confluence doc: confluence.atlassian.com/doc/…Fusible
It looks like Confluence once tried to support Elasticsearch but it never go past experimental state: confluence.atlassian.com/confkb/… Rivers are outdated so I guess we're currently limited to crawling or reading the db directly.Kyle
S
1

Elasticsearch deprecated river.

Elasticsearch has a solution built over it called workplace search which could connect to confluence for ingesting data.

Ideally, you might need to do it by the Confluent API via a script to Elasticsearch. You might also need to use the "ingest-attachment" plugin if you need to parse PDF content.

Stutman answered 4/5, 2021 at 12:2 Comment(0)

© 2022 - 2024 — McMap. All rights reserved.