apache-tika Questions

1

Could anybody who has managed to do that please explain how? :-) Do I need to get n-gram files for the language I need to add? Is it a matter of creating tika.language.override.properties,...
Graphomotor asked 3/6, 2011 at 13:16

1

Solved

Could anybody please help me sort this out? It can be done like this: Tika tika = new Tika(); tika.setMaxStringLength(10*1024*1024); But if you don't use Tika directly, like this: ContentH...
Fernando asked 26/5, 2011 at 20:33

2

Solved

I want to be able to create a new Tika parser to extract metadata from a file. We're already using Tika and the metadata extraction will be done consistently. I think that I've run into this probl...
Inexplicit asked 17/5, 2011 at 21:32

1

Solved

I'm doing simple processing of a variety of documents (ODS, MS Office, PDF) using Apache Tika. I have to get at least: word count, author, title, timestamps, language, etc., which is not so easy. M...
Whitney asked 26/2, 2011 at 21:47

2

Solved

I want to index a large number of PDF documents. I have found a reference showing that it could be done using Apache Tika, but unfortunately I cannot find any reference that describes how I could conf...
Terrell asked 5/10, 2010 at 13:9
