apache-tika Questions
1
Could please anybody who managed to do that explain how to do that :-)
Do I need to get n-gram files for the language I need to add ?
Is it a matter of creating tika.language.override.properties,...
Graphomotor asked 3/6, 2011 at 13:16
1
Solved
Could please anybody help me to sort it out?
It can be done like this
Tika tika = new Tika();
tika.setMaxStringLength(10*1024*1024);
But if you don't use Tika directly, like this:
ContentH...
Fernando asked 26/5, 2011 at 20:33
2
Solved
I want to be able to create a new Tika parser to extract metadata from a file. We're already using Tika and the metadata extraction will be done consistently.
I think that I've run into this probl...
Inexplicit asked 17/5, 2011 at 21:32
1
Solved
I'm doing simple processing of variety of documents (ODS, MS office, pdf) using Apache Tika. I have to get at least :
word count, author, title, timestamps, language etc.
which is not so easy. M...
Whitney asked 26/2, 2011 at 21:47
2
Solved
I want to index a large number of pdf documents.
I have found a reference showing that it could be done using Apache Tika but unfortunately I cannot find any reference that describes I could conf...
Terrell asked 5/10, 2010 at 13:9
© 2022 - 2024 — McMap. All rights reserved.