Done with :
- java -jar tika-server-path --port xxxx
- pip install tika (virtualenv)
parser-tika.py
import tika
from tika import parser
parsed = parser.from_file('/path/to/file')
print parsed["metadata"]
print parsed["content"]
error : ImportError: cannot import name parser
env setup:
TIKA_VERSION=1.13.1
TIKA_SERVER_JAR=~/parserDev/tika/tika-server-1.13.jar
TIKA_SERVER_ENDPOINT=http://localhost:8989/tika
tika.py
and now it imports your file instead of expected module. – Cavalryman