Is that even possible!?!
I have a bunch of legacy reports that I need to import into a database. However, they're all in pdf format. Are there any R
packages that can read pdf? Or should I leave that to a command line tool?
The reports were made in excel and then pdfed, so they have regular structure, but many blank "cells".
readPDF
in thetm
package (text mining), but it isn't exactly user friendly and I think it uses the command line utilitypdftotext
under the hood anyway. – Groscrtm::readPDF
, but managed a work-around usingpdftotext
in myR
workflow like this: https://mcmap.net/q/355000/-readpdf-tm-package-in-r – Downstate