poppler Questions
23
I'm trying to use pdf2image, and it seems I need something called Poppler:
(sum_env) C:\Users\antoi\Documents\Programming\projects\summarizer>python ocr.py -i fr13_idf.pdf
Traceback (most recent...
Manutius asked 26/11, 2018 at 12:25
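A minimal sketch of a pre-flight check for this kind of error, assuming pdf2image shells out to Poppler's `pdftoppm` and `pdfinfo` binaries and locates them through `PATH`. The helper names `find_poppler_tools` and `poppler_available` are hypothetical, not part of pdf2image.

```python
import shutil

# Poppler command-line tools that pdf2image is assumed to call.
POPPLER_TOOLS = ("pdftoppm", "pdfinfo")


def find_poppler_tools(tools=POPPLER_TOOLS):
    """Map each tool name to its resolved path, or None if it is not on PATH."""
    return {name: shutil.which(name) for name in tools}


def poppler_available(tools=POPPLER_TOOLS):
    """Return True only if every listed tool can be found on PATH."""
    return all(path is not None for path in find_poppler_tools(tools).values())


if __name__ == "__main__":
    for name, path in find_poppler_tools().items():
        print(f"{name}: {path or 'NOT FOUND'}")
```

If the tools are installed but not on `PATH` (a common situation on Windows), pdf2image's `convert_from_path` also accepts a `poppler_path` argument pointing at the directory that contains them.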
4
Solved
I just downloaded Poppler to my Linux system, and I want to incorporate it into my app to parse PDF files.
(My goal is to convert a PDF file to plain text.)
How can I do this?
1
I'm using Ubuntu 20.04 and trying to add a digital certificate to Okular to sign PDFs. Okular requires Poppler version 21.01 or later to add a digital certificate signature, but the Poppler ve...
Integrity asked 2/11, 2023 at 13:12
7
Solved
I have a PDF document that also contains images.
Now I want to know the resolution of these images.
A first step would be to somehow get the images out of the PDF document. But how?
Is that ev...
Valtin asked 24/7, 2012 at 18:22
16
Solved
The most recent version of ScraperWiki depends on Poppler (or so the GitHub says). Unfortunately, it only specifies how to get it on macOS and Linux, not Windows.
A quick googling turned up nothin...
Mcroberts asked 22/8, 2013 at 13:22
3
I'm trying to convert a PDF signed using DocuSign to image format. We are getting an error in the convert_from_path method. The code and error are shown below:
import pdf2image
data=pdf2image.convert_from...
Unquestionable asked 15/3, 2021 at 10:43
4
Solved
Poppler is a PDF rendering library based on the xpdf-3.0 code base.
I have already downloaded the tar.xz file from the official site http://poppler.freedesktop.org/
But I do not know what to do wit...
5
Solved
I have to run pdf2image on my Python Lambda Function in AWS, but it requires poppler and poppler-utils to be installed on the machine.
I have tried to search in many different places how to do th...
Delenadeleon asked 20/11, 2018 at 23:58
5
Solved
I want to remove all the images from a PDF, leaving only the text and fonts, using any command-line tool.
I tried using -dGraphicsAlphaBits=1 in a Ghostscript command but...
Maieutic asked 19/12, 2013 at 8:29
2
Solved
I have some PDF files which I want to merge into one. I typically use pdfunite from the Poppler PDF toolkit to do this.
However now I'm getting this error:
Could not merge encrypted files ('foo.pd...
Baptlsta asked 8/7, 2020 at 13:11
3
I need to extract PDF annotations in Python. All my searches indicate I need Poppler to do this, e.g. to run this
But I am having a lot of trouble installing Poppler. I'm using Python 3.4 on OS X (...
2
Solved
I've been trying to install Poppler from the Anaconda Prompt so I can use pdf2image for some PDF-to-JPEG conversions.
I tried running this:
conda install -c conda-forge poppler
However, it gets stuck ...
1
I was using a Python package, pdf2img, but I got an error asking me to check whether I had installed Poppler, so I ran "pip install python-poppler-qt5" in a code cell in Colab, but then I get the...
Shrill asked 27/12, 2019 at 8:43
2
I compiled and installed poppler-0.39.0 as per the instructions. By default, header files went into \usr\local\include and lib files went into \usr\local\lib. pdftohtml is installed in \usr\local\bin.
...
Albertinealbertite asked 2/2, 2016 at 11:28
2
I've got a PDF from which I want to extract some images using Python. I can easily extract images from the Linux command line using pdfimages from the poppler-utils package like this:
pdfimage...
Fecit asked 22/10, 2014 at 13:47
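One way to drive `pdfimages` from Python is through `subprocess`. The sketch below assumes poppler-utils is installed and `pdfimages` is on `PATH`; the function names and the `img` prefix are hypothetical choices for illustration.

```python
import subprocess
from pathlib import Path


def pdfimages_command(pdf_path, image_root, as_png=True):
    """Build the argument list for Poppler's pdfimages tool."""
    cmd = ["pdfimages"]
    if as_png:
        cmd.append("-png")  # write PNG files instead of the default PPM/PBM
    cmd += [str(pdf_path), str(image_root)]
    return cmd


def extract_images(pdf_path, out_dir, prefix="img"):
    """Run pdfimages and return the files it wrote (requires poppler-utils)."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    subprocess.run(pdfimages_command(pdf_path, out_dir / prefix), check=True)
    # pdfimages names its output <image_root>-NNN.<ext>
    return sorted(out_dir.glob(f"{prefix}-*"))
```

Passing the arguments as a list avoids shell quoting problems with file names that contain spaces.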
1
Solved
This link shows how PDFs can be converted to images. Is there a way to zoom my PDFs before converting them to images? In my project, I am converting PDFs to PNGs and then using Python-tesseract librar...
Clicker asked 22/3, 2019 at 17:58
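With pdf2image, "zooming" before conversion usually amounts to rendering at a higher resolution through the `dpi` argument of `convert_from_path`. A minimal sketch, assuming a baseline of 200 DPI; the helper names `dpi_for_zoom` and `render_zoomed` are hypothetical.

```python
def dpi_for_zoom(zoom, base_dpi=200):
    """Translate a zoom factor into a render resolution in DPI."""
    if zoom <= 0:
        raise ValueError("zoom must be positive")
    return round(base_dpi * zoom)


def render_zoomed(pdf_path, zoom=2.0):
    """Render every page at the scaled resolution (needs pdf2image and Poppler)."""
    # Imported lazily so dpi_for_zoom stays usable without pdf2image installed.
    from pdf2image import convert_from_path

    return convert_from_path(pdf_path, dpi=dpi_for_zoom(zoom))
```

Higher DPI values tend to help OCR on small text, at the cost of larger images and slower rendering.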
0
I'm using "pdftotext -bbox file.pdf" to convert a PDF file into HTML.
Here's a sample line from the output:
<word xMin="351.852025" yMin="42.548936" xMax="365.689478"
yMax="47.681498">foo&l...
Elegit asked 6/5, 2018 at 11:23
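Output of this shape can be read back into Python with a small parser. The sketch below assumes each word appears as a `<word>` element with the four attributes in the order shown in the sample; `parse_bbox_words` is a hypothetical helper, and a real XML parser may be a better fit for full documents.

```python
import html
import re

# Matches one <word ...>text</word> element; \s+ also spans line breaks
# between attributes, as in the sample above.
WORD_RE = re.compile(
    r'<word\s+xMin="([^"]+)"\s+yMin="([^"]+)"\s+'
    r'xMax="([^"]+)"\s+yMax="([^"]+)"\s*>(.*?)</word>',
    re.DOTALL,
)


def parse_bbox_words(xhtml):
    """Return a list of dicts with the text and bounding box of each word."""
    words = []
    for x_min, y_min, x_max, y_max, text in WORD_RE.findall(xhtml):
        words.append({
            "text": html.unescape(text),  # decode entities such as &amp;
            "xMin": float(x_min),
            "yMin": float(y_min),
            "xMax": float(x_max),
            "yMax": float(y_max),
        })
    return words
```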
1
Solved
https://www.fda.gov/downloads/AboutFDA/ReportsManualsForms/Forms/UCM074728.pdf
I'm trying to read this PDF using PyPDF2 or Pdfminer, but it is saying that the file has not been decrypted in Pypdf2...
3
I'd like to write a small script (which will run on a headless Linux server) that reads a PDF, highlights text that matches anything in an array of strings that I pass, then saves the modified PDF....
3
I have a problem with a script that runs a command line. The PHP script never continues.
I have tried calling the command line directly through PuTTY, and it outputs a lot of errors but returns/completes ...
Cathouse asked 26/2, 2017 at 11:26
6
I have grabbed a PDF from the web using, for example:
import requests
pdf = requests.get("http://www.scala-lang.org/docu/files/ScalaByExample.pdf")
I would like to modify this code to display it
...
Catamnesis asked 10/2, 2014 at 17:46
1
Is there a way to install Poppler for Python 3 on Linux?
I have used apt-get install python-poppler to install it for Python 2, but I haven't found out how to do it for Python 3.
I am using Ubuntu 14...
Inject asked 16/9, 2016 at 0:56
1
Solved
pdfseparate requires a %d in the PDF-page-pattern, which is replaced by the page number.
$ pdfseparate CFL_1115_ISSUU.pdf cfl-%d.pdf works. It sets separated output file names as cfl-1.pdf, cf...
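The `%d` substitution can be previewed in Python before running the tool. This is a sketch under the assumption that a plain `%d` is the only placeholder in the pattern; `page_filenames` and `pdfseparate_command` are hypothetical helpers, while `-f` and `-l` are pdfseparate's first-page and last-page options.

```python
def page_filenames(pattern, first, last):
    """List the output names pdfseparate would produce for pages first..last."""
    if "%d" not in pattern:
        raise ValueError("pattern must contain %d")
    return [pattern.replace("%d", str(page)) for page in range(first, last + 1)]


def pdfseparate_command(pdf_path, pattern, first=None, last=None):
    """Build the pdfseparate argument list, optionally limiting the page range."""
    cmd = ["pdfseparate"]
    if first is not None:
        cmd += ["-f", str(first)]  # first page to extract
    if last is not None:
        cmd += ["-l", str(last)]  # last page to extract
    return cmd + [pdf_path, pattern]


if __name__ == "__main__":
    print(page_filenames("cfl-%d.pdf", 1, 3))
    print(pdfseparate_command("CFL_1115_ISSUU.pdf", "cfl-%d.pdf", first=2, last=3))
```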
2
I desperately need your help.
I'm trying to statically compile the Poppler library (specifically for Qt4) on Windows with the Visual C++ 2008 compiler. To achieve this task I needed to compile a bunch ...
Delegation asked 30/1, 2011 at 7:29
1
Since yesterday I've been trying to extract the text from some highlighted annotations in a PDF, using python-poppler-qt4.
According to this documentation, it looks like I have to get the text using the ...