poppler Questions

23

I'm trying to use pdf2image and it seems I need something called poppler : (sum_env) C:\Users\antoi\Documents\Programming\projects\summarizer>python ocr.py -i fr13_idf.pdf Traceback (most recent...
Manutius asked 26/11, 2018 at 12:25

4

Solved

I just download poppler to Linux system,and I want to incorporate it in my app to parse pdf file. (My goal is to convert pdf file to plain text.) How can I do this?
Stripteaser asked 17/1, 2012 at 5:59

1

I'm using Ubuntu 20.04 and trying to add a digital certificate to Okular to sign pdfs. Okular requires Poppler to be at least version 21.01, to add digital certificate signature, but the Poppler ve...
Integrity asked 2/11, 2023 at 13:12

7

Solved

I have a PDF document that also contains images. Now I want to know the resolution of these images. A first step would be to somehow get the images out of the PDF document. But how? Is that ev...
Valtin asked 24/7, 2012 at 18:22

16

Solved

The most recent version of ScraperWiki depends on Poppler (or so the GitHub says). Unfortunately, it only specifies how to get it on macOS and Linux, not Windows. A quick googling turned up nothin...
Mcroberts asked 22/8, 2013 at 13:22

3

I'm trying to convert a pdf signed using DocuSign to image format. We are facing the error in convert_from_path method. Code and error are shown below: import pdf2image data=pdf2image.convert_from...
Unquestionable asked 15/3, 2021 at 10:43

4

Solved

Poppler is a PDF rendering library based on the xpdf-3.0 code base. I have already downloaded the tar.xz file from the official site http://poppler.freedesktop.org/ But I do not know what to do wit...
Rondi asked 22/8, 2015 at 12:20

5

Solved

I have to run pdf2image on my Python Lambda Function in AWS, but it requires poppler and poppler-utils to be installed on the machine. I have tried to search in many different places how to do th...
Delenadeleon asked 20/11, 2018 at 23:58

5

Solved

I want to delete / remove all the images in a PDF leaving only the text / font in the PDF with whatever command Line tool possible. I tried using -dGraphicsAlphaBits=1 in a Ghostscript command but...
Maieutic asked 19/12, 2013 at 8:29

2

Solved

I have some PDF files which I want to merge into one. I typically use pdfunite from the Poppler PDF toolkit to do this. However now I'm getting this error: Could not merge encrypted files ('foo.pd...
Baptlsta asked 8/7, 2020 at 13:11

3

I need to extract pdf annotations in Python. All my searches indicate I need Poppler to do this e.g. to run this But I am having a lot of trouble installing Poppler. I'm using Python 3.4 on OS X (...
Condemnatory asked 24/8, 2015 at 7:48

2

Solved

I've been trying to install Poppler to use pdf2image to do some pdf to jpeg conversions using Anaconda Prompt. I tried running this: conda install -c conda-forge poppler However, it gets stuck ...
Dripdry asked 2/8, 2019 at 16:36

1

I was using a python package pdf2img, but I got an error asking me to check if I had installed poppler or not , so I ran "pip install python-poppler-qt5" in a code box in colab , but then I get the...
Shrill asked 27/12, 2019 at 8:43

2

I compiled and install poppler-0.39.0 as per the instruction. By default header files went int \usr\local\include and lib files went into \usr\local\lib. pdftohtml is installed in \usr\local\bin. ...
Albertinealbertite asked 2/2, 2016 at 11:28

2

I've got a pdf from which I want to extract some images using Python. I can easily extract images from the Linux command line using the pdfimages from the poppler-utils library like this: pdfimage...
Fecit asked 22/10, 2014 at 13:47

1

Solved

This link shows how pdfs could be converted to images. Is there a way to zoom my pdfs before converting to images? In my project, i am converting pdfs to pngs and then using Python-tesseract librar...
Clicker asked 22/3, 2019 at 17:58

0

I'm using "pdftotext -bbox file.pdf" to convert a pdf file into HTML. Here's a sample line from the output: <word xMin="351.852025" yMin="42.548936" xMax="365.689478" yMax="47.681498">foo&l...
Elegit asked 6/5, 2018 at 11:23

1

Solved

https://www.fda.gov/downloads/AboutFDA/ReportsManualsForms/Forms/UCM074728.pdf I'm trying to read this pdf using PyPDF2 or Pdfminer, but it is saying that the File has not been decrypted in Pypdf2...
Olgaolguin asked 13/4, 2018 at 18:7

3

I'd like to write a small script (which will run on a headless Linux server) that reads a PDF, highlights text that matches anything in an array of strings that I pass, then saves the modified PDF....
Camise asked 30/9, 2011 at 3:10

3

I have a problem with a script with a command line.. The PHP script never continue.. Have tried to call the command line directly through putty and it outputs alot of errors but returns/completes ...
Cathouse asked 26/2, 2017 at 11:26

6

I have grabbed a pdf from the web using for example import requests pdf = requests.get("http://www.scala-lang.org/docu/files/ScalaByExample.pdf") I would like to modify this code to display it ...
Catamnesis asked 10/2, 2014 at 17:46

1

Is there a way to install poppler for python 3 in linux? I have used apt-get install python-poppler to install it for python 2, but I haven't found how to do it for python 3. I am using Ubuntu 14...
Inject asked 16/9, 2016 at 0:56

1

Solved

pdfseparate requires to specify %d as PDF-page-pattern which is replaced by the page number. $ pdfseparate CFL_1115_ISSUU.pdf cfl-%d.pdf works. It sets separated output file names as cfl-1.pdf, cf...
Parsaye asked 9/12, 2015 at 6:16

2

I desperatly need your help. Im trying to compile statically the poppler library (specially for qt4) on windows with the visual c++ 2008 compiler. To achieve this task I needed to compile a bunch ...
Delegation asked 30/1, 2011 at 7:29

1

Since yesterday I'm trying to extract the text from some highlighted annotations in one pdf, using python-poppler-qt4. According to this documentation, looks like I have to get the text using the ...
Seale asked 10/1, 2014 at 17:24

© 2022 - 2024 — McMap. All rights reserved.