pypdf - McMap

3

Solved

Convert PDF page to image with PyPDF2 and BytesIO

I have a function that gets a page from a PDF file via PyPDF2 and should convert the first page to a png (or jpg) with Pillow (PIL Fork) from PyPDF2 import PdfFileWriter, PdfFileReader import os fr...

python pdf pypdf bytesio

Aldrin asked 11/3, 2017 at 9:27

9

EOF marker not found while use PyPDF2 merge pdf file in python

When I use the following code from PyPDF2 import PdfFileMerger merge = PdfFileMerger() for newFile in nlst: merge.append(newFile) merge.write("newFile.pdf") Something happened as foll...

python pdf pypdf

Taynatayra asked 29/7, 2017 at 14:50

3

Excluding the Header and Footer Contents of a page of a PDF file while extracting text?

Is it possible to exclude the contents of footers and headers of a page from a pdf file during extracting the text from it. As these contents are least important and almost redundant. Note: For ex...

python-3.x pdf text nlp pypdf

Gismo asked 27/8, 2018 at 12:53

5

Solved

How to get bookmark's page number

from typing import List from PyPDF2 import PdfFileReader from PyPDF2.generic import Destination def get_outlines(pdf_filepath: str) -> List[Destination]: """Get the bookmarks o...

python pypdf

Julee asked 30/11, 2011 at 16:52

8

Solved

Camelot: DeprecationError: PdfFileReader is deprecated

I have been using camelot for our project, but since 2 days I got following errorMessage. When trying to run following code snippet: import camelot tables = camelot.read_pdf('C:\\Users\\user\\Downl...

python pypdf python-camelot

Margie asked 28/12, 2022 at 11:35

5

PyPDF2 compression

I am struggling to compress my merged pdf's using the PyPDF2 module. this is my attempt based on http://www.blog.pythonlibrary.org/2012/07/11/pypdf2-the-new-fork-of-pypdf/ import PyPDF2 path = ope...

python pdf pypdf

Isochronize asked 1/4, 2014 at 3:42

4

Change fields in pdf using pypdf?

i try to update entry-fields in a pdf using the following code with the pytho-module pypdf. At first i read the pdf-files and get all available fields on this firt pdf-page. from pypdf import PdfRe...

python pypdf

Mares asked 31/7, 2023 at 11:28

6

Solved

Change metadata of pdf file with pypdf2

I want to add a metadata key-value pair to the metadata of a pdf file. I found a several years old answer, but I think this is way to complicated. I guess there is an easier way today: https://sta...

python pdf pypdf pdf-manipulation

Saltation asked 20/10, 2017 at 13:6

4

Select only first page of PDF with PyPDF2

I am trying to strip out only the first page of multiple PDF files and combine into one file. (I receive 150 PDF files a day, the first page is the invoice which I need, the following three to 12 p...

python pdf merge split pypdf

Dicho asked 5/11, 2017 at 19:54

2

Solved

Place a vertical or rotated text in a PDF with Python

I'm currently generating a PDF with PyFPDF. I also need to add a vertical/rotated text. Unfortunately, it's not directly supported in PyPDF2 as far as I see. There are solutions for FPDF for PHP. I...

python pdf fpdf pypdf

Groove asked 12/5, 2018 at 13:42

3

Solved

PyPDF2.errors.PdfReadError: PDF starts with '♣▬', but '%PDF-' expected

I have a folder containing a lot of sub-folders, with PDF files inside. It's a real mess to find information in these files, so I'm making a program to parse these folders and files, searching for ...

python pdf pypdf

Bacchus asked 12/5, 2022 at 12:55

3

How to stitch two pdf pages into one in python

I am using python, and I want to combine two PDF pages into a single page. My purpose is to combine these two pages into one, not two PDFs. Is there any way to combine the two PDFs one by one? I do...

python pdf pypdf

Illfavored asked 13/4, 2019 at 0:7

9

Unable to use pypdf module

I have installed the pyPdf module successfully using the command pip install pydf but when I use the module using the import command I get the following error: enC:\Anaconda3\lib\site-packages\pyP...

python-3.x pypdf

Gillispie asked 9/2, 2017 at 7:19

2

Solved

How to remove annotations in pdf in Python 3

My original goal was to remove the extensive white margins on my PDF pages. Then I found this purpose can be achieved by scaling the page using the code below, but annotations are not scaled. imp...

python python-3.x pypdf

Sea asked 14/3, 2019 at 23:55

13

Solved

How to check if PDF is scanned image or contains text

I have a large number of files, some of them are scanned images into PDF and some are full/partial text PDF. Is there a way to check these files to ensure that we are only processing files which ar...

python python-3.x pypdf pdfminer pdf-extraction

Decoupage asked 16/4, 2019 at 8:54

5

Solved

How to add watermark in all pages of PDF files with python?

I'm try to adding watermark to every pages of my PDF file.My PDF files have 58 pages but my output file has get only last page in my PDF file. This's my code: from PyPDF2 import PdfFileReader, Pd...

python pypdf

Svensen asked 8/6, 2020 at 11:46

6

split a pdf based on outline

i would like to use pyPdf to split a pdf file based on the outline where each destination in the outline refers to a different page within the pdf. example outline: main --> points to page 1 s...

python pdf pypdf

Ascospore asked 16/12, 2009 at 23:0

6

Solved

How can I remove a URL channel from Anaconda?

Recently I needed to install PyPdf2 to one of my programs using Anaconda. Unfortunately, I failed, but the URLs that was added to Anaconda environment prohibit the updates of all the Conda librarie...

python anaconda channel pypdf

Speedway asked 18/9, 2016 at 13:47

3

Solved

How can I rotate a page with PyPDF2?

I'm editing a PDF file with pyPDF2. I managed to generate the PDF I want but I've yet to rotate some pages. I went to the documentation and found two methods: rotateClockwise and rotateCounterClock...

python pypdf

Irena asked 6/3, 2017 at 0:27

4

Create outlines/TOC for existing PDF in Python

I'm using pyPdf to merge several PDF files into one. This works great, but I would also need to add a table of contents/outlines/bookmarks to the PDF file that is generated. pyPdf seems to have on...

python pdf pypdf reportlab tableofcontents

Hinduism asked 27/5, 2011 at 20:38

3

Solved

PyPDF2 split pdf by pages

I wanna split pdf file using PyPDF2. All examples in net is too difficult or don't work or always give error "AttributeError: 'PdfFileWriter' object has no attribute 'stream'" Can someone help wi...

python pypdf

Pizzeria asked 17/7, 2017 at 12:21

3

Solved

Extract pdf text within bounding box directly into python

I'm trying to extract the text of a pdf within a given bounding rectangle. I understand there are tools for pdf scraping such as pdfminer, pypdf, and pdftotext. I've experimented with all 3, and so...

python pdf text-extraction pypdf pdfminer

Waltz asked 9/4, 2019 at 0:26

3

Solved

How to get PDF file metadata 'Page Size' using Python?

I try to use PyPDF2 module in Python 3 but I can't display 'Page Size' property. I would like to know what the sheet of paper dimensions were before scanning to PDF file. Something like this: ...

python scanning pypdf page-size

Jellicoe asked 15/9, 2017 at 6:22

1

Why does pypdf stuff text with extra spaces when extracting text?

pypdf==3.11.0, like previous versions, returns text strings with the occasional inserted single space. But Windows Search and the "Find" in Adobe reader find the text unadulterated, and i...

pypdf python-3.11

Tameika asked 24/6, 2023 at 14:30

4

Solved

Maintained alternatives to PyPDF2

I'm using the PyPDF2 library for extracting text, images, page width and heights, annotations, and other attributes from pdf documents. However, the library has many bugs and issues and seems not t...

python pdf pypdf

Arlynearlynne asked 31/7, 2020 at 22:15

pypdf Questions

Recommended topics

Hot tags