Delete final line in file with python

D

11

41

How can one delete the very last line of a file with python?

Input File example:

hello
world
foo
bar

Output File example:

hello
world
foo

I've created the following code to find the number of lines in the file - but I do not know how to delete the specific line number.

    try:
        file = open("file")
    except IOError:
        print "Failed to read file."
    countLines = len(file.readlines())

Deberadeberry answered 10/12, 2009 at 0:57 Comment(6)

Are you trying to actually remove the line from the file, on disk? If so, make sure you understand that files don't have "lines" from the filesystem's point of view. Lines are a convention of programmers and programs. What you see as a "line" is a sequence of bytes somewhere in the middle of lots of other bytes. To remove the last "line", you could truncate the file at the byte corresponding to the first character in the line. That's not difficult (you just have to find it), but there's not much point if the files involved are not many megabytes in size. – Gabbard 10/12, 2009 at 1:15

What if the last line is an empty line? – Viper 10/12, 2009 at 1:33

Last line is not blank. I remove all blank lines with another python snippet (from google). – Deberadeberry 10/12, 2009 at 1:39

? The file contains no blanks lines? The example above is what you should look on, nothing else. The last line is what I need to remove. Why the condescension? I've almost got it with Strawberry's answer. – Deberadeberry 10/12, 2009 at 1:51

The file in question is not in memory - it is as is above. – Deberadeberry 10/12, 2009 at 1:55

There was no condescension in my questions... just puzzlement, and maybe skepticism that you're doing this in a sensible manner. You wrote about the blank line removal. If the file is in memory, it's not a file, it's a list of strings. If you're already using Python on this "file" to remove blank lines, and this is an entirely separate step, then you're processing this data twice, inefficiently. These are all simple facts, but I'll stop now, if you don't like the help. – Gabbard 10/12, 2009 at 16:46

G

23

You could use the above code and then:-

lines = file.readlines()
lines = lines[:-1]

This would give you an array of lines containing all lines but the last one.

Gewirtz answered 10/12, 2009 at 1:1 Comment(9)

Will this work well for large files? E.g. thousands of lines? – Deberadeberry 10/12, 2009 at 1:3

It might not work well for files bigger than a megabyte or two. Depends on your definition of "well". It should be perfectly fine for any desktop use for a few thousand lines. – Jumada 10/12, 2009 at 1:4

Well - Within a second or two. – Deberadeberry 10/12, 2009 at 1:7

Is there no other way to directly delete a specific line? Or is an array the way to go? – Deberadeberry 10/12, 2009 at 1:8

Nazarius: There isn't any way to delete a specific line. You can however truncate a file or append to it. Since you want to delete the last line, you can just truncate. – Camarilla 10/12, 2009 at 1:17

@Deberadeberry an option could be to use os.system("sed '$d' file") to run sed, at the point that a binary will work faster over big files and processing in general. Truncate file seems the most fastest way. Anyway, this question has many usefull options :) +1 for this question. – Zanthoxylum 7/12, 2015 at 21:42

Would this read the complete file from start to end? – Stuffing 2/6, 2021 at 20:47

@Stuffing Yes, in this example it would read all the lines into an array in memory. – Gewirtz 4/6, 2021 at 14:28

This doesn't remove the line from the file - it only removes it from the list lines while the file on disk still has it in place. – Rutkowski 4/5 at 0:56

P

88

Because I routinely work with many-gigabyte files, looping through as mentioned in the answers didn't work for me. The solution I use:

with open(sys.argv[1], "r+", encoding = "utf-8") as file:

    # Move the pointer (similar to a cursor in a text editor) to the end of the file
    file.seek(0, os.SEEK_END)

    # This code means the following code skips the very last character in the file -
    # i.e. in the case the last line is null we delete the last line
    # and the penultimate one
    pos = file.tell() - 1

    # Read each character in the file one at a time from the penultimate
    # character going backwards, searching for a newline character
    # If we find a new line, exit the search
    while pos > 0 and file.read(1) != "\n":
        pos -= 1
        file.seek(pos, os.SEEK_SET)

    # So long as we're not at the start of the file, delete all the characters ahead
    # of this position
    if pos > 0:
        file.seek(pos, os.SEEK_SET)
        file.truncate()

Pontiff answered 10/12, 2009 at 0:57 Comment(3)

this is the best answer. use "with" statement to save a line :) – Bacteriostasis 25/2, 2015 at 2:18

I ran into some compatibility issues (using Py3) when using this method on files that were used on both mac and windows, because internally Mac uses a different line terminator than Windows (which uses 2: cr and lf). The solution was to open the file in binary read mode ("rb+"), and search for the binary newline character b"\n". – Rahmann 6/10, 2016 at 15:49

If you open the file with "a+" instead of "r+", can you skip the file.seek(0, os.SEEK_END)? – Advowson 26/6, 2023 at 18:9

G

23

You could use the above code and then:-

lines = file.readlines()
lines = lines[:-1]