Using openpyxl
I tried to read from the fifth line for some files. The files' first four lines are the header. Then the main content has a different format from the header. And I tried the method:
import openpyxl
file_name="xxx.xlsx"
wb = openpyxl.load_workbook(filename=file_name, use_iterators = True)
first_sheet = workbook.get_sheet_names()[0]
ws = workbook.get_sheet_by_name(first_sheet)
for index, row in enumerate(ws.iter_rows()):
if start < index < stop:
for c in row:
print c.value
It will always have the error:
IndexError: list index out of range
If I delete the first four lines, the data can be read into Python easily. But I have hundreds of such files, each file has a header for four lines. It will take way much time to delete all the headers from the files.
How to skip first several lines when reading using openpyxl
correctly?