To copy text together with its formatting, you need to write your own function, because python-docx does not provide one.
This is the function I wrote:
def get_para_data(output_doc_name, paragraph):
    """
    Copy a paragraph into the output document run by run, keeping each run's
    bold, italic, underline, color and font, plus the paragraph's alignment.
    """
    output_para = output_doc_name.add_paragraph()
    for run in paragraph.runs:
        output_run = output_para.add_run(run.text)
        # Run's bold data
        output_run.bold = run.bold
        # Run's italic data
        output_run.italic = run.italic
        # Run's underline data
        output_run.underline = run.underline
        # Run's color data
        output_run.font.color.rgb = run.font.color.rgb
        # Run's font data (typeface name; None means it is inherited)
        output_run.font.name = run.font.name
    # Paragraph's alignment data
    output_para.paragraph_format.alignment = paragraph.paragraph_format.alignment
How The Function Works
- Adds a new paragraph object to the output document.
- For each run in the source paragraph, adds a new run with the same text to that paragraph.
- Reads the bold, italic and underline settings of the run. Each one is True, False or None. If it's True, the run will be in that style; if it's False, it won't be; and if it's None, the setting is inherited from the style that applies to the run. It then applies the same values to the new run.
- Reads the color of the run in RGB and applies it to the new run.
- Reads the font of the run and applies it to the new run.
- Reads the alignment of the source paragraph and applies it to the new paragraph.
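The True / False / None behaviour described above can be pictured with a small model. This is only a simplified sketch, not python-docx code: effective_flag is a hypothetical helper, and Word's real style resolution has more layers than shown here. The point is just that None defers to the next level, while True and False are explicit.

```python
from typing import Optional


def effective_flag(*levels: Optional[bool]) -> bool:
    """Return the first explicit (non-None) value.

    `levels` is ordered from most specific to least specific, for example
    (run value, paragraph style value). Hypothetical helper for illustration;
    it falls back to False when every level is None.
    """
    for value in levels:
        if value is not None:
            return value
    return False


# The run says nothing (None), so the style's value is used.
inherited = effective_flag(None, True)

# The run explicitly switches the style off, overriding the style.
overridden = effective_flag(False, True)
```

This is why the function copies None as it is instead of converting it to False: a copied None keeps inheriting from whatever style applies in the output document.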
How to Use the Function:
Pass it the output Document object and the paragraph you want to copy.
For example:
# Imports
from docx import Document

input_doc = Document('InputDoc.docx')
output_doc = Document()
# Call the function
get_para_data(output_doc, input_doc.paragraphs[3])
# Save the new file
output_doc.save('OutputDoc.docx')
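If you'd like to copy the entire document, I suggest looping over all of its paragraphs. Note that input_doc.paragraphs contains only the body paragraphs, so text inside tables is not copied:
for para in input_doc.paragraphs:
    get_para_data(output_doc, para)
output_doc.save('OutputDoc.docx')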