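I have a template Excel file named template.xlsx that contains a number of sheets. I would like to copy data from a separate .csv file into the first sheet of template.xlsx (named data) and save the new file as result.xlsx, while keeping the original template file unchanged.
I want to paste the data starting from the second row of the data sheet in template.xlsx.
This is the code I have developed so far: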
import pandas as pd
from openpyxl.utils.dataframe import dataframe_to_rows
import openpyxl
from shutil import copyfile

# The template already has a header in row 1. It must be skipped while pasting data,
# but it should still be there in the output file.
template_file = 'template.xlsx'
output_file = 'result.xlsx'
copyfile(template_file, output_file)

df = pd.read_csv('input_file.csv')  # The file which is to be pasted into the template
wb = openpyxl.load_workbook(output_file)
ws = wb['data']  # Getting the sheet named 'data'
for r in dataframe_to_rows(df, index=False, header=False):
    ws.append(r)
wb.save(output_file)
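I am unable to get the desired output.
The template file (with the extra row) is on the left, and the input file (the data to be copied into the template) is on the right, as shown below.
Solution: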
There is no real need for the shutil module, because you can load the template with openpyxl.load_workbook and then save it under a different name.
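As a minimal sketch of that point: the snippet builds a small stand-in template in a temporary directory (the sheet contents and the temporary paths are assumptions made for illustration), loads it, edits it in memory, and saves it under a new name. The template on disk is only read, never written.

```python
import os
import tempfile

from openpyxl import Workbook, load_workbook

# Work in a temporary directory so nothing on disk is overwritten.
workdir = tempfile.mkdtemp()
template_file = os.path.join(workdir, "template.xlsx")
output_file = os.path.join(workdir, "result.xlsx")

# Build a stand-in template with a single header row (hypothetical contents).
wb = Workbook()
ws = wb.active
ws.title = "data"
ws.append(["Pie", "Sold"])
wb.save(template_file)

# Load the template, change it in memory, and save under a different name.
wb = load_workbook(template_file)
wb["data"]["A2"] = "Cherry"
wb.save(output_file)

# Read both files back: only result.xlsx contains the new cell.
template_rows = list(load_workbook(template_file)["data"].values)
result_rows = list(load_workbook(output_file)["data"].values)
```

Because wb.save() is only ever called with the output path, no separate file copy is needed.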
Also, ws.append(r) in the for loop appends after the existing data taken from template.xlsx, and it sounds like you only want to keep the header.
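The difference can be seen in a small in-memory sketch. The sheet contents here are made up for illustration: ws.append() writes below the last used row, while ws.cell() with an explicit row index overwrites from whichever row you choose.

```python
from openpyxl import Workbook

wb = Workbook()
ws = wb.active
ws.append(["Pie", "Sold"])   # row 1: header
ws.append(["Cream", 2])      # row 2: old data already in the sheet
ws.append(["Cherry", 2])     # row 3: old data already in the sheet

new_rows = [["Blueberry", 4], ["Pumpkin", 6]]

# append() always lands below the last used row, here row 4.
ws.append(new_rows[0])
appended_at = ws.max_row

# Writing with explicit coordinates overwrites the old data from row 2 on.
for r_idx, row in enumerate(new_rows, start=2):
    for c_idx, value in enumerate(row, start=1):
        ws.cell(row=r_idx, column=c_idx, value=value)
```

After this runs, rows 2 and 3 hold the new values, and the row added by append() is still sitting in row 4.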
Below is a fully reproducible example that creates 'template.xlsx' for demonstration purposes. It then loads 'template.xlsx', adds new data to it, and saves the result as result.xlsx.
from openpyxl import load_workbook
from openpyxl.chart import PieChart, Reference
from openpyxl.utils.dataframe import dataframe_to_rows
import pandas as pd

template_file = 'template.xlsx'
output_file = 'result.xlsx'

# This part creates a workbook called template.xlsx with a sheet called 'data'
# and a sheet called 'Second_sheet'
df = pd.DataFrame({'Pie': ["Cream", "Cherry", "Banoffee", "Apple"],
                   'Sold': [2, 2, 1, 4]})
with pd.ExcelWriter(template_file, engine='openpyxl') as writer:
    wb = writer.book
    # startrow=1 puts the header in row 2 and the data in rows 3 to 6
    df.to_excel(writer, index=False, sheet_name='data', startrow=1)
    ws = writer.sheets['data']
    # Extra row above the header
    ws['A1'] = 1
    ws['B1'] = 2
    ch = PieChart()
    labels = Reference(ws, min_col=1, min_row=3, max_row=6)
    data = Reference(ws, min_col=2, min_row=3, max_row=6)
    ch.add_data(data)
    ch.set_categories(labels)
    ch.title = "Pies sold"
    ws.add_chart(ch, "D2")
    ws = wb.create_sheet("Second_sheet")
    ws['A1'] = 'This sheet will not be overwritten'
# Leaving the with block saves template.xlsx

# Now we load the workbook called template.xlsx, modify the 'data' sheet
# and save under a new name. template.xlsx itself is not modified.
df_new = pd.DataFrame({'different_name': ["Blueberry", "Pumpkin", "Mushroom", "Turnip"],
                       'different_numbers': [4, 6, 2, 1]})
wb = load_workbook(template_file)
ws = wb['data']  # Getting the sheet named 'data'
rows = dataframe_to_rows(df_new, index=False, header=False)
# r_idx + 2 starts writing at row 3, directly below the header in row 2
for r_idx, row in enumerate(rows, 1):
    for c_idx, value in enumerate(row, 1):
        ws.cell(row=r_idx + 2, column=c_idx, value=value)
wb.save(output_file)
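Applied to the layout described in the question (header in row 1, data pasted from row 2, input read from a CSV), the same idea might look like the sketch below. The stand-in template, the inline CSV text, and the temporary paths are assumptions made so the snippet runs on its own. It iterates with df.itertuples() as a plain alternative to dataframe_to_rows.

```python
import io
import os
import tempfile

import pandas as pd
from openpyxl import Workbook, load_workbook

workdir = tempfile.mkdtemp()
template_file = os.path.join(workdir, "template.xlsx")
output_file = os.path.join(workdir, "result.xlsx")

# Stand-in template: a 'data' sheet with a header in row 1, plus a second sheet.
wb = Workbook()
ws = wb.active
ws.title = "data"
ws.append(["Pie", "Sold"])
wb.create_sheet("Second_sheet")["A1"] = "untouched"
wb.save(template_file)

# Stand-in for input_file.csv; its own header line is dropped when pasting.
csv_text = "name,count\nBlueberry,4\nPumpkin,6\n"
df = pd.read_csv(io.StringIO(csv_text))

# Load the template and write the CSV rows starting at row 2 of 'data'.
wb = load_workbook(template_file)
ws = wb["data"]
for r_idx, row in enumerate(df.itertuples(index=False, name=None), start=2):
    for c_idx, value in enumerate(row, start=1):
        ws.cell(row=r_idx, column=c_idx, value=value)
wb.save(output_file)  # template.xlsx is left as it was

result_rows = list(load_workbook(output_file)["data"].values)
```

If the template already holds more data rows than the CSV provides, the rows below the pasted block keep their old values and would need to be cleared separately.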
Expected output: