如何使用 python 从 xlsx 文件中的另一列中减去一列中的单元格值

How to subtract cell values from one column with cell values from another column in xlsx files using python

我想用另一列的单元格值减去一列的单元格值,并将总和写入 excel 文件中的新列。然后我想要将总和(如果不等于 0)添加到列表中供以后使用。我的 excel 文件中的数据结构如下:

Name | Number | Name1 | Number1
Name2 | Number2 | Name3 | Number3
....
Namex | Numberx | Namey |Numbery

我想将数字彼此相减,然后将总和添加到新列中,如下所示:

Name| Number | Name1 | Number1 | Sum of (Number - Number1)

我曾尝试使用 openpyxl 来执行此操作,但我真的很困惑,因为文档与 Python 的早期版本与更新版本有很大不同。我在 Python 3.4 工作。我很高兴得到有关您推荐我使用哪个模块的建议。 到目前为止,我的代码给我错误,因为我将 excel 文件称为生成器,而不是可订阅的。我不确定如何搜索和读取 excel 文件,同时使其可订阅,以便可以写入。谁能帮帮我?

这是我的代码:

from openpyxl import Workbook, load_workbook

def analyzexlsx(filepath):
    numbers = []
    excel_input = load_workbook(filepath)
    filepath = [pth for pth in Path.cwd().iterdir()
                  if pth.suffix == '.xlsx'] #Want to iterate through several excel files in a folder.
    ws = excel_input.active
    cols = tuple(ws.columns)
    col_b = cols[1] 
    col_e = cols[4] 
    for j, k in zip(col_e, col_b): 
        if None:
            print('None')
        equally = (int(j.value) - int(k.value)) #line 13, error. Trying to subtract column cell values.
        if equally != 0: #If the columns sum is not equal to 0, it is to be added to the numbers list.
            numbers.append(j.row)

        else:
            pass

    col1 = []
    col2 = []
    col4 = []
    col5 = []
    col7 = []
    col8 = []

    mainlist = []
    try:
        for row in numbers:
            col1.append(str(ws.cell(row=row, column=1).value))
            col2.append(str(ws.cell(row=row, column=2).value))
            col4.append(ws.cell(row=row, column=4).value)
            col5.append(ws.cell(row=row, column=5).value)
            col7.append(ws.cell(row=row, column=7).value)
            col8.append(ws.cell(row=row, column=8).value)
    finally:
        for i, j, k, l, m, n in zip(col1, col2, col4, col5, col7, col8):
            mainlist.append(i + ", " + j + ", " + k + ", " + l + ", " + m + ", " + n)
    return mainlist

Traceback (most recent call last):
    Line 13, in analyzexlsx
        equally = (int(j.value) - int(k.value))
    TypeError: int() argument must be a string or a number, not 'NoneType

我真的很高兴得到答案,因为我已经为此工作了很长一段时间,现在我被困住了。我是 Python 的新手。

首先通过 read_excel 从 excel 创建 DataFrame

然后需要用 4 列减去 2.

df = pd.read_excel('file.xlsx')

#select by column name
df['E'] = df['B'] - df['D']

#select by positions, but python count from 0 so for 2. column need 1
df['E'] = df.iloc[:, 1] - df.iloc[:, 3]

也许还可以帮助检查 documentation