Pandas/numpy 加权平均 ZeroDivisionError

Question

创建一个 lambda 函数来计算加权平均值并将其发送到字典。

wm = lambda x: np.average(x, weights=df.loc[x.index, 'WEIGHTS'])

# Define a dictionary with the functions to apply for a given column:
f = {'DRESS_AMT': 'max', 
     'FACE_AMT': 'sum',
     'Other_AMT': {'weighted_mean' : wm}}

# Groupby and aggregate with dictionary:
df2=df.groupby(['ID','COL1'], as_index=False).agg(f)

此代码有效，但如果权重加起来为 0，加权平均 lambda 函数将失败 ZeroDivisionError。在这些情况下，我希望输出 'Other_AMT' 只是 0.

我阅读了有关使用 np.ma.average（掩蔽平均）的文档，但无法理解如何实施它

Answer 1

这还不够吗？

def wm(x):
    try: 
        return np.average(x, weights=df.loc[x.index, 'WEIGHTS'])
    except ZeroDivisionError:
        return 0

f = {'DRESS_AMT': 'max', 
     'FACE_AMT': 'sum',
     'Other_AMT': {'weighted_mean' : wm} }

df2=df.groupby(['ID','COL1'], as_index=False).agg(f)

Pandas/numpy 加权平均 ZeroDivisionError

Pandas/numpy weighted average ZeroDivisionError

python

numpy

divide-by-zero

weighted-average

pandas