使用 pandas 将数据集列设置为变量时出现 KeyError

Question

我正在尝试将数据集列（从 csv 导入）设置为新变量，以便在直方图中绘制数据：

x = gcbs[gcbs['E1']]   #dataset is gcbs, column is 'E1'

plt.hist(x, bins = 9)

我收到以下错误：

KeyError: "None of [Int64Index([...,\n dtype='int64', length=2495)] are in the [columns]"

为什么会出现此错误？

Answer 1

如果你想使用布尔掩码过滤，你需要使用 loc，如果你只想要一个列，你可以使用 [] 来获得它，例如

df = pd.DataFrame({"x":[1,2,3,4],"y":[10,20,30,40]})
x = df["x"] #get the column "x"

df_gt_2 = df.loc[df["x"]>2] #Get the rows where "x" is greather than 2

所以如果我没看错你的问题，你只需要完成第一部分

import matplotlib.pyplot as plt

x = gcbs["x"]
plt.hist(x,bins=9)

使用 pandas 将数据集列设置为变量时出现 KeyError

Getting KeyError when setting dataset column to variable with pandas

python

keyerror