pandas for 循环中的 isin 函数
pandas isin function on a for loop
1.csv
cut price depth carat table
0 Good 327 57.9 0.23 65.0
1 Good 335 63.3 0.31 58.0
2 Very Good 336 62.8 0.24 57.0
3 Very Good 336 62.3 0.24 57.0
4 Very Good 337 61.9 0.26 55.0
5 Premium 326 59.8 0.21 61.0
6 Premium 334 62.4 0.29 58.0
7 Good 400 64.0 0.30 55.0
2.csv
cut price depth carat table
0 Good 327 57.9 0.23 65.0
1 Good 335 63.3 0.31 58.0
2 Very Good 336 62.8 0.24 57.0
3 Very Good 336 62.3 0.24 57.0
4 Very Good 337 61.9 0.26 50.0
5 Premium 326 59.8 0.21 61.0
6 Premium 334 60.4 0.29 58.0
7 Good 399 64.0 0.30 55.0
只有 2.csv 中的 4、6、7 行被更改
我正在寻找
这样输出
cut price depth carat table
4 Very Good 337 61.9 0.26 50.0
6 Premium 334 60.4 0.29 58.0
7 Good 399 64.0 0.30 55.0
任何人都可以分享您的经验任何形式的帮助都很好
import pandas as pd
f1 = pd.read_csv('1.csv')
f2 = pd.read_csv('2.csv')
columns_list = ['cut', 'price', 'depth', 'carat', 'table']
new_df= f2[~f2.price.isin(f1.price)]
print(new_df)
这是我写的示例代码,运行良好,但我需要使用
f2[~f2.price.isin(f1.price)]
在循环中获取 'price' space 上的每个列名称,并且 return value.i 会像这样以正常方式尝试
for i in columns_list:
price = f2[~f2.i.isin(f1.i)]
print(price)
但是 pandas 命令不能像这样工作它是 return 像
这样的错误
AttributeError: 'DataFrame' object has no attribute 'i'
感谢阅读,希望您能理解
IIUC,DataFrame.merge
与 indicator = True
:
f2_filtered = (f2.merge(f1, how='outer', indicator=True)
.query('_merge == "left_only"')
.drop(columns = '_merge'))
print(f2_filtered)
输出
cut price depth carat table
4 Very_Good 337 61.9 0.26 50.0
6 Premium 334 60.4 0.29 58.0
7 Good 399 64.0 0.30 55.0
1.csv
cut price depth carat table
0 Good 327 57.9 0.23 65.0
1 Good 335 63.3 0.31 58.0
2 Very Good 336 62.8 0.24 57.0
3 Very Good 336 62.3 0.24 57.0
4 Very Good 337 61.9 0.26 55.0
5 Premium 326 59.8 0.21 61.0
6 Premium 334 62.4 0.29 58.0
7 Good 400 64.0 0.30 55.0
2.csv
cut price depth carat table
0 Good 327 57.9 0.23 65.0
1 Good 335 63.3 0.31 58.0
2 Very Good 336 62.8 0.24 57.0
3 Very Good 336 62.3 0.24 57.0
4 Very Good 337 61.9 0.26 50.0
5 Premium 326 59.8 0.21 61.0
6 Premium 334 60.4 0.29 58.0
7 Good 399 64.0 0.30 55.0
只有 2.csv 中的 4、6、7 行被更改
我正在寻找
这样输出
cut price depth carat table
4 Very Good 337 61.9 0.26 50.0
6 Premium 334 60.4 0.29 58.0
7 Good 399 64.0 0.30 55.0
任何人都可以分享您的经验任何形式的帮助都很好
import pandas as pd
f1 = pd.read_csv('1.csv')
f2 = pd.read_csv('2.csv')
columns_list = ['cut', 'price', 'depth', 'carat', 'table']
new_df= f2[~f2.price.isin(f1.price)]
print(new_df)
这是我写的示例代码,运行良好,但我需要使用
f2[~f2.price.isin(f1.price)]
在循环中获取 'price' space 上的每个列名称,并且 return value.i 会像这样以正常方式尝试
for i in columns_list:
price = f2[~f2.i.isin(f1.i)]
print(price)
但是 pandas 命令不能像这样工作它是 return 像
这样的错误AttributeError: 'DataFrame' object has no attribute 'i'
感谢阅读,希望您能理解
IIUC,DataFrame.merge
与 indicator = True
:
f2_filtered = (f2.merge(f1, how='outer', indicator=True)
.query('_merge == "left_only"')
.drop(columns = '_merge'))
print(f2_filtered)
输出
cut price depth carat table
4 Very_Good 337 61.9 0.26 50.0
6 Premium 334 60.4 0.29 58.0
7 Good 399 64.0 0.30 55.0