Pandas 数据框应用 - lambda 函数附加到列表

Pandas Dataframe Apply - lambda function appending to list

我在 pandas 数据框的一列中包含一个列表。我想将 "price_label" 列中的值附加到列表中。

我目前正在使用我制作的功能来执行此操作,但这是最好的方法吗?我觉得我在这里错过了什么?

我的方法可行,但不够完美。任何建议。

前一名单:

+---+--------------+-----------+-----------------------------------------+-------+------------------------------+-------------+
|   | prodref      | prodcateg | webproddesc                             | price | categories                   | price_label |
+---+--------------+-----------+-----------------------------------------+-------+------------------------------+-------------+
| 0 | 1004 10-14MM | STR4      | Gold Plated Bracelet 1004 - 10-14MM     | 5.98  | [Gold, Plated, Bracelet]     | 1           |
| 1 | 1004 16-22MM | STR4      | Gold Plated Bracelet 1004 - 16-22MM     | 5.98  | [Gold, Plated, Bracelet]     | 1           |
| 2 | 1007 10-14MM | STR4      | Bi-Colour Bracelet 1007 - 10-14MM       | 5.98  | [BiColour, Bracelet]         | 1           |
| 3 | 1007 16-22MM | STR4      | Bi-Colour Bracelet 1007 - 16-22MM       | 5.98  | [BiColour, Bracelet]         | 1           |
| 4 | 1010 10-14MM | STR4      | Stainless Steel Bracelet 1010 - 10-14MM | 5.98  | [Stainless, Steel, Bracelet] | 1           |
| 5 | 1010 16-22MM | STR4      | Stainless Steel Bracelet 1010 - 16-22MM | 5.98  | [Stainless, Steel, Bracelet] | 1           |
| 6 | W108/22      | STR1      | Grey Calf Watch Strap (S) - W108/22     | 4.18  | [Grey, Calf, Watch]          | 1           |
| 7 | W404/14      | STR1      | White Lizard Grain Strap (S) - W404/14  | 5.98  | [White, Lizard, Grain]       | 1           |
| 8 | W404/18      | STR1      | White Lizard Grain Strap (S) - W404/18  | 5.98  | [White, Lizard, Grain]       | 1           |
+---+--------------+-----------+-----------------------------------------+-------+------------------------------+-------------+

函数:

def appendPrice(vert):
    cat_list = vert["categories"]
    cat_list.append(vert["price_label"])
    return cat_list

test["categories"] = test.apply(lambda x:appendPrice(x),axis=1)  

输出

+---+--------------+-----------+-----------------------------------------+-------+---------------------------------+-------------+
|   | prodref      | prodcateg | webproddesc                             | price | categories                      | price_label |
+---+--------------+-----------+-----------------------------------------+-------+---------------------------------+-------------+
| 0 | 1004 10-14MM | STR4      | Gold Plated Bracelet 1004 - 10-14MM     | 5.98  | [Gold, Plated, Bracelet, 1]     | 1           |
| 1 | 1004 16-22MM | STR4      | Gold Plated Bracelet 1004 - 16-22MM     | 5.98  | [Gold, Plated, Bracelet, 1]     | 1           |
| 2 | 1007 10-14MM | STR4      | Bi-Colour Bracelet 1007 - 10-14MM       | 5.98  | [BiColour, Bracelet, 1]         | 1           |
| 3 | 1007 16-22MM | STR4      | Bi-Colour Bracelet 1007 - 16-22MM       | 5.98  | [BiColour, Bracelet, 1]         | 1           |
| 4 | 1010 10-14MM | STR4      | Stainless Steel Bracelet 1010 - 10-14MM | 5.98  | [Stainless, Steel, Bracelet, 1] | 1           |
| 5 | 1010 16-22MM | STR4      | Stainless Steel Bracelet 1010 - 16-22MM | 5.98  | [Stainless, Steel, Bracelet, 1] | 1           |
| 6 | W108/22      | STR1      | Grey Calf Watch Strap (S) - W108/22     | 4.18  | [Grey, Calf, Watch, 1]          | 1           |
| 7 | W404/14      | STR1      | White Lizard Grain Strap (S) - W404/14  | 5.98  | [White, Lizard, Grain, 1]       | 1           |
| 8 | W404/18      | STR1      | White Lizard Grain Strap (S) - W404/18  | 5.98  | [White, Lizard, Grain, 1]       | 1           |
| 9 | W409/14      | STR1      | Pink Lizard Grain Strap (S) - W409/14   | 5.98  | [Pink, Lizard, Grain, 1]        | 1           |
+---+--------------+-----------+-----------------------------------------+-------+---------------------------------+-------------+

正如@ALollz 指出的那样,在 Series 或 DataFrame 中使用列表很少是可行的方法。

就是说,如果您想暂时保留您的模式,您可以完全放弃 appendPrice

test["categories"] = test.apply(lambda x: x.categories + [x.price_label],axis=1) 

否则你可以分享更多关于你到底想用这个完成什么的颜色。

假设

df["categories"]
0        [Gold, Plated, Bracelet]
1        [Gold, Plated, Bracelet]
2            [BiColour, Bracelet]
3            [BiColour, Bracelet]
4    [Stainless, Steel, Bracelet]
5    [Stainless, Steel, Bracelet]
6             [Grey, Calf, Watch]
7          [White, Lizard, Grain]
8          [White, Lizard, Grain]
Name: categories, dtype: object

df['price_label']int,以下将执行:

df.apply(lambda row: row["categories"] + list(str(row["price_label"])), axis=1)
0        [Gold, Plated, Bracelet, 1]
1        [Gold, Plated, Bracelet, 1]
2            [BiColour, Bracelet, 1]
3            [BiColour, Bracelet, 1]
4    [Stainless, Steel, Bracelet, 1]
5    [Stainless, Steel, Bracelet, 1]
6             [Grey, Calf, Watch, 1]
7          [White, Lizard, Grain, 1]
8          [White, Lizard, Grain, 1]
dtype: object

这行代码将执行您的请求。它会将 price_label 列中的值附加到每个类别行的末尾。对于更大的数据,此操作比 df.apply() 更有效。

list(map(lambda k: df['categories'][k].append(df['price_label'][k]),df.index))