根据其他列的内容删除列中的某些值

Question

我有一个如下所示的数据框：

   Deal      Year  Financial Data1  Financial Data2  Financial Data3  Quarter
0     1  1991/1/1              122              123              120        1
3     1  1991/1/1              122              123              120        2
6     1  1991/1/1              122              123              120        3
1     2  1992/1/1               85               90               80        4
4     2  1992/1/1               85               90               80        5
7     2  1992/1/1               85               90               80        6
2     3  1993/1/1               85               90              100        1
5     3  1993/1/1               85               90              100        2
8     3  1993/1/1               85               90              100        3

但是我只希望在每笔交易中显示第一季度的财务数据 1，然后将整个数据再次合并到一个列中。

最终结果应该是这样的：

       Deal    Year         Financial Data   Quarter
    0     1  1991/1/1              122          1
    3     1  1991/1/1              123          2
    6     1  1991/1/1              120          3
    1     2  1992/1/1               85          4
    4     2  1992/1/1               90          5
    7     2  1992/1/1               80          6
    2     3  1993/1/1               85          1
    5     3  1993/1/1               90          2
    8     3  1993/1/1              100          3

Answer 1

Okie dokie，使用 np.where() 我认为这可以满足您的要求：

import pandas as pd
import numpy as np

df = pd.read_fwf(StringIO(
"""Deal      Year  Financial_Data1  Financial_Data2  Financial_Data3  Quarter
1     1991/1/1  122              123              120              1
1     1991/1/1  122              123              120              2
1     1991/1/1  122              123              120              3
2     1992/1/1  85               90               80               4
2     1992/1/1  85               90               80               5
2     1992/1/1  85               90               80               6
3     1993/1/1  85               90               100              1
3     1993/1/1  85               90               100              2
3     1993/1/1  85               90               100              3"""))


df['Financial_Data'] = np.where(
    # if 'Quarter'%3==1
    df['Quarter']%3==1,
    # Then return Financial_Data1
    df['Financial_Data1'], 
    # Else
    np.where(
        # If 'Quarter'%3==2
        df['Quarter']%3==2,
        # Then return Financial_Data2 
        df['Financial_Data2'],  
        # Else return Financial_Data3
        df['Financial_Data3']
    )
)

# Drop Old Columns
df = df.drop(['Financial_Data1', 'Financial_Data2', 'Financial_Data3'], axis=1)

print(df)

输出：

   Deal      Year  Quarter  Financial_Data
0     1  1991/1/1        1             122
1     1  1991/1/1        2             123
2     1  1991/1/1        3             120
3     2  1992/1/1        4              85
4     2  1992/1/1        5              90
5     2  1992/1/1        6              80
6     3  1993/1/1        1              85
7     3  1993/1/1        2              90
8     3  1993/1/1        3             100

(PS：我不是 100% 确定你打算如何处理第 4-6 季度，在这个例子中我只是将它们视为 1-3)

根据其他列的内容删除列中的某些值

Dropping certain values in columns depending on contents of other column

python

transformation

dataframe