移动行值包含特定字符串到 Python 中的新列
Moving row values contains specific string to new column in Python
我正在重组数据框。示例数据框如下:
df = pd.DataFrame()
df ['Stats'] = ['Def duels', 'Def duels Won','Back passes', 'Back passes[Acc]','Dribbles', 'Dribbles[Suc]']
df ['Value'] = [5,2.5,60,55,5,2]
我想创建一个新列,其中只包含 'Won'、'Acc' 和 'Suc' 等字符串。预期数据框如下:
我该如何解决这个问题?
解决方案:
# initialize Stats1 with empty strings
df['Stats1'] = ''
# copy values from `Stats`
df.iloc[1::2,-1] = df['Stats']
# replace the copied values with empty strings
df['Stats'] = np.where(df['Stats1'].ne(''), '', df['Stats'])
输出:
Stats Value Stats1
0 Def duels 5.0
1 2.5 Def duels Won
2 Back passes 60.0
3 55.0 Back passes[Acc]
4 Dribbles 5.0
5 2.0 Dribbles[Suc]
IIUC
s=df.Stats.str.contains('Won|Acc|Suc')
df['New']=df.Stats.where(s,'')
df.Stats=df.Stats.mask(s,'')
df
Stats Value New
0 Def duels 5.0
1 2.5 Def duels Won
2 Back passes 60.0
3 55.0 Back passes[Acc]
4 Dribbles 5.0
5 2.0 Dribbles[Suc]
使用 str.contains
和 np.where
df['stat1'] = np.where(df['Stats'].str.contains('won|acc|suc',case=False),df['Stats'],'')
df['Stats'] = np.where(df['Stats'].str.contains('won|acc|suc',case=False),'',df['Stats'])
print(df)
Stats Value stat1
0 Def duels 5.0
1 2.5 Def duels Won
2 Back passes 60.0
3 55.0 Back passes[Acc]
4 Dribbles 5.0
5 2.0 Dribbles[Suc]
我正在重组数据框。示例数据框如下:
df = pd.DataFrame()
df ['Stats'] = ['Def duels', 'Def duels Won','Back passes', 'Back passes[Acc]','Dribbles', 'Dribbles[Suc]']
df ['Value'] = [5,2.5,60,55,5,2]
我想创建一个新列,其中只包含 'Won'、'Acc' 和 'Suc' 等字符串。预期数据框如下:
我该如何解决这个问题?
解决方案:
# initialize Stats1 with empty strings
df['Stats1'] = ''
# copy values from `Stats`
df.iloc[1::2,-1] = df['Stats']
# replace the copied values with empty strings
df['Stats'] = np.where(df['Stats1'].ne(''), '', df['Stats'])
输出:
Stats Value Stats1
0 Def duels 5.0
1 2.5 Def duels Won
2 Back passes 60.0
3 55.0 Back passes[Acc]
4 Dribbles 5.0
5 2.0 Dribbles[Suc]
IIUC
s=df.Stats.str.contains('Won|Acc|Suc')
df['New']=df.Stats.where(s,'')
df.Stats=df.Stats.mask(s,'')
df
Stats Value New
0 Def duels 5.0
1 2.5 Def duels Won
2 Back passes 60.0
3 55.0 Back passes[Acc]
4 Dribbles 5.0
5 2.0 Dribbles[Suc]
使用 str.contains
和 np.where
df['stat1'] = np.where(df['Stats'].str.contains('won|acc|suc',case=False),df['Stats'],'')
df['Stats'] = np.where(df['Stats'].str.contains('won|acc|suc',case=False),'',df['Stats'])
print(df)
Stats Value stat1
0 Def duels 5.0
1 2.5 Def duels Won
2 Back passes 60.0
3 55.0 Back passes[Acc]
4 Dribbles 5.0
5 2.0 Dribbles[Suc]