根据前 2 列合并 2 个列表
merge 2 lists based on first 2 columns
我需要根据第 1 列和第 2 列合并 2 个列表
文件 1:
client1,server1,3000.00
client1,server2,2500.00
client1,server3,1500.00
client2,server1,4500.00
client2,server2,2300.00
client2,server3,1230.00
client3,server1,3400.00
client3,server2,4500.00
client3,server3,1245.00
client4,server1,3400.00
client5,server2,4500.00
client6,server3,1245.00
client7,server1,3400.00
client7,server2,4500.00
client8,server3,1245.00
client8,server1,3400.00
client8,server2,4500.00
client9,server3,1245.00
文件 2:
client1,server1,windows,250g
client1,server2,linux,450g
client1,server3,linux,400g
client2,server1,windows,250g
client2,server2,linux,450g
client2,server3,linux,400g
client3,server1,windows,250g
client3,server2,linux,450g
client3,server3,linux,400g
我需要的是用 file1 的第 1 列和第 2 列中的缺失值更新 file2,并添加逗号以保持相同的列数
对于这个例子,输出应该是这样的:
client1,server1,windows,250g
client1,server2,linux,450g
client1,server3,linux,400g
client2,server1,windows,250g
client2,server2,linux,450g
client2,server3,linux,400g
client3,server1,windows,250g
client3,server2,linux,450g
client3,server3,linux,400g
client4,server1,,
client5,server2,,
client6,server3,,
client7,server1,,
client7,server2,,
client8,server3,,
client8,server1,,
client8,server2,,
client9,server3,,
我尝试过使用 awk 和 join,但无法获得相同的结果
如果创建新文件更容易,那就没问题
感谢您的帮助
试试这一行:
awk -F, '{k= FS }NR==FNR{a[k]++;print;next}!a[k]{print k",,"}' file2 file1
另一种 awk 方式
awk -F, -vOFS="," 'NR!=FNR{NF--;NF+=2}!a[ FS ]++' test2 test
或
awk -F, 'NR!=FNR{[=11=]= FS ",,"}!a[ FS ]++' test2 test
最短
awk -F, '{x=","}NR!=FNR{[=12=]=x",,"}!a[x]++' test2 test
使用join
命令。问题是 join
不能连接多个字段,所以我们需要临时操作第一个逗号:
join -t , -o 0,2.2,2.3 -a 1 <(sed 's/,/:/' file1) <(sed 's/,/:/' file2) | sed 's/:/,/'
我需要根据第 1 列和第 2 列合并 2 个列表
文件 1:
client1,server1,3000.00
client1,server2,2500.00
client1,server3,1500.00
client2,server1,4500.00
client2,server2,2300.00
client2,server3,1230.00
client3,server1,3400.00
client3,server2,4500.00
client3,server3,1245.00
client4,server1,3400.00
client5,server2,4500.00
client6,server3,1245.00
client7,server1,3400.00
client7,server2,4500.00
client8,server3,1245.00
client8,server1,3400.00
client8,server2,4500.00
client9,server3,1245.00
文件 2:
client1,server1,windows,250g
client1,server2,linux,450g
client1,server3,linux,400g
client2,server1,windows,250g
client2,server2,linux,450g
client2,server3,linux,400g
client3,server1,windows,250g
client3,server2,linux,450g
client3,server3,linux,400g
我需要的是用 file1 的第 1 列和第 2 列中的缺失值更新 file2,并添加逗号以保持相同的列数
对于这个例子,输出应该是这样的:
client1,server1,windows,250g
client1,server2,linux,450g
client1,server3,linux,400g
client2,server1,windows,250g
client2,server2,linux,450g
client2,server3,linux,400g
client3,server1,windows,250g
client3,server2,linux,450g
client3,server3,linux,400g
client4,server1,,
client5,server2,,
client6,server3,,
client7,server1,,
client7,server2,,
client8,server3,,
client8,server1,,
client8,server2,,
client9,server3,,
我尝试过使用 awk 和 join,但无法获得相同的结果
如果创建新文件更容易,那就没问题
感谢您的帮助
试试这一行:
awk -F, '{k= FS }NR==FNR{a[k]++;print;next}!a[k]{print k",,"}' file2 file1
另一种 awk 方式
awk -F, -vOFS="," 'NR!=FNR{NF--;NF+=2}!a[ FS ]++' test2 test
或
awk -F, 'NR!=FNR{[=11=]= FS ",,"}!a[ FS ]++' test2 test
最短
awk -F, '{x=","}NR!=FNR{[=12=]=x",,"}!a[x]++' test2 test
使用join
命令。问题是 join
不能连接多个字段,所以我们需要临时操作第一个逗号:
join -t , -o 0,2.2,2.3 -a 1 <(sed 's/,/:/' file1) <(sed 's/,/:/' file2) | sed 's/:/,/'