在具有相似文件的目录中进行尾部操作
Tail operation in directories that have similar files under
我正在寻找处理这种情况的聪明方法。
在cpu-0
和fhcount
目录下,有很多文件,但可以按一些名称进行分组。
这是目录和文件的鸟瞰图。
../cpu-0/
cpu-idle-01-01-2016
cpu-idle-02-01-2016
cpu-idle-03-01-2016
.
.
cpu-interrupt-01-01-2016
cpu-interrupt-02-01-2016
cpu-interrupt-03-01-2016
.
.
.
cpu-nice-01-01-2016
cpu-nice-02-01-2016
.
.
../fhcount/
file_handles-max-01-01-2016
file_handles-max-02-01-2016
file_handles-max-03-01-2016
.
.
file_handles-unused-01-01-2016
file_handles-unused-02-01-2016
file_handles-unused-03-01-2016
.
.
.
file_handles-used-01-01-2016
file_handles-used-02-01-2016
.
.
如您所见,这是有规律的。我通过硬编码收集了它们,以便 tail
相关文件。
curdir="${PWD%}"
tail -q -n +2 $curdir/cpu-0/cpu-idle* > cpu-idle_combined
tail -q -n +2 $curdir/cpu-0/cpu-interrupt* > cpu-interrupt_combined
tail -q -n +2 $curdir/cpu-0/cpu-nice* > cpu-nice_combined
tail -q -n +2 $curdir/fhcount/file_handles-max* > file_handles-max_combined
tail -q -n +2 $curdir/fhcount/file_handles-unused-* > file_handles-unused_combined
tail -q -n +2 $curdir/fhcount/file_handles-used-* > file_handles-unused_combined
我怎么能做同样的事情,但这次更聪明?
这会遍历子目录中的所有文件,收集文件名的公共部分,然后将它们打印到合并的输出文件中:
#!/bin/bash
# Required for the +(pattern) glob
shopt -s extglob
# Associative array used as set of unique file name roots
declare -A roots
# Shorten names like cpu-0/cpu-idle-01-01-2016 to cpu-0/cpu-idle
# +([[:digit:]-]) matches digits and hyphens
# ${fname%%pattern) removes the longest match of pattern from the end of fname
for fname in */*; do
roots["${fname%%+([[:digit:]-])}"]=1
done
# Loop through unique roots, print to output files
for fname in "${!roots[@]}"; do
tail -q -n +2 "$fname"* > "$fname"_combined
done
关联数组需要 Bash 4.0 或更新版本。
例如
的输入文件结构
.
├── cpu-0
│ ├── cpu-idle-01-01-2016
│ ├── cpu-idle-02-01-2016
│ ├── cpu-idle-03-01-2016
│ ├── cpu-interrupt-01-01-2016
│ ├── cpu-interrupt-02-01-2016
│ ├── cpu-interrupt-03-01-2016
│ ├── cpu-nice-01-01-2016
│ ├── cpu-nice-02-01-2016
│ └── cpu-nice-03-01-2016
└── fhcount
├── file_handles-max-01-01-2016
├── file_handles-max-02-01-2016
├── file_handles-max-03-01-2016
├── file_handles-unused-01-01-2016
├── file_handles-unused-02-01-2016
├── file_handles-unused-03-01-2016
├── file_handles-used-01-01-2016
├── file_handles-used-02-01-2016
└── file_handles-used-03-01-2016
最终的输出结构是
.
├── cpu-0
│ ├── cpu-idle-01-01-2016
│ ├── cpu-idle-02-01-2016
│ ├── cpu-idle-03-01-2016
│ ├── cpu-idle_combined
│ ├── cpu-interrupt-01-01-2016
│ ├── cpu-interrupt-02-01-2016
│ ├── cpu-interrupt-03-01-2016
│ ├── cpu-interrupt_combined
│ ├── cpu-nice-01-01-2016
│ ├── cpu-nice-02-01-2016
│ ├── cpu-nice-03-01-2016
│ └── cpu-nice_combined
└── fhcount
├── file_handles-max-01-01-2016
├── file_handles-max-02-01-2016
├── file_handles-max-03-01-2016
├── file_handles-max_combined
├── file_handles-unused-01-01-2016
├── file_handles-unused-02-01-2016
├── file_handles-unused-03-01-2016
├── file_handles-unused_combined
├── file_handles-used-01-01-2016
├── file_handles-used-02-01-2016
├── file_handles-used-03-01-2016
└── file_handles-used_combined
例如输入文件内容如
$ head cpu-idle*
==> cpu-idle-01-01-2016 <==
1cpu-idle-01-01-2016
2cpu-idle-01-01-2016
3cpu-idle-01-01-2016
==> cpu-idle-02-01-2016 <==
1cpu-idle-02-01-2016
2cpu-idle-02-01-2016
3cpu-idle-02-01-2016
==> cpu-idle-03-01-2016 <==
1cpu-idle-03-01-2016
2cpu-idle-03-01-2016
3cpu-idle-03-01-2016
合并的输出文件包含
$ cat cpu-idle_combined
2cpu-idle-01-01-2016
3cpu-idle-01-01-2016
2cpu-idle-02-01-2016
3cpu-idle-02-01-2016
2cpu-idle-03-01-2016
3cpu-idle-03-01-2016
我正在寻找处理这种情况的聪明方法。
在cpu-0
和fhcount
目录下,有很多文件,但可以按一些名称进行分组。
这是目录和文件的鸟瞰图。
../cpu-0/
cpu-idle-01-01-2016
cpu-idle-02-01-2016
cpu-idle-03-01-2016
.
.
cpu-interrupt-01-01-2016
cpu-interrupt-02-01-2016
cpu-interrupt-03-01-2016
.
.
.
cpu-nice-01-01-2016
cpu-nice-02-01-2016
.
.
../fhcount/
file_handles-max-01-01-2016
file_handles-max-02-01-2016
file_handles-max-03-01-2016
.
.
file_handles-unused-01-01-2016
file_handles-unused-02-01-2016
file_handles-unused-03-01-2016
.
.
.
file_handles-used-01-01-2016
file_handles-used-02-01-2016
.
.
如您所见,这是有规律的。我通过硬编码收集了它们,以便 tail
相关文件。
curdir="${PWD%}"
tail -q -n +2 $curdir/cpu-0/cpu-idle* > cpu-idle_combined
tail -q -n +2 $curdir/cpu-0/cpu-interrupt* > cpu-interrupt_combined
tail -q -n +2 $curdir/cpu-0/cpu-nice* > cpu-nice_combined
tail -q -n +2 $curdir/fhcount/file_handles-max* > file_handles-max_combined
tail -q -n +2 $curdir/fhcount/file_handles-unused-* > file_handles-unused_combined
tail -q -n +2 $curdir/fhcount/file_handles-used-* > file_handles-unused_combined
我怎么能做同样的事情,但这次更聪明?
这会遍历子目录中的所有文件,收集文件名的公共部分,然后将它们打印到合并的输出文件中:
#!/bin/bash
# Required for the +(pattern) glob
shopt -s extglob
# Associative array used as set of unique file name roots
declare -A roots
# Shorten names like cpu-0/cpu-idle-01-01-2016 to cpu-0/cpu-idle
# +([[:digit:]-]) matches digits and hyphens
# ${fname%%pattern) removes the longest match of pattern from the end of fname
for fname in */*; do
roots["${fname%%+([[:digit:]-])}"]=1
done
# Loop through unique roots, print to output files
for fname in "${!roots[@]}"; do
tail -q -n +2 "$fname"* > "$fname"_combined
done
关联数组需要 Bash 4.0 或更新版本。
例如
的输入文件结构.
├── cpu-0
│ ├── cpu-idle-01-01-2016
│ ├── cpu-idle-02-01-2016
│ ├── cpu-idle-03-01-2016
│ ├── cpu-interrupt-01-01-2016
│ ├── cpu-interrupt-02-01-2016
│ ├── cpu-interrupt-03-01-2016
│ ├── cpu-nice-01-01-2016
│ ├── cpu-nice-02-01-2016
│ └── cpu-nice-03-01-2016
└── fhcount
├── file_handles-max-01-01-2016
├── file_handles-max-02-01-2016
├── file_handles-max-03-01-2016
├── file_handles-unused-01-01-2016
├── file_handles-unused-02-01-2016
├── file_handles-unused-03-01-2016
├── file_handles-used-01-01-2016
├── file_handles-used-02-01-2016
└── file_handles-used-03-01-2016
最终的输出结构是
.
├── cpu-0
│ ├── cpu-idle-01-01-2016
│ ├── cpu-idle-02-01-2016
│ ├── cpu-idle-03-01-2016
│ ├── cpu-idle_combined
│ ├── cpu-interrupt-01-01-2016
│ ├── cpu-interrupt-02-01-2016
│ ├── cpu-interrupt-03-01-2016
│ ├── cpu-interrupt_combined
│ ├── cpu-nice-01-01-2016
│ ├── cpu-nice-02-01-2016
│ ├── cpu-nice-03-01-2016
│ └── cpu-nice_combined
└── fhcount
├── file_handles-max-01-01-2016
├── file_handles-max-02-01-2016
├── file_handles-max-03-01-2016
├── file_handles-max_combined
├── file_handles-unused-01-01-2016
├── file_handles-unused-02-01-2016
├── file_handles-unused-03-01-2016
├── file_handles-unused_combined
├── file_handles-used-01-01-2016
├── file_handles-used-02-01-2016
├── file_handles-used-03-01-2016
└── file_handles-used_combined
例如输入文件内容如
$ head cpu-idle*
==> cpu-idle-01-01-2016 <==
1cpu-idle-01-01-2016
2cpu-idle-01-01-2016
3cpu-idle-01-01-2016
==> cpu-idle-02-01-2016 <==
1cpu-idle-02-01-2016
2cpu-idle-02-01-2016
3cpu-idle-02-01-2016
==> cpu-idle-03-01-2016 <==
1cpu-idle-03-01-2016
2cpu-idle-03-01-2016
3cpu-idle-03-01-2016
合并的输出文件包含
$ cat cpu-idle_combined
2cpu-idle-01-01-2016
3cpu-idle-01-01-2016
2cpu-idle-02-01-2016
3cpu-idle-02-01-2016
2cpu-idle-03-01-2016
3cpu-idle-03-01-2016