如何通过 kprobe 将 BPF 程序附加到内核函数?

How can I attached a BPF program to a kernel function via a kprobe?

Cilium BPF and XDP Reference Guide 描述了如何通过 iptc 命令将 BPF 程序加载到网络设备。我如何以相同的方式将 BPF 程序附加到内核 function/userspace 函数?

TL;DR可以使用传统的kprobe API来trace一个函数,然后perf_event_open+ioctl附加一个BPF程序.

这在 bcc 文件 libbpf.cthe load_and_attach function of file load_bpf.c in the kernel, and in the bpf_attach_kprobe and bpf_attach_tracing_event function 中实现。


你可以在跟踪时看到这个动作 the hello_world.py from bcc:

$ strace -s 100 python examples/hello_world.py
[...]
bpf(BPF_PROG_LOAD, {prog_type=BPF_PROG_TYPE_KPROBE, insn_cnt=15, insns=0x7f35716217d0, license="GPL", log_level=0, log_size=0, log_buf=0, kern_version=265728}, 72) = 3
openat(AT_FDCWD, "/sys/bus/event_source/devices/kprobe/type", O_RDONLY) = -1 ENOENT (No such file or directory)
openat(AT_FDCWD, "/sys/bus/event_source/devices/kprobe/format/retprobe", O_RDONLY) = -1 ENOENT (No such file or directory)
openat(AT_FDCWD, "/sys/kernel/debug/tracing/kprobe_events", O_WRONLY|O_APPEND) = 4
getpid()                                = 8121
write(4, "p:kprobes/p_sys_clone_bcc_8121 sys_clone", 40) = 40
close(4)                                = 0
openat(AT_FDCWD, "/sys/kernel/debug/tracing/events/kprobes/p_sys_clone_bcc_8121/id", O_RDONLY) = 4
read(4, "1846\n", 4096)                 = 5
close(4)                                = 0
perf_event_open({type=PERF_TYPE_TRACEPOINT, size=0 /* PERF_ATTR_SIZE_??? */, config=1846, ...}, -1, 0, -1, PERF_FLAG_FD_CLOEXEC) = 4
mmap(NULL, 36864, PROT_READ|PROT_WRITE, MAP_SHARED, 4, 0) = 0x7f356c58b000
ioctl(4, PERF_EVENT_IOC_SET_BPF, 0x3)   = 0
ioctl(4, PERF_EVENT_IOC_ENABLE, 0)      = 0
openat(AT_FDCWD, "/sys/kernel/debug/tracing/trace_pipe", O_RDONLY) = 5
fstat(5, {st_mode=S_IFREG|0444, st_size=0, ...}) = 0
fstat(5, {st_mode=S_IFREG|0444, st_size=0, ...}) = 0
read(5, 
  1. 第一个系统调用 (bpf) 在内核中加载 BPF 程序。
  2. 然后bcc跟随kprobe API通过在p:kprobes/p_sys_clone_bcc_8121 sys_clone中写入p:kprobes/p_sys_clone_bcc_8121 sys_clone来追踪sys_clone
  3. bcc 在 p:kprobes/p_sys_clone_bcc_8121 sys_clone 中检索要在 perf_event_open 中使用的 ID。
  4. 密件抄送调用 perf_event_open,类型为 PERF_TYPE_TRACEPOINT
  5. 并使用 PERF_EVENT_IOC_SET_BPF ioctl.
  6. 将加载的 BPF 程序(由 fd 0x3 引用)附加到 perf_event