How to handle member padding in C struct when reading cffi.buffer with numpy.frombuffer?

Question

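I have to read an array of C structs returned from a DLL and convert it to a NumPy array. The code uses Python's cffi module.

The code works so far, but I don't know how to handle the member padding in the struct, which np.frombuffer complains about: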

ValueError: buffer size must be a multiple of element size

Here is my code:

from cffi import FFI
import numpy as np

s = '''
    typedef struct
    {
        int a;
        int b;
        float c;
        double d;
    } mystruct;
    '''

ffi = FFI()
ffi.cdef(s)

res = []

#create array and fill with dummy data
for k in range(2):

    m = ffi.new("mystruct *")

    m.a = k
    m.b = k + 1
    m.c = k + 2.0
    m.d = k + 3.0

    res.append(m[0])

m_arr = ffi.new("mystruct[]", res)

print(m_arr)

# dtype for structured array in Numpy
dt = [('a', 'i4'),
      ('b', 'i4'),
      ('c', 'f4'),
      ('d', 'f8')]

# member size, 20 bytes
print('size, manually', 4 + 4 + 4 + 8)

# total size of struct, 24 bytes
print('sizeof', ffi.sizeof(m_arr[0]))

#reason is member padding in structs

buf = ffi.buffer(m_arr)
print(buf)

x = np.frombuffer(buf, dtype=dt)
print(x)

Any ideas on how to handle this cleanly?

Edit:

It seems to work if I add an extra field to the dtype at the position where the padding should occur:

dt = [('a', 'i4'),
      ('b', 'i4'),
      ('c', 'f4'),
      ('pad', 'f4'),
      ('d', 'f8')]

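Why does the padding occur there? (Win7, 64-bit, Python 3.4 64-bit.)

But this can't be the best way. The real code is much more complicated and dynamic, so it should be possible to handle this somehow, right?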

Answer 1

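In addition to the other answers given in the comments, you can force cffi to pack its structs (i.e. not insert any padding, similar to what you can do with specific C compiler extensions):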

ffi.cdef("typedef struct { char a; int b; } foo_t;", packed=True)
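Applied to the struct from the question, this could look roughly like the sketch below. It assumes that the code producing the data really uses a packed layout as well; if the DLL was compiled with default alignment, declaring the struct as packed on the Python side would make cffi and NumPy read the fields at the wrong offsets.

```python
from cffi import FFI
import numpy as np

ffi = FFI()

# packed=True tells cffi to lay the struct out without any padding.
# Assumption: the library that fills the array uses the same packed layout.
ffi.cdef("""
    typedef struct
    {
        int a;
        int b;
        float c;
        double d;
    } mystruct;
""", packed=True)

# Create an array of two structs and fill it with dummy data.
m_arr = ffi.new("mystruct[]", 2)
for k in range(2):
    m_arr[k].a = k
    m_arr[k].b = k + 1
    m_arr[k].c = k + 2.0
    m_arr[k].d = k + 3.0

# Plain (unaligned) structured dtype: its item size is the sum of the members.
dt = np.dtype([('a', 'i4'), ('b', 'i4'), ('c', 'f4'), ('d', 'f8')])

struct_size = ffi.sizeof("mystruct")
print(struct_size, dt.itemsize)  # both sizes now agree

# x is a view onto the memory owned by m_arr, so m_arr must stay alive.
x = np.frombuffer(ffi.buffer(m_arr), dtype=dt)
print(x)
```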

Answer 2

Probably the most convenient way is to use the keyword align=True in the numpy dtype constructor. This inserts the padding automatically.

dt = [('a', 'i4'),
      ('b', 'i4'),
      ('c', 'f4'),
      ('d', 'f8')]

dt_obj = np.dtype(dt, align=True)
x = np.frombuffer(buf, dtype=dt_obj)
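To see what align=True changes, and why the padding in the question sits between c and d, it can help to print the field offsets. A double normally has to start at an offset that is a multiple of its alignment (8 on typical 64-bit platforms), so after three 4-byte members at offsets 0, 4 and 8 the next free offset 12 is skipped and d starts at 16. The sketch below uses only NumPy and the standard struct module; the struct.pack call with the native "@" format is merely a stand-in for the C buffer, not part of the original code.

```python
import struct
import numpy as np

fields = [('a', 'i4'), ('b', 'i4'), ('c', 'f4'), ('d', 'f8')]

packed_dt = np.dtype(fields)               # no padding between members
aligned_dt = np.dtype(fields, align=True)  # padded the way a C compiler would

# Byte offset of every field in the aligned dtype.
offsets = {name: aligned_dt.fields[name][1] for name in aligned_dt.names}
print(packed_dt.itemsize, aligned_dt.itemsize)
print(offsets)

# Stand-in for the C array: struct's native mode ("@") follows the
# platform's C alignment rules, so it pads before the double as well.
raw = b"".join(struct.pack("@iifd", k, k + 1, k + 2.0, k + 3.0)
               for k in range(2))

x = np.frombuffer(raw, dtype=aligned_dt)
print(x)
```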

(See also the NumPy documentation on structured arrays.)
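For the more dynamic case mentioned in the question, one option is to take the layout directly from cffi instead of reproducing it by hand. The following is only a sketch: dtype_from_ctype and KIND_BY_CNAME are hypothetical helpers written for this example, and they cover only flat structs made of the three primitive types used here. It relies on the fields attribute of cffi's ctype objects, which reports each member's type and byte offset.

```python
from cffi import FFI
import numpy as np

ffi = FFI()
ffi.cdef("""
    typedef struct
    {
        int a;
        int b;
        float c;
        double d;
    } mystruct;
""")

# Hypothetical mapping from C type names to NumPy kind codes; extend as needed.
KIND_BY_CNAME = {'int': 'i', 'float': 'f', 'double': 'f'}


def dtype_from_ctype(ffi, ctype_name):
    """Build a NumPy dtype that mirrors cffi's layout of a flat struct."""
    ctype = ffi.typeof(ctype_name)
    names, formats, offsets = [], [], []
    for name, field in ctype.fields:
        names.append(name)
        # e.g. 'i' + '4' -> 'i4', using the size cffi reports for the member
        formats.append(KIND_BY_CNAME[field.type.cname]
                       + str(ffi.sizeof(field.type)))
        offsets.append(field.offset)
    # Explicit offsets and itemsize reproduce any padding cffi applies.
    return np.dtype({'names': names, 'formats': formats,
                     'offsets': offsets, 'itemsize': ffi.sizeof(ctype)})


# Create an array of two structs and fill it with dummy data.
m_arr = ffi.new("mystruct[]", 2)
for k in range(2):
    m_arr[k].a = k
    m_arr[k].b = k + 1
    m_arr[k].c = k + 2.0
    m_arr[k].d = k + 3.0

dt = dtype_from_ctype(ffi, "mystruct")
x = np.frombuffer(ffi.buffer(m_arr), dtype=dt)
print(dt)
print(x)
```

Because the offsets come from cffi itself, the dtype stays in sync with the cdef even if members are added or reordered, as long as their C types are present in the mapping.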
