Numpy/CAPI 编译多个模块时 import_array() 出错

Question

我正在尝试编译一个 C++ 模块以在 scipy.weave 中使用，它由几个头文件和源 C++ 文件组成。这些文件包含类和广泛使用 Numpy/C-API 接口的方法。但是我没有弄清楚如何成功地包含 import_array() 。过去一周我一直在为此苦苦挣扎，我快疯了。我希望你能帮助我，因为 weave help 不是很清楚。

实际上，我首先有一个名为 pycapi_utils 的模块，其中包含一些例程，用于将 C 对象与 Python 对象连接起来。它由一个头文件pycapi_utils.h和一个源文件pycapi_utils.cpp组成如：

//pycapi_utils.h
#if ! defined _PYCAPI_UTILS_H
#define _PYCAPI_UTILS_H 1

#include <stdlib.h>
#include <Python.h>
#include <numpy/arrayobject.h>
#include <tuple>
#include <list>

typedef std::tuple<const char*,PyObject*> pykeyval; //Tuple type (string,Pyobj*) as dictionary entry (key,val)
typedef std::list<pykeyval> kvlist;                    

//Declaration of methods
PyObject* array_double_to_pyobj(double* v_c, long int NUMEL); //Convert from array to Python list (double)
...
...
#endif

和

//pycapi_utils.cpp

#include "pycapi_utils.h"

PyObject* array_double_to_pyobj(double* v_c, long int NUMEL){
    //Convert a double array to a Numpy array
    PyObject* out_array = PyArray_SimpleNew(1, &NUMEL, NPY_DOUBLE);
    double* v_b = (double*) ((PyArrayObject*) out_array)->data;
    for (int i=0;i<NUMEL;i++) v_b[i] = v_c[i];
    free(v_c);
    return out_array;
}

然后我还有一个模块 model，其中包含类和处理某些数学模型的例程。同样，它由一个头文件和源文件组成，如：

//model.h
#if ! defined _MODEL_H
#define _MODEL_H 1

//model class
class my_model{
    int i,j;
    public:
        my_model();
        ~my_model();
        double* update(double*); 
}

//Simulator   
PyObject* simulate(double* input);
#endif

和

//model.cpp

#include "pycapi_utils.h"
#include "model.h"

//Define class and methods
model::model{
...
...
}

...
...

double* model::update(double* input){
    double* x = (double*)calloc(N,sizeof(double));
    ...
    ...

    // Do something
    ...
    ...

    return x;
}

PyObject* simulate(double* input){
    //Initialize Python interface 
    Py_Initialize;
    import_array();

    model random_network;
    double* output;

    output = random_network.update(input);
    return array_double_to_pyobj(output);  // from pycapi_utils.h     
}

以上代码包含在 Python 和

的 scipy.weave 模块中

def model_py(input):
    support_code="""
                 #include "model.h"
                 """
    code = """
           return_val = simulate(input.data());
           """
    libs=['gsl','gslcblas','m']
    vars = ['input']
    out = weave.inline(code,
                       vars,
                       support_code=support_code,
                       sources = source_files,
                       libraries=libs
                       type_converters=converters.blitz,
                       compiler='gcc',
                       extra_compile_args=['-std=c++11'],
                       force=1)

编译失败给出：

error: int _import_array() was not declared in this scope

值得注意的是，如果我将 pycapi_utils.h 和来源 pycapi_utils.cpp 混为一谈，一切正常。但是我不想使用这个解决方案，因为实际上我这里的模块需要包含在其他几个也使用 PyObjects 的模块中并且需要调用 import_array().

我正在寻找有关堆栈交换的，但我无法弄清楚在我的案例中是否以及如何正确定义 #define 指令。此外，post 中的示例并不完全是我的情况，因为 import_array() 在 main() 的全局范围内被调用，而在我的情况下 import_array() 在我的 simulate 由 scipy.weave 构建的 main() 调用的例程。

Answer 1

显然，如果我在 pycapi_utils 模块中包含一个简单的初始化例程，例如：

//pycapi_utils.h
...
...
void init_numpy();

//pycapi_utils.cpp
...
...
void init_numpy(){
    Py_Initialize;
    import_array();
}

然后我在我的 C 代码中使用 Numpy 对象的任何函数/方法的开头调用此例程，它有效。即以上代码编辑为：

//pycapi_utils.cpp
...
...
PyObject* array_double_to_pyobj(...){
    init_numpy();
    ...
    ...
}


//model.cpp
...
...
PyObject* simulate(...){
    init_numpy();
    ...
    ...
}

此时我唯一关心的是是否有一种方法可以最大限度地减少对 init_numpy() 的调用次数，或者无论我必须从我在使用 Numpy 对象的 CPP 模块中定义的任何函数调用它...

Answer 2

我有一个类似的问题，正如你发布的 link 指出的那样，万恶的根源是 PyArray_API 被定义为静态的，这意味着每个翻译单元都有它的自己的 PyArray_API 默认情况下用 PyArray_API = NULL 初始化。因此，必须为每个 *.cpp 文件调用一次 import_array()。在你的情况下，在 pycapi_utils.cpp 中调用它应该就足够了，并且在 model.cpp 中调用一次就足够了。您还可以在实际调用之前测试是否需要 array_import：

if(PyArray_API == NULL)
{
    import_array(); 
}

Numpy/CAPI 编译多个模块时 import_array() 出错

Numpy/CAPI error with import_array() when compiling multiple modules

c

c++

python

numpy

python-c-api