我应该怎么做而不是在 CuPy 中使用 numpy.vectorize？

Question

我应该如何将定义的函数应用于 cupy.array 而不是 np.vectorize？ cupy 有类似的功能吗？

我在Python3.6.9.

中编写模拟程序

我想用 CuPy（6.0.0 for CUDA10.1）在 GPU（GTX1060，NVIDIA）中进行模拟。

在原始代码中，函数numpy.vectorize 用于将定义的函数应用到np.array。然而，同样的功能还没有在 CuPy 中实现。

原代码（使用numpy）如下：

#For define function
def rate(tmean,x,y,z):
    rate = 1/z/(1 + math.exp(-x*(tmean-y)))
    #DVR<0
    if rate < 0:
        rate = 0
    return rate

#tmean is temperature data(365,100,100) and loaded as np.array
#paras is parameter(3,100,100)
#vectorized
f = np.vectorize(rate)
#roop
for i in range(365):
    #calc developing rate(by function "rate") and accumulate
    dvi[i,:,:] = dvi[i-1,:,:] + f(tmean[i,:,:],paras[0],paras[1],paras[2])

我知道 CuPy 中几乎已经实现了 numpy 的功能。所以我改变了

f = np.vectorized(rate)

到

f= cp.vectorized(rate)

但是发生了AttributeError。

Answer 1

GPU 无法并行化任意 Python 代码。用 NumPy 兼容的操作编写所有内容，例如

def rate_(xp, tmean,x,y,z):
    rate = 1/z/(1 + xp.exp(-x*(tmean-y)))
    rate[rate < 0] = 0
    return rate

f = functools.partial(rate_, xp=cupy)

要加快速度，您可以使用 cupy.ElementwiseKernel (https://docs-cupy.chainer.org/en/stable/tutorial/kernel.html)，它会为矢量化操作创建单个内核。

f = cupy.ElementwiseKernel(
    'T tmean, T x, T y, T z',
    'T rate',
    '''
    rate = 1/z/(1 + exp(-x*(tmean-y)));
    // DVR<0
    if (rate < 0) {
        rate = 0;
    }
    '''
)

要从 Python 代码创建内核，请尝试 cupy.fuse。

@cupy.fuse()
def f(tmean,x,y,z):
    rate = 1/z/(1 + cupy.exp(-x*(tmean-y)))
    return cupy.where(rate < 0, 0, rate)  # __setitem__ is not fully supported

我应该怎么做而不是在 CuPy 中使用 numpy.vectorize？

How should I do instead of using numpy.vectorize in CuPy?

python

cupy