How do I vectorize the following loop in NumPy?

Question

"""Some simulations to predict the future portfolio value based on past distribution. x is 
   a numpy array that contains past returns. The interpolated_returns are the returns
   generated from the cdf of the past returns to simulate future returns. The portfolio 
   starts with a value of 100. portfolio_value is filled up progressively as 
   the program goes through every loop. The value is multiplied by the returns in that 
   period and a dollar is removed."""

import numpy as np

# x (past returns) and cdf_values (their CDF) are assumed to be defined earlier.
portfolio_final = []
for i in range(10000):
    portfolio_value = [100]
    rand_values = np.random.rand(600)
    interpolated_returns = np.interp(rand_values, cdf_values, x)
    interpolated_returns = np.add(interpolated_returns, 1)

    # Each step: grow the previous value by the return, then remove one dollar.
    for j in range(1, len(interpolated_returns) + 1):
        portfolio_value.append(interpolated_returns[j-1] * portfolio_value[j-1])
        portfolio_value[j] = portfolio_value[j] - 1

    portfolio_final.append(portfolio_value[-1])
print(np.mean(portfolio_final))

I can't find a way to write this code with numpy. I looked at iterating with nditer, but I couldn't get anywhere with it.

Answer 1

Forget np.nditer. It does not speed up iteration. Use it only if you intend to move on to the C version (via Cython).

I'm puzzled by that inner loop. What is it supposed to do that is special? Why loop at all?

In tests with simulated values, these two blocks of code produce the same result:

interpolated_returns = np.add(interpolated_returns,1)
for j in range(1,len(interpolated_returns)+1):
    portfolio_value.append(interpolated_returns[j-1]*portfolio[j-1])
    portfolio_value[j] = portfolio_value[j]-1

interpolated_returns = (interpolated_returns+1)*portfolio - 1
portfolio_value = portfolio_value + interpolated_returns.tolist()

I'm assuming that interpolated_returns and portfolio are 1-D arrays of the same length.
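The equivalence claimed above can be checked with a small sketch. It relies on the stated assumption that `portfolio` is an independent 1-D array; the random test data and the names `raw_returns`, `loop_result` and `vector_result` are made up here for illustration. Note that if the loop reads `portfolio_value[j-1]` instead, as in the question, each step depends on the previous one and this element-wise shortcut no longer applies.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 5

# Assumption: `portfolio` is an independent 1-D array with the same
# length as the returns (illustrative random data, not real prices).
portfolio = rng.uniform(90.0, 110.0, size=n)
raw_returns = rng.normal(0.01, 0.01, size=n)

# First block: explicit loop.
interpolated_returns = np.add(raw_returns, 1)
portfolio_value = [100]
for j in range(1, len(interpolated_returns) + 1):
    portfolio_value.append(interpolated_returns[j - 1] * portfolio[j - 1])
    portfolio_value[j] = portfolio_value[j] - 1
loop_result = portfolio_value

# Second block: one array expression, then extend the list.
vectorized = (raw_returns + 1) * portfolio - 1
vector_result = [100] + vectorized.tolist()

print(np.allclose(loop_result, vector_result))
```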

Answer 2

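I think the easiest way to work out how to vectorize this is to look at the equations that govern the evolution of your portfolio and see how it actually iterates, finding patterns that can be vectorized, rather than trying to vectorize the code you already have. You will notice that cumprod shows up quite often in your iteration.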
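As a sketch of that reasoning, the recurrence in the question's loop can be unrolled into a closed form. The symbols are notation introduced here, not taken from the original post: $V_n$ is the portfolio value after step $n$, $r_n$ is one plus the interpolated return in step $n$, and $P_n$ is the cumulative product of the $r_k$ (with $P_0 = 1$). The derivation assumes every $P_n$ is nonzero.

```latex
\begin{aligned}
V_n &= r_n V_{n-1} - 1, \qquad V_0 = 100, \qquad P_n = \prod_{k=1}^{n} r_k \\
\frac{V_n}{P_n} &= \frac{V_{n-1}}{P_{n-1}} - \frac{1}{P_n}
  \quad \text{(divide by } P_n = r_n P_{n-1}\text{)} \\
\frac{V_n}{P_n} &= V_0 - \sum_{k=1}^{n} \frac{1}{P_k}
  \quad \text{(sum the differences from } 1 \text{ to } n\text{)} \\
V_n &= P_n \left( V_0 - \sum_{k=1}^{n} \frac{1}{P_k} \right)
\end{aligned}
```

Here $P_n$ is what `np.cumprod` computes and the running sum is what `np.cumsum` computes, which is the pattern used in the `rcp * (portfolio_values - np.cumsum(1.0/rcp))` line of the code in this answer.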

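That said, you can find semi-vectorized code below. I have also included your code so that you can compare the results, along with a simple-loop version of your code that is much easier to read and to translate into mathematical equations. So if you share this code with other people, I would definitely use the simple-loop option. If you want some fancy vectorization, you can use the vector version. If you need to keep track of the individual steps, you can also add an array to the simple-loop option and append pv at each step.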

Hope that helps.

Edit: I have not tested any of this for speed. That is something you can easily do with timeit.
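A minimal timing sketch along those lines, under stated assumptions: the function names `simple_loop` and `closed_form` are hypothetical, and the gross returns are drawn directly from a normal distribution rather than interpolated from a CDF, just to keep the example short. Actual timings depend on the machine, so none are quoted here.

```python
import timeit

import numpy as np

rng = np.random.default_rng(0)
n_steps = 600

# Assumption: gross returns (1 + return) sampled from a normal
# distribution; a stand-in for the interpolated returns.
rets = 1.0 + rng.normal(0.01, 0.01, size=n_steps)


def simple_loop(rets, start=100.0):
    """Step-by-step recurrence: multiply by the return, remove one dollar."""
    pv = start
    for r in rets:
        pv = pv * r - 1
    return pv


def closed_form(rets, start=100.0):
    """Same final value via cumprod (assumes no cumulative product is zero)."""
    rcp = np.cumprod(rets)
    return rcp[-1] * (start - np.sum(1.0 / rcp))


# Time 200 calls of each variant on the same input.
loop_time = timeit.timeit(lambda: simple_loop(rets), number=200)
vec_time = timeit.timeit(lambda: closed_form(rets), number=200)
print(f"simple loop: {loop_time:.4f} s, cumprod version: {vec_time:.4f} s")
```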

import numpy as np
from scipy.special import erf

# Prepare a simple return model: normally distributed with mu = sigma = 0.01
x = np.linspace(-10,10,100)
cdf_values = 0.5*(1+erf((x-0.01)/(0.01*np.sqrt(2))))

# Prepare setup such that every code snippet uses the same number of steps
# and the same random numbers
nSteps = 600
nIterations = 1
rnd = np.random.rand(nSteps)

# Your code - Gives the (supposedly) correct results
portfolio_final = []
for i in range(nIterations):
    portfolio_value = [100]
    rand_values = rnd
    interpolated_returns = np.interp(rand_values,cdf_values,x)
    interpolated_returns = np.add(interpolated_returns,1)

    for j in range(1,len(interpolated_returns)+1):
        portfolio_value.append(interpolated_returns[j-1]*portfolio_value[j-1])
        portfolio_value[j] = portfolio_value[j]-1

    portfolio_final.append(portfolio_value[-1])
print(np.mean(portfolio_final))

# Using vectors
portfolio_final = []
for i in range(nIterations):
    portfolio_values = np.ones(nSteps)*100.0
    rcp = np.cumprod(np.interp(rnd,cdf_values,x) + 1)
    portfolio_values = rcp * (portfolio_values - np.cumsum(1.0/rcp))
    portfolio_final.append(portfolio_values[-1])
print(np.mean(portfolio_final))

# Simple loop
portfolio_final = []
for i in range(nIterations):
    pv = 100
    rets = np.interp(rnd,cdf_values,x) + 1
    # Use j for the inner loop so it does not shadow the outer loop variable i
    for j in range(nSteps):
        pv = pv * rets[j] - 1
    portfolio_final.append(pv)
print(np.mean(portfolio_final))
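The vector version above still loops over the simulations. As a hedged sketch of how that outer loop might be removed as well, the random numbers can be drawn as a 2-D array (one row per simulation) and the cumulative operations applied along the time axis. The toy return model below (`x` as sorted normal samples with an empirical `cdf_values`) and the names `n_iterations`, `n_steps`, `start_value` and `gross_returns` are assumptions made for this example, not part of the original code.

```python
import numpy as np

rng = np.random.default_rng(0)

# Assumption: a toy return model. `x` holds sorted past returns and
# `cdf_values` their empirical CDF; both stand in for the real data.
x = np.sort(rng.normal(0.01, 0.01, size=1000))
cdf_values = np.arange(1, x.size + 1) / x.size

n_iterations = 1000
n_steps = 600
start_value = 100.0

# One row per simulation, one column per time step.
rand_values = rng.random((n_iterations, n_steps))
gross_returns = np.interp(rand_values, cdf_values, x) + 1

# Closed form applied along the time axis for all simulations at once.
rcp = np.cumprod(gross_returns, axis=1)
portfolio_values = rcp * (start_value - np.cumsum(1.0 / rcp, axis=1))
portfolio_final = portfolio_values[:, -1]
print(np.mean(portfolio_final))

# Cross-check the first simulation against the simple loop.
pv = start_value
for r in gross_returns[0]:
    pv = pv * r - 1
print(np.isclose(pv, portfolio_final[0]))
```

Two caveats on this design: it holds an array of n_iterations by n_steps floats in memory, and the division `1.0 / rcp` breaks down if any cumulative product reaches zero (a return of -100%), in which case the simple loop is the safer choice.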


Tags: python, loops, numpy, vectorization, montecarlo