在 Python 中缩放正态分布

Question

我想绘制正态分布的直方图，并在其上绘制相应的正态分布。网上有几个关于 y 轴标准化为 density=True 的正态分布的示例。在我的例子中，我试图在没有密度类型归一化的情况下形成正态分布曲线。也许，这可能是一个隐含的数学问题，但我无法弄清楚如何“取消标准化”分布曲线。以下是我的代码：

import numpy as np
from scipy.stats import norm
import matplotlib.pyplot as plt

mu = 1e-3
std = 1.0e-4
nsize = 10000
ymax = 5000

# Generate some data for this demonstration.
data = norm.rvs(mu, std, size=nsize)

# Plot the histogram.
plt.hist(data, bins=20, color='b', edgecolor='black')

# Plot the PDF.
xmin, xmax = [0.5e-3, 1.5e-3] #plt.xlim()
x = np.linspace(xmin, xmax, 100)
p = norm.pdf(x, mu, std)                      # something to do with this line
plt.plot(x, p, 'k', linewidth=2)
plt.axvline(mu, linestyle='dashed', color='black')
plt.ylim([0, ymax])

这会产生以下情节。

可以看出，直方图下方的面积将等于 10000 (nsize)，这是数据点的数量。然而，“分配曲线”却并非如此。如何得到与直方图匹配的曲线？

Answer 1

看起来 plt returns hist 总计 nsize。所以我们可以缩放 p:

# Plot the histogram.
hist, bins, _ = plt.hist(data, bins=20, color='b', edgecolor='black')

# Plot the PDF.
xmin, xmax = [0.5e-3, 1.5e-3] #plt.xlim()

# changes here
p = norm.pdf(bins, mu, std)           
plt.plot(bins, p/p.sum() * nsize , 'r', linewidth=2)

输出：

在 Python 中缩放正态分布

Scaling a normal distribution in Python

python

normal-distribution

matplotlib