问:在 Python 中连续 N 次正面朝上的预期抛硬币次数。我的代码给出的答案与已发布的正确答案不匹配,但不确定原因
Q: Expected number of coin tosses to get N heads in a row, in Python. My code gives answers that don't match published correct ones, but unsure why
我正在尝试编写 Python 代码来查看平均需要抛多少次硬币才能连续出现 N 个正面朝上的序列。
让我感到困惑的是,我的代码生成的答案与在线给出的答案不匹配,例如这里(以及许多其他地方)https://math.stackexchange.com/questions/364038/expected-number-of-coin-tosses-to-get-five-consecutive-heads
据此,我需要连续获得各种正面朝上的预期投掷次数为:E(1) = 2, E(2) = 6, E(3) = 14, E(4) = 30,E(5) = 62。但我没有得到这些答案!例如,我得到 E(3) = 8,而不是 14。下面的代码 运行s 给出了答案,但您可以更改 n 以测试连续出现的其他目标数量。
出了什么问题?估计是我的代码逻辑有问题,但我承认我搞不清楚是什么。
您可以在此处查看 运行 并修改我的代码副本:https://trinket.io/python/17154b2cbd
下面是代码本身,在 运行nable trinket.io 页面之外。如果您能帮助找出问题所在,我们将不胜感激!
非常感谢,
拉吉
P.S。我能找到的最接近的相关问题是这个:
但是,据我所知,该问题中的代码实际上并没有测试两个 consecutive 头,而是测试一个以头开始然后在稍后的序列,可能是非连续的,时间得到另一个头。
# Click here to run and/or modify this code:
# https://trinket.io/python/17154b2cbd
import random
# n is the target number of heads in a row
# Change the value of n, for different target heads-sequences
n = 3
possible_tosses = [ 'h', 't' ]
num_trials = 1000
target_seq = ['h' for i in range(0,n)]
toss_sequence = []
seq_lengths_rec = []
for trial_num in range(0,num_trials):
if (trial_num % 100) == 0:
print 'Trial num', trial_num, 'out of', num_trials
# (The free version of trinket.io uses Python2)
target_reached = 0
toss_num = 0
while target_reached == 0:
toss_num += 1
random.shuffle(possible_tosses)
this_toss = possible_tosses[0]
#print([toss_num, this_toss])
toss_sequence.append(this_toss)
last_n_tosses = toss_sequence[-n:]
#print(last_n_tosses)
if last_n_tosses == target_seq:
#print('Reached target at toss', toss_num)
target_reached = 1
seq_lengths_rec.append(toss_num)
print 'Average', sum(seq_lengths_rec) / len(seq_lengths_rec)
您不会为每个实验重新初始化 toss_sequence
,因此您使用预先存在的正面序列开始每个实验,在第一次尝试时有二分之一的机会击中目标序列每个新实验。
在外循环中初始化 toss_sequence
将解决您的问题:
import random
# n is the target number of heads in a row
# Change the value of n, for different target heads-sequences
n = 4
possible_tosses = [ 'h', 't' ]
num_trials = 1000
target_seq = ['h' for i in range(0,n)]
seq_lengths_rec = []
for trial_num in range(0,num_trials):
if (trial_num % 100) == 0:
print('Trial num {} out of {}'.format(trial_num, num_trials))
# (The free version of trinket.io uses Python2)
target_reached = 0
toss_num = 0
toss_sequence = []
while target_reached == 0:
toss_num += 1
random.shuffle(possible_tosses)
this_toss = possible_tosses[0]
#print([toss_num, this_toss])
toss_sequence.append(this_toss)
last_n_tosses = toss_sequence[-n:]
#print(last_n_tosses)
if last_n_tosses == target_seq:
#print('Reached target at toss', toss_num)
target_reached = 1
seq_lengths_rec.append(toss_num)
print(sum(seq_lengths_rec) / len(seq_lengths_rec))
您可以稍微简化您的代码,使其不易出错:
import random
# n is the target number of heads in a row
# Change the value of n, for different target heads-sequences
n = 3
possible_tosses = [ 'h', 't' ]
num_trials = 1000
seq_lengths_rec = []
for trial_num in range(0, num_trials):
if (trial_num % 100) == 0:
print('Trial num {} out of {}'.format(trial_num, num_trials))
# (The free version of trinket.io uses Python2)
heads_counter = 0
toss_counter = 0
while heads_counter < n:
toss_counter += 1
this_toss = random.choice(possible_tosses)
if this_toss == 'h':
heads_counter += 1
else:
heads_counter = 0
seq_lengths_rec.append(toss_counter)
print(sum(seq_lengths_rec) / len(seq_lengths_rec))
我们可以通过 运行 每个实验足够长(理想情况下是无限次)的次数来消除一个额外的循环,例如,每次抛硬币 n=1000 次。现在,很可能在每次这样的试验中都会出现 5 个正面朝上的序列。如果确实出现了,我们可以称该试验为有效试验,否则我们可以拒绝该试验。
最后,我们可以取平均需要的投掷次数w.r.t。有效试验的次数(通过 LLN 它将近似于预期的投掷次数)。考虑以下代码:
N = 100000 # total number of trials
n = 1000 # long enough sequence of tosses
k = 5 # k heads in a row
ntosses = []
pat = ''.join(['1']*k)
effective_trials = 0
for i in range(N): # num of trials
seq = ''.join(map(str,random.choices(range(2),k=n))) # toss a coin n times (long enough times)
if pat in seq:
ntosses.append(seq.index(pat) + k)
effective_trials += 1
print(effective_trials, sum(ntosses) / effective_trials)
# 100000 62.19919
请注意,如果 n 较小,结果可能不正确,因为它试图近似无限次抛硬币(要找到预期的抛硬币次数以获得连续 5 次正面朝上,n=1000 是可以的,因为实际预期值为 62)。
我正在尝试编写 Python 代码来查看平均需要抛多少次硬币才能连续出现 N 个正面朝上的序列。
让我感到困惑的是,我的代码生成的答案与在线给出的答案不匹配,例如这里(以及许多其他地方)https://math.stackexchange.com/questions/364038/expected-number-of-coin-tosses-to-get-five-consecutive-heads
据此,我需要连续获得各种正面朝上的预期投掷次数为:E(1) = 2, E(2) = 6, E(3) = 14, E(4) = 30,E(5) = 62。但我没有得到这些答案!例如,我得到 E(3) = 8,而不是 14。下面的代码 运行s 给出了答案,但您可以更改 n 以测试连续出现的其他目标数量。
出了什么问题?估计是我的代码逻辑有问题,但我承认我搞不清楚是什么。
您可以在此处查看 运行 并修改我的代码副本:https://trinket.io/python/17154b2cbd
下面是代码本身,在 运行nable trinket.io 页面之外。如果您能帮助找出问题所在,我们将不胜感激!
非常感谢,
拉吉
P.S。我能找到的最接近的相关问题是这个:
# Click here to run and/or modify this code:
# https://trinket.io/python/17154b2cbd
import random
# n is the target number of heads in a row
# Change the value of n, for different target heads-sequences
n = 3
possible_tosses = [ 'h', 't' ]
num_trials = 1000
target_seq = ['h' for i in range(0,n)]
toss_sequence = []
seq_lengths_rec = []
for trial_num in range(0,num_trials):
if (trial_num % 100) == 0:
print 'Trial num', trial_num, 'out of', num_trials
# (The free version of trinket.io uses Python2)
target_reached = 0
toss_num = 0
while target_reached == 0:
toss_num += 1
random.shuffle(possible_tosses)
this_toss = possible_tosses[0]
#print([toss_num, this_toss])
toss_sequence.append(this_toss)
last_n_tosses = toss_sequence[-n:]
#print(last_n_tosses)
if last_n_tosses == target_seq:
#print('Reached target at toss', toss_num)
target_reached = 1
seq_lengths_rec.append(toss_num)
print 'Average', sum(seq_lengths_rec) / len(seq_lengths_rec)
您不会为每个实验重新初始化 toss_sequence
,因此您使用预先存在的正面序列开始每个实验,在第一次尝试时有二分之一的机会击中目标序列每个新实验。
在外循环中初始化 toss_sequence
将解决您的问题:
import random
# n is the target number of heads in a row
# Change the value of n, for different target heads-sequences
n = 4
possible_tosses = [ 'h', 't' ]
num_trials = 1000
target_seq = ['h' for i in range(0,n)]
seq_lengths_rec = []
for trial_num in range(0,num_trials):
if (trial_num % 100) == 0:
print('Trial num {} out of {}'.format(trial_num, num_trials))
# (The free version of trinket.io uses Python2)
target_reached = 0
toss_num = 0
toss_sequence = []
while target_reached == 0:
toss_num += 1
random.shuffle(possible_tosses)
this_toss = possible_tosses[0]
#print([toss_num, this_toss])
toss_sequence.append(this_toss)
last_n_tosses = toss_sequence[-n:]
#print(last_n_tosses)
if last_n_tosses == target_seq:
#print('Reached target at toss', toss_num)
target_reached = 1
seq_lengths_rec.append(toss_num)
print(sum(seq_lengths_rec) / len(seq_lengths_rec))
您可以稍微简化您的代码,使其不易出错:
import random
# n is the target number of heads in a row
# Change the value of n, for different target heads-sequences
n = 3
possible_tosses = [ 'h', 't' ]
num_trials = 1000
seq_lengths_rec = []
for trial_num in range(0, num_trials):
if (trial_num % 100) == 0:
print('Trial num {} out of {}'.format(trial_num, num_trials))
# (The free version of trinket.io uses Python2)
heads_counter = 0
toss_counter = 0
while heads_counter < n:
toss_counter += 1
this_toss = random.choice(possible_tosses)
if this_toss == 'h':
heads_counter += 1
else:
heads_counter = 0
seq_lengths_rec.append(toss_counter)
print(sum(seq_lengths_rec) / len(seq_lengths_rec))
我们可以通过 运行 每个实验足够长(理想情况下是无限次)的次数来消除一个额外的循环,例如,每次抛硬币 n=1000 次。现在,很可能在每次这样的试验中都会出现 5 个正面朝上的序列。如果确实出现了,我们可以称该试验为有效试验,否则我们可以拒绝该试验。
最后,我们可以取平均需要的投掷次数w.r.t。有效试验的次数(通过 LLN 它将近似于预期的投掷次数)。考虑以下代码:
N = 100000 # total number of trials
n = 1000 # long enough sequence of tosses
k = 5 # k heads in a row
ntosses = []
pat = ''.join(['1']*k)
effective_trials = 0
for i in range(N): # num of trials
seq = ''.join(map(str,random.choices(range(2),k=n))) # toss a coin n times (long enough times)
if pat in seq:
ntosses.append(seq.index(pat) + k)
effective_trials += 1
print(effective_trials, sum(ntosses) / effective_trials)
# 100000 62.19919
请注意,如果 n 较小,结果可能不正确,因为它试图近似无限次抛硬币(要找到预期的抛硬币次数以获得连续 5 次正面朝上,n=1000 是可以的,因为实际预期值为 62)。