How do I get the occurrence of a sentence with Google Ngram Viewer and Python?

Question

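Short background: I'm trying to improve Peter Norvig's spelling corrector in Python. For this I need the occurrence frequency of a short sentence (up to 3-4 words). The Google Ngram Viewer helps me a lot, but I don't know how to get the values through an API or some other way.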

Pseudocode:

# Sentence without meaning but word for word correct.
>> occurrence("were are you")
0.0000000978

# Sentence that makes sense
>> occurrence("where are you")
0.000148

# Then my method should return the sentence with the highest value. (But that's not the problem.)

Sorry for my English :-D Thanks!

Answer 1

They actually have an undocumented API.

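import requests

term = "where are you"
# Pass the query via params so requests URL-encodes the spaces in the phrase.
params = {
    "content": term,
    "year_start": 1800,
    "year_end": 2000,
    "corpus": 26,
    "smoothing": 3,
}
resp = requests.get("https://books.google.com/ngrams/json", params=params)
resp.raise_for_status()  # fail loudly instead of leaving results undefined
results = resp.json()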

results[0]['timeseries'] has the frequencies you need, one value per year:

[2.854326695000964e-07,
 3.4926038665616944e-07,
 3.3916604043800663e-07,
 ...]

Source: https://jameshfisher.com/2018/11/25/google-ngram-api/
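
To turn the yearly series into the single score that occurrence() in the question asks for, one option is to average the values and pick the candidate with the highest mean. The following is a minimal sketch, not part of the original answer: the helper names (fetch_results, mean_frequency, best_candidate) are hypothetical, averaging is an assumed scoring choice, and urllib from the standard library is used in place of requests only to keep the sketch dependency-free. Because the endpoint is undocumented, its parameters and response shape may change.

```python
import json
from statistics import mean
from urllib.parse import urlencode
from urllib.request import urlopen

NGRAM_URL = "https://books.google.com/ngrams/json"


def fetch_results(phrase, year_start=1800, year_end=2000, corpus=26, smoothing=3):
    """Query the undocumented endpoint and return the parsed JSON list."""
    query = urlencode({
        "content": phrase,
        "year_start": year_start,
        "year_end": year_end,
        "corpus": corpus,
        "smoothing": smoothing,
    })
    with urlopen(f"{NGRAM_URL}?{query}", timeout=10) as resp:
        return json.load(resp)


def mean_frequency(results):
    """Average the yearly frequencies of the first result.

    Assumption: an empty result list means the phrase was not found,
    which is scored as 0.0 here.
    """
    if not results:
        return 0.0
    return mean(results[0]["timeseries"])


def best_candidate(candidates, fetch=fetch_results):
    """Return the candidate phrase with the highest mean frequency."""
    return max(candidates, key=lambda phrase: mean_frequency(fetch(phrase)))
```

Calling best_candidate(["were are you", "where are you"]) issues one request per candidate. The fetch argument exists so a caching or offline function can be swapped in, which is worth doing if many candidates are scored, since an undocumented endpoint may throttle frequent requests.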

python

nlp

n-gram