IMPORTXML query in Google Sheets

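When I use IMPORTXML with a class-based XPath, I get no data. I checked the page source (right-click, View page source) and the sentence is there. I located the line with Ctrl+F and tried the XPaths listed below, but they return #N/A. I am new to IMPORTXML queries in Google Sheets. I need the result shown in cell E55 of the attached snapshot, with the Japanese and English sentences in cells I55 and J55.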

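The sentences come from the search below; the first sentence in the results, in Japanese and English, is always the one to use. Here is the snapshot.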

I tried a few combinations, but they all show #N/A:

  1. =IMPORTXML("https://tangorin.com/sentences?search=時","//div[@class='s-jp']")

  2. =IMPORTXML("https://tangorin.com/sentences?search=時","//div[@class='entry entry-border sentences undefined ']/dd[@class='s-jp']")

Can anyone help? I cannot copy the Japanese and English sentences by hand for 2000 characters, and I need them for a translation class. Thanks a lot.

Output:

Three formulas to get the data (the URL is in cell D2).

For the kana, in D4:

=TRANSPOSE(SPLIT(SUBSTITUTE(TEXTJOIN("",TRUE,IMPORTXML(D2,"//div[@class='results-main-container']//dt//text()[not(parent::rt)]|//dd[@class='s-en']/@class")),"s-en","¤"),"¤"))
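
The join-and-split trick in this formula can be hard to see inside the nesting, so here is a minimal Python sketch of the same idea. It assumes the union XPath returns the text fragments of each sentence in document order, each followed by the class value "s-en" of the English line that comes after it. The `nodes` list and the function name are hypothetical illustration data, not real output from the site.

```python
# Hypothetical flat result of the union XPath, in document order:
# the text fragments of one sentence, then the "s-en" class value
# that marks where that sentence ends.
nodes = ["時", "は", "金", "なり", "s-en", "時", "が", "ない", "s-en"]


def split_sentences(nodes, marker="s-en", sentinel="¤"):
    # TEXTJOIN("", TRUE, ...): concatenate, skipping empty values
    joined = "".join(n for n in nodes if n)
    # SUBSTITUTE(..., "s-en", "¤"): turn the marker into a single sentinel character
    joined = joined.replace(marker, sentinel)
    # SPLIT(..., "¤"): cut on the sentinel; empty pieces are dropped
    return [piece for piece in joined.split(sentinel) if piece]


sentences = split_sentences(nodes)
for sentence in sentences:
    print(sentence)
```

The sentinel only needs to be a character that never appears in the sentences themselves, which is presumably why "¤" is used.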

For the romaji, in E4:

=ARRAYFORMULA(TRIM(TRANSPOSE(SPLIT(SUBSTITUTE(TEXTJOIN(" ",TRUE,IMPORTXML(D2,"//div[@class='results-main-container']//dt//rt|//dd[@class='s-en']/@class")),"s-en","¤"),"¤"))))

For the English, in F4:

=ARRAYFORMULA(TRIM(TRANSPOSE(SPLIT(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(REGEXREPLACE(TEXTJOIN(" ",FALSE,IMPORTXML(D2,"//div[@class='results-main-container']//dd/span|(//dd[@class='s-en'])[not(position()=1)]/@class")),"(\w)(  )(\w)","$1'$3"),"s-en","¤")," , ",", ")," . ",".")," - ","-")," ( "," (")," ) ",") ")," !","!")," ?","?"),"¤"))))
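
The cleanup chain in the English formula is easier to follow outside the nesting. The Python sketch below mirrors it under one assumption: the REGEXREPLACE step is meant to put an apostrophe between the two captured word characters (written `$1'$3` in Sheets syntax), because an empty node between two word spans leaves a double space after the join. The sample input and the function name `tidy` are hypothetical.

```python
import re


def tidy(raw):
    # Assumption: a double space between two word characters marks a lost
    # apostrophe, e.g. "don  t" -> "don't". The backreferences keep the
    # letters on both sides of the gap.
    s = re.sub(r"(\w)  (\w)", r"\1'\2", raw)
    # Same order as the nested SUBSTITUTE calls: tighten spaces around punctuation.
    replacements = [
        (" , ", ", "), (" . ", "."), (" - ", "-"),
        (" ( ", " ("), (" ) ", ") "), (" !", "!"), (" ?", "?"),
    ]
    for old, new in replacements:
        s = s.replace(old, new)
    return s.strip()  # TRIM


print(tidy("I don  t have time , sorry !"))
```

Note that a replacement of just `'` without the backreferences would also delete the letters on either side of the gap.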

To limit the array to a single result, you can use something like:

=INDEX(one of the preceding formulas,1,1)

Output:

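EDIT: If you need something like this (the words in a column, with the first example retrieved for each one). /!\ Limit the number of words to search. Each word costs 3 IMPORTXML requests, so 20 words means 60 requests, which will slow the sheet down.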

In column B, copy and paste the words to search for.

For the kana in cells C3, C4, C5..., use the following formula:

=TEXTJOIN("",TRUE,IMPORTXML("https://tangorin.com/sentences?search="&B3,"(//div[@class='results-main-container']//dt)[1]//text()[not(parent::rt)]"))

For the romaji in cells D3, D4, D5..., use the following formula:

=TEXTJOIN(" ",TRUE,IMPORTXML("https://tangorin.com/sentences?search="&B3,"(//div[@class='results-main-container']//dt)[1]//rt"))
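
The two XPath expressions above differ only in which nodes they keep: `text()[not(parent::rt)]` takes every text node whose parent is not an `rt` element, while `//rt` takes only the `rt` readings. A rough Python sketch of that selection follows, using the standard library only. The `snippet` markup is a hypothetical stand-in (ruby annotations inside a `dt`), not the site's real HTML.

```python
import xml.etree.ElementTree as ET

# Hypothetical markup: base text with readings in <rt> elements.
snippet = "<dt><ruby>時<rt>toki</rt></ruby>は<ruby>金<rt>kane</rt></ruby>なり</dt>"
dt = ET.fromstring(snippet)


def text_nodes_outside_rt(elem):
    # Collect, in document order, every text node whose parent is not <rt>.
    keep = elem.tag != "rt"
    parts = []
    if keep and elem.text:
        parts.append(elem.text)
    for child in elem:
        parts.extend(text_nodes_outside_rt(child))
        # A child's tail text belongs to the current element, not to the child.
        if keep and child.tail:
            parts.append(child.tail)
    return parts


kana = "".join(text_nodes_outside_rt(dt))                       # TEXTJOIN("", TRUE, ...)
romaji = " ".join(rt.text for rt in dt.iter("rt") if rt.text)   # TEXTJOIN(" ", TRUE, ...)
print(kana)
print(romaji)
```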

For the English in cells E3, E4, E5..., use the following formula:

=SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(REGEXREPLACE(TEXTJOIN(" ",FALSE,IMPORTXML("https://tangorin.com/sentences?search="&B3,"(//div[@class='results-main-container']//dd/span)[1]")),"(\w)(  )(\w)","$1'$3"),"s-en","¤")," ,  ",", ")," .",".")," - ","-")," ( "," (")," ) ",") ")," !","!")," ?","?"),". ",".")