从网站 Google 表格导入数据的正则表达式提取

Regexextract of importdata from website GoogleSheets

目的是提取title and tags from a webpage.

我正在使用 importdata,我希望所有结果都在 1 行中。像这样:

[webpage] [title] [1st tag] [2nd tag] [3 rd tag] [4th tag] ... [last tag]

我卡在了一半my process in googlesheet

我不知道如何走得更远:(任何帮助表示赞赏!

=ARRAYFORMULA({REGEXREPLACE(TEXTJOIN(", ",1,
 QUERY(ARRAY_CONSTRAIN(SUBSTITUTE(IMPORTDATA(A2),"""",""),1000,15),
 "where Col1 contains '<meta property=og:title content='")),
 "<meta property=og:title content=| />",""),
 TRANSPOSE(REGEXEXTRACT(QUERY(TRANSPOSE(QUERY(TRANSPOSE(
 ARRAY_CONSTRAIN(SUBSTITUTE(IMPORTDATA(A2),"""",""),8000,3)),,50000)),
 "where Col1 contains '<a class=btn btn-secondary'"),"\>(.*)+\<"))})

demo spreadsheet