使用 rvest 获取 Google 结果标题时获取 "Character(0)" 错误

Question

抱歉，如果我的问题很简单或问得不好，我对使用 R 进行网络抓取还很陌生。

我正在尝试从 Google 搜索中抓取头条新闻。抱歉，如果它与之前在下面 link 中提出的请求完全相同，但是它对我不起作用（它仍然 returns "字符(0)" ).

根据上面 link 中提供的答案，这是我尝试过的两个脚本：

#脚本 1

library(rvest)
library(dplyr)
web1 <- read_html("https://www.google.at/search?q=munich+prices")  
web1 %>% 
    html_nodes(xpath = '//div/div/div/a/div[not(div)]') %>% 
    html_text

#脚本 2

library(rvest)
library(dplyr) 
web1 <- read_html("https://www.google.at/search?q=munich+prices")  
web1 %>%
    html_nodes(xpath = '//div/div/div/a/h3/div[not(div)]') %>%
    html_text

这两个脚本对我来说仍然是return“字符(0)”。

有人知道吗？

感谢您的帮助。维克多

Answer 1

这里要求截图，

   library(rvest)
    library(dplyr) 
    web1 <- read_html("https://www.google.at/search?q=munich+prices")  
    web1 %>%
        html_nodes(xpath = '//div/div/div/a/h3/div[not(div)]') %>%
        html_text

使用 rvest 获取 Google 结果标题时获取 "Character(0)" 错误

Obtaining "Character(0)" error when using rvest to get Google results headlines

r

google-search

web-scraping