Nokogiri 价格未显示
Nokogiri price doesn't show
谁能解释一下如何使用 nokogiri 检索价格的值?
出现在我创建的抓取器上的值是这个
Costa Rica
<span class="ProductListElement__price"> </span>
India
<span class="ProductListElement__price"> </span>
Indonesia
<span class="ProductListElement__price"> </span>
Colombia
<span class="ProductListElement__price"> </span>
Nicaragua
<span class="ProductListElement__price"> </span>
Ethiopia
<span class="ProductListElement__price"> </span>
Master Origin Pack (50 cápsulas)
<span class="ProductListElement__price"> </span>
CAFÉ İSTANBUL
<span class="ProductListElement__price"> </span>
Envivo Lungo
<span class="ProductListElement__price"> </span>
Fortissio Lungo
<span class="ProductListElement__price"> </span>
Vivalto Lungo
<span class="ProductListElement__price"> </span>
Linizio Lungo
<span class="ProductListElement__price"> </span>
Livanto
<span class="ProductListElement__price"> </span>
Capriccio
<span class="ProductListElement__price"> </span>
Volluto
<span class="ProductListElement__price"> </span>
Cosi
<span class="ProductListElement__price"> </span>
Kazaar
<span class="ProductListElement__price"> </span>
Dharkan
<span class="ProductListElement__price"> </span>
Ristretto
<span class="ProductListElement__price"> </span>
Arpeggio
<span class="ProductListElement__price"> </span>
Roma
<span class="ProductListElement__price"> </span>
Ristretto Decaffeinato
<span class="ProductListElement__price"> </span>
Arpeggio Decaffeinato
<span class="ProductListElement__price"> </span>
Volluto Decaffeinato
<span class="ProductListElement__price"> </span>
Vivalto Lungo Decaffeinato
<span class="ProductListElement__price"> </span>
Vanilio
<span class="ProductListElement__price"> </span>
Caramelito
<span class="ProductListElement__price"> </span>
我的控制器是这样的:
class CupsController < ApplicationController
class Entry
def initialize(name, price)
@name = name
@price = price
end
attr_reader :name
attr_reader :price
end
def cups
require 'open-uri'
require 'nokogiri'
doc = Nokogiri::HTML(open('https://www.nespresso.com/pt/pt/order/capsules'))
entries = doc.css("article.ProductListElement")
@entriesArray = []
entries.each do |entry|
name = entry.css('.ProductListElement__name').text
price = entry.css('span.ProductListElement__price')
@entriesArray << Entry.new(name, price)
@name = name
@price = price
end
render template: 'cups/home'
end
end
这是我收到的每个结果:
<span class="ProductListElement__price"> </span>
这对我来说表明网页加载后 JavaScript 会动态加载价格。
为了能够抓取动态加载的数据,您需要使用像 Watir 这样的库,Rails 5.
使用 Watir,您可以等到所有脚本都已执行并加载所有数据,然后再尝试抓取网站。
谁能解释一下如何使用 nokogiri 检索价格的值? 出现在我创建的抓取器上的值是这个
Costa Rica
<span class="ProductListElement__price"> </span>
India
<span class="ProductListElement__price"> </span>
Indonesia
<span class="ProductListElement__price"> </span>
Colombia
<span class="ProductListElement__price"> </span>
Nicaragua
<span class="ProductListElement__price"> </span>
Ethiopia
<span class="ProductListElement__price"> </span>
Master Origin Pack (50 cápsulas)
<span class="ProductListElement__price"> </span>
CAFÉ İSTANBUL
<span class="ProductListElement__price"> </span>
Envivo Lungo
<span class="ProductListElement__price"> </span>
Fortissio Lungo
<span class="ProductListElement__price"> </span>
Vivalto Lungo
<span class="ProductListElement__price"> </span>
Linizio Lungo
<span class="ProductListElement__price"> </span>
Livanto
<span class="ProductListElement__price"> </span>
Capriccio
<span class="ProductListElement__price"> </span>
Volluto
<span class="ProductListElement__price"> </span>
Cosi
<span class="ProductListElement__price"> </span>
Kazaar
<span class="ProductListElement__price"> </span>
Dharkan
<span class="ProductListElement__price"> </span>
Ristretto
<span class="ProductListElement__price"> </span>
Arpeggio
<span class="ProductListElement__price"> </span>
Roma
<span class="ProductListElement__price"> </span>
Ristretto Decaffeinato
<span class="ProductListElement__price"> </span>
Arpeggio Decaffeinato
<span class="ProductListElement__price"> </span>
Volluto Decaffeinato
<span class="ProductListElement__price"> </span>
Vivalto Lungo Decaffeinato
<span class="ProductListElement__price"> </span>
Vanilio
<span class="ProductListElement__price"> </span>
Caramelito
<span class="ProductListElement__price"> </span>
我的控制器是这样的:
class CupsController < ApplicationController
class Entry
def initialize(name, price)
@name = name
@price = price
end
attr_reader :name
attr_reader :price
end
def cups
require 'open-uri'
require 'nokogiri'
doc = Nokogiri::HTML(open('https://www.nespresso.com/pt/pt/order/capsules'))
entries = doc.css("article.ProductListElement")
@entriesArray = []
entries.each do |entry|
name = entry.css('.ProductListElement__name').text
price = entry.css('span.ProductListElement__price')
@entriesArray << Entry.new(name, price)
@name = name
@price = price
end
render template: 'cups/home'
end
end
这是我收到的每个结果:
<span class="ProductListElement__price"> </span>
这对我来说表明网页加载后 JavaScript 会动态加载价格。
为了能够抓取动态加载的数据,您需要使用像 Watir 这样的库,Rails 5.
使用 Watir,您可以等到所有脚本都已执行并加载所有数据,然后再尝试抓取网站。