使用 Python Selenium 获取当前名称
Getting current designation using Python Selenium
我正在尝试使用 Python Selenium 获取 LinkedIn 个人资料列表的当前名称。我想要 'Current:' 和 'Summary:' 之后的部分。
这是HTML:
<div class="linked-area flex-1 cursor-pointer">
<p class="entity-result__summary entity-result__summary--2-lines t-12 t-black--light ">
<!---->Current: Full Stack Software<span class="white-space-pre"> </span>
<strong><!---->Developer<!----></strong><span class="white-space-pre"> </span>at GE Healthcare<!---->
</p>
</div>
我试过这个:
currentDsgn = []
currentDesignations = browser.find_elements_by_class_name('linked-area flex-1')
print(currentDesignations)
for currentDesignation in currentDesignations:
print(currentDesignation)
currentDsgn.append(currentDesignation.text.strip())
但是我得到一个空列表。
改用这个 xpath :
//div[contains(@class, 'linked-area')]/p[contains(@class, 'entity-result__summary')]
并像下面这样使用它:
currentDesignations = browser.find_elements_by_xpath("//div[contains(@class, 'linked-area')]/p[contains(@class, 'entity-result__summary')]")
for currentDesignation in currentDesignations:
print(currentDesignation.get_attribute('innerHTML'))
我正在尝试使用 Python Selenium 获取 LinkedIn 个人资料列表的当前名称。我想要 'Current:' 和 'Summary:' 之后的部分。
这是HTML:
<div class="linked-area flex-1 cursor-pointer">
<p class="entity-result__summary entity-result__summary--2-lines t-12 t-black--light ">
<!---->Current: Full Stack Software<span class="white-space-pre"> </span>
<strong><!---->Developer<!----></strong><span class="white-space-pre"> </span>at GE Healthcare<!---->
</p>
</div>
我试过这个:
currentDsgn = []
currentDesignations = browser.find_elements_by_class_name('linked-area flex-1')
print(currentDesignations)
for currentDesignation in currentDesignations:
print(currentDesignation)
currentDsgn.append(currentDesignation.text.strip())
但是我得到一个空列表。
改用这个 xpath :
//div[contains(@class, 'linked-area')]/p[contains(@class, 'entity-result__summary')]
并像下面这样使用它:
currentDesignations = browser.find_elements_by_xpath("//div[contains(@class, 'linked-area')]/p[contains(@class, 'entity-result__summary')]")
for currentDesignation in currentDesignations:
print(currentDesignation.get_attribute('innerHTML'))