点击后访问下一个网页

Access next webpage after clicking

要求:点击下面$ie.Navigate中命名的网页后。我需要访问接下来打开的网页的 HTML / OuterHTML 源。

例如:当我打开 https://www.healthkartplus.com/search/all?name=Sporanox(通过设置 $control = Sporanox)时,下面的代码只是简单地点击第一个匹配项 link。单击 link 后,我需要访问结果页面的 HTML。

更新:提到了另一个 SO 问题,了解到我们可以搜索合适的 window。代码似乎适用于某些场景,但并非适用于所有场景。对于 $ie2,我在访问文档 属性.

时遇到问题
function getStringMatch
 {
    # Loop through all 2 digit combinations in the $path directory
    foreach ($control In $controls)
    {
        $ie = New-Object -COMObject InternetExplorer.Application
        $ie.visible = $true
        $site = $ie.Navigate("https://www.healthkartplus.com/search/all?name=$control")
        $ie.ReadyState

        while ($ie.Busy -and $ie.ReadyState -ne 4){ sleep -Milliseconds 100 }

        $link = $null
        $link = $ie.Document.get_links() | where-object {$_.innerText -eq "$control"}
        $link.click()

        while ($ie.Busy -and $ie.ReadyState -ne 4){ sleep -Milliseconds 100 }

       $ie2 = (New-Object -COM 'Shell.Application').Windows() | ? {
       $_.Name -eq 'Windows Internet Explorer' -and $_.LocationName -match "^$control"
       }

        # NEED outerHTML of new page. CURRENTLY it is working for some.

        $ie.Document.body.outerHTML > d:\med$control.txt
    }
}

$controls = "Sporanox"

getStringMatch

我认为问题出在您在第一页中查找 link 时。 link innerText 不等于 $control,它包含 $control 即 innerText 是 "Sporanox (100mg)".

以下内容可能会有所帮助:

$link = $ie.Document.get_links() | where-object {if ($_.innerText){$_.innerText.contains($control)}}

编辑

这是我使用的完整代码:

function getStringMatch
{
    # Loop through all 2 digit combinations in the $path directory
    foreach ($control In $controls)  
    {
        $ie = New-Object -COMObject InternetExplorer.Application
        $ie.visible = $true
        $site = $ie.Navigate("https://www.healthkartplus.com/search/all?name=$control")
        $ie.ReadyState

        while ($ie.Busy -and $ie.ReadyState -ne 4){ sleep -Milliseconds 100 }   

        $link = $null
        $link = $ie.Document.get_links() | where-object {if ($_.innerText){$_.innerText.contains($control)}}
        $link.click()

        while ($ie.Busy)
        { 
            sleep -Milliseconds 100 
        }

        # NEED outerHTML of new page. CURRENTLY it is working for some.

        $ie.Document.body.outerHTML > d:\med$control.txt
    }
}

$controls = "Sporanox"

getStringMatch