如何在 python 中使用 lxml 获取订单词?

how to get order word with lxml in python?

我使用 lxml 获取 html 内容中的文本,如下所示:

a = '<span class="p_show">testa1:<a title="xxxx" href="/drama/MnJXVQ==">testa2</a></span>'
b = '<span class="p_show"><a title="xxxx" href="/drama/JFxhX2Nv">testb</a>1:testb2</span>'

代码在这里:

from lxml import etree
html = etree.HTML(a) 
a1 = html.xpath('//span/text()')[0]
a2 = html.xpath('//span/a/text()')[0]
res = a1 + a2

res 是我想要的,但是对于 b 情况,我得到了相反的结果,你能帮我解决这个问题吗?谢谢大家!

from lxml import etree 
html = etree.HTML(a)  
a = html.xpath('//span//text()') 
res = ''.join(a)

这是适合我的解决方案!