How can I get a specific interwiki link of a category with my program?
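This is my program. It fetches all of the interwiki links (the result contains many li tags), but I only want the li tag for one specific language, as shown below.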
<li class="interlanguage-link interwiki-ta"> ...title= </li>
How do I get the data that follows title= in that specific tag?
How can I complete my code, shown below?
Command: python3 get-tamiwiki-link-from-englishwiki.py
from bs4 import BeautifulSoup
import requests
url = 'https://en.wikipedia.org/wiki/Category:proprietary software'
content = requests.get(url).content
soup = BeautifulSoup(content,'lxml')
# Get the whole interlanguage section (all of the li tags)
interwikihead = soup.find(id='p-lang')
print(interwikihead)
#print(interwikihead.text)
from bs4 import BeautifulSoup
import requests

# Target element: <li class="interlanguage-link interwiki-ta">
url = 'https://en.wikipedia.org/wiki/Category:proprietary software'
content = requests.get(url).content
soup = BeautifulSoup(content, 'lxml')

# Find only the li tag for the Tamil interwiki link
interwikihead = soup.find('li', class_="interlanguage-link interwiki-ta")
if interwikihead is None or interwikihead.a is None:
    print('title not found')
else:
    print(interwikihead.text)
    # The title attribute sits on the a tag inside the li
    print(interwikihead.a.get('title'))
Output:
தமிழ்
பகுப்பு:தனியுடைமை மென்பொருட்கள் – Tamil
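The same lookup can be wrapped in a small helper that takes the language code as a parameter. The sketch below is only an illustration: the function name interwiki_title and the sample markup are made up for this example, and it parses a hand-written HTML string with the built-in html.parser backend instead of fetching the live page. It also matches on the single class interwiki-<lang> rather than the full class string, on the assumption that the li tag may carry extra classes or list them in a different order.

```python
from bs4 import BeautifulSoup


def interwiki_title(html, lang):
    """Return the title attribute of the interlanguage link for `lang`, or None."""
    soup = BeautifulSoup(html, "html.parser")
    # A single class name matches any <li> that carries that class,
    # whatever other classes appear next to it.
    item = soup.find("li", class_=f"interwiki-{lang}")
    if item is None or item.a is None:
        return None
    return item.a.get("title")


# Hand-written sample markup (an assumption, modelled on the snippet in the question).
sample = """
<div id="p-lang"><ul>
<li class="interlanguage-link interwiki-de extra"><a href="#" title="Kategorie:Beispiel – German">Deutsch</a></li>
<li class="interlanguage-link interwiki-ta extra"><a href="#" title="பகுப்பு:தனியுடைமை மென்பொருட்கள் – Tamil">தமிழ்</a></li>
</ul></div>
"""

print(interwiki_title(sample, "ta"))  # title of the Tamil link
print(interwiki_title(sample, "fr"))  # None, no French link in the sample
```

To use it on the real page, pass requests.get(url).content as the html argument, as in the code above.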