获取特定 NCT ID 历史记录的临床试验

Question

我需要来自站点的特定临床试验 NCT ID 的完整历史记录：https://clinicaltrials.gov/

考虑NCT id :NCT03245346

来自 link I am checking for History of Changes which is giving me all history for that NCT ID in new page.

我可以使用 HTML 解析器得到这个：

import BeautifulSoup
import requests

url = 'https://clinicaltrials.gov/ct2/archive/NCT03245346'
r=requests.get(url)
url=r.content
soup = BeautifulSoup(url, 'html.parser')

tab = soup.find("table", {"class":"ct-data_table tr-data_table tr-tableStyle"})
print(tab)

但是为了避免HTML页面格式，我想知道，有没有API可以获取特定NCT ID的完整历史记录？

Answer 1

如果你只想得到table，你可以试试pandasread_html()函数：

import pandas as pd

url = "https://clinicaltrials.gov/ct2/archive/NCT03245346"

df = pd.read_html(url)[0]

df.head()

    0                               1
0   ClinicalTrials.gov Identifier:  NCT03245346
1   Study Title:                    Effects of Epidural Anesthesia and Analgesia o...
2   First Submitted:                August 2, 2017
3   Last Update Posted:             April 24, 2018

这也适用于更多 "detailed" 概览，当您单击 继续查看 ClinicalTrials.gov 存档站点 上此研究的更改历史记录时：

url_detail = "https://clinicaltrials.gov/ct2/history/NCT03245346"

df = pd.read_html(url_detail)[0]

但是，如果您正在寻找其他东西，也许我们也可以解决这个问题。

获取特定 NCT ID 历史记录的临床试验

clinical trial for getting history for particular NCT ID

python

api

bioinformatics

medical