GCP AI Platform Notebook 驱动太旧?
GCP AI Platform Notebook driver too old?
我正在尝试 运行 在具有 32 个 vCPU、208 GB RAM 和 2 个 NVIDIA Tesla T4 的 GCP 人工智能平台笔记本上 Hugging Face Transformers tutorial。
然而,当我尝试 运行 部分
model = DistillBERTClass()
model.to(device)
我收到以下断言错误:
AssertionError: The NVIDIA driver on your system is too old (found version 10010).
Please update your GPU driver by downloading and installing a new
version from the URL: http://www.nvidia.com/Download/index.aspx
Alternatively, go to: https://pytorch.org to install
a PyTorch version that has been compiled with your version
of the CUDA driver.
然而,当我 运行
!nvidia-smi
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 418.87.01 Driver Version: 418.87.01 CUDA Version: 10.1 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
|===============================+======================+======================|
| 0 Tesla T4 Off | 00000000:00:04.0 Off | 0 |
| N/A 38C P0 22W / 70W | 10MiB / 15079MiB | 0% Default |
+-------------------------------+----------------------+----------------------+
| 1 Tesla T4 Off | 00000000:00:05.0 Off | 0 |
| N/A 39C P8 10W / 70W | 10MiB / 15079MiB | 0% Default |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes: GPU Memory |
| GPU PID Type Process name Usage |
|=============================================================================|
| No running processes found |
NVIDIA驱动上的版本与我使用的最新PyTorch版本兼容。
有没有其他人 运行 遇到这个错误,有没有办法解决它?
您可以:
请下载并安装新的 GPU 驱动程序以更新您的 GPU 驱动程序
来自 URL 的版本:http://www.nvidia.com/Download/index.aspx
或者,转到:https://pytorch.org 安装 PyTorch
使用您的 CUDA 驱动程序版本编译的版本。
您可以尝试更新的NVIDIA驱动版本,我们支持最新的CUDA 11驱动版本,然后在其上安装Pytorch:
gcloud beta notebooks instances create cuda11 \
--vm-image-project=deeplearning-platform-release \
--vm-image-family=common-cu110-notebooks-debian-9 \
--machine-type=n1-standard-1 \
--location=us-west1-a \
--format=json
图像系列:
- 普通-cu110-notebooks-debian-9
- 普通-cu110-notebooks-debian-10
我正在尝试 运行 在具有 32 个 vCPU、208 GB RAM 和 2 个 NVIDIA Tesla T4 的 GCP 人工智能平台笔记本上 Hugging Face Transformers tutorial。
然而,当我尝试 运行 部分
model = DistillBERTClass()
model.to(device)
我收到以下断言错误:
AssertionError: The NVIDIA driver on your system is too old (found version 10010).
Please update your GPU driver by downloading and installing a new
version from the URL: http://www.nvidia.com/Download/index.aspx
Alternatively, go to: https://pytorch.org to install
a PyTorch version that has been compiled with your version
of the CUDA driver.
然而,当我 运行 !nvidia-smi
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 418.87.01 Driver Version: 418.87.01 CUDA Version: 10.1 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
|===============================+======================+======================|
| 0 Tesla T4 Off | 00000000:00:04.0 Off | 0 |
| N/A 38C P0 22W / 70W | 10MiB / 15079MiB | 0% Default |
+-------------------------------+----------------------+----------------------+
| 1 Tesla T4 Off | 00000000:00:05.0 Off | 0 |
| N/A 39C P8 10W / 70W | 10MiB / 15079MiB | 0% Default |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes: GPU Memory |
| GPU PID Type Process name Usage |
|=============================================================================|
| No running processes found |
NVIDIA驱动上的版本与我使用的最新PyTorch版本兼容。 有没有其他人 运行 遇到这个错误,有没有办法解决它?
您可以:
请下载并安装新的 GPU 驱动程序以更新您的 GPU 驱动程序 来自 URL 的版本:http://www.nvidia.com/Download/index.aspx
或者,转到:https://pytorch.org 安装 PyTorch 使用您的 CUDA 驱动程序版本编译的版本。
您可以尝试更新的NVIDIA驱动版本,我们支持最新的CUDA 11驱动版本,然后在其上安装Pytorch:
gcloud beta notebooks instances create cuda11 \
--vm-image-project=deeplearning-platform-release \
--vm-image-family=common-cu110-notebooks-debian-9 \
--machine-type=n1-standard-1 \
--location=us-west1-a \
--format=json
图像系列:
- 普通-cu110-notebooks-debian-9
- 普通-cu110-notebooks-debian-10