在当前不可调试的 GPU 上创建了 CUDA 上下文
A CUDA context was created on a GPU that is not currently debuggable
当我启动 cuda 调试时,Nsight return 这个错误:
A CUDA context was created on a GPU that is not currently debuggable.
Breakpoints will be disabled.
Adapter: GeForce GT 720M
这是我的系统和CUDA信息。
请注意,已安装最新版本的 CUDA 和 Nsight。
我搜索了这个问题,但找不到我的答案。
非常感谢。
Report Information
UnixTime Generated 1490538033
OS Information
Computer Name DESKTOP-OLFM6NT
NetBIOS Name DESKTOP-OLFM6NT
OS Name Windows 10 Pro
GetVersionEx
dwMajorVersion 10
dwMinorVersion 0
dwBuildNumber 14393
dwPlatformId 2
wServicePackMajor 0
wServicePackMinor 0
wSuiteMask 256
wProductType Workstation
GetProductInfo 48
GetNativeSystemInfo
wProcessorArchitecture x64
dwPageSize 4096
lpMinimumApplicationAddress 65536
lpMaximumApplicationAddress 140737488289791
dwActiveProcessorMask 15
dwNumberOfProcessors 4
dwAllocationGranularity 65536
wProcessorLevel 6
wProcessorRevision 17665
EnumDisplayDevices
Display Device
DeviceName \.\DISPLAY1
DeviceString Intel(R) HD Graphics Family
StateFlags 5
DeviceID PCI\VEN_8086&DEV_0A16&SUBSYS_397817AA&REV_09
DeviceKey \Registry\Machine\System\CurrentControlSet\Control\Video\{A9611CC2-95E1-4DAE-9937-60210AFEDCE0}[=10=]00
Monitor
DeviceName \.\DISPLAY1\Monitor0
DeviceString Generic PnP Monitor
StateFlags 3
DeviceID MONITOR\CMN15B6\{4d36e96e-e325-11ce-bfc1-08002be10318}[=10=]03
DeviceKey \Registry\Machine\System\CurrentControlSet\Control\Class\{4d36e96e-e325-11ce-bfc1-08002be10318}[=10=]03
Display Device
DeviceName \.\DISPLAY2
DeviceString Intel(R) HD Graphics Family
StateFlags 1
DeviceID PCI\VEN_8086&DEV_0A16&SUBSYS_397817AA&REV_09
DeviceKey \Registry\Machine\System\CurrentControlSet\Control\Video\{A9611CC2-95E1-4DAE-9937-60210AFEDCE0}[=10=]01
Monitor
DeviceName \.\DISPLAY2\Monitor0
DeviceString Generic PnP Monitor
StateFlags 3
DeviceID MONITOR\SAM04FD\{4d36e96e-e325-11ce-bfc1-08002be10318}[=10=]04
DeviceKey \Registry\Machine\System\CurrentControlSet\Control\Class\{4d36e96e-e325-11ce-bfc1-08002be10318}[=10=]04
Display Device
DeviceName \.\DISPLAY3
DeviceString Intel(R) HD Graphics Family
StateFlags 0
DeviceID PCI\VEN_8086&DEV_0A16&SUBSYS_397817AA&REV_09
DeviceKey \Registry\Machine\System\CurrentControlSet\Control\Video\{A9611CC2-95E1-4DAE-9937-60210AFEDCE0}[=10=]02
GlobalMemoryStatusEx
dwMemoryLoad 34
ullTotalPhys 8486227968
ullAvailPhys 5588660224
ullTotalPageFile 13854937088
ullAvailPageFile 10756182016
ullTotalVirtual 140737488224256
ullAvailVirtual 140737442308096
Processor Information
0
Name Intel(R) Core(TM) i7-4500U CPU @ 1.80GHz
Clock speed (MHz) 2394
1
Name Intel(R) Core(TM) i7-4500U CPU @ 1.80GHz
Clock speed (MHz) 2394
2
Name Intel(R) Core(TM) i7-4500U CPU @ 1.80GHz
Clock speed (MHz) 2394
3
Name Intel(R) Core(TM) i7-4500U CPU @ 1.80GHz
Clock speed (MHz) 2394
NvAPI
IsMSHybridGraphics True
DisplayDriverVersion
Driver Version 37609
Changelist 0
BuildBranchString r376_06
Default AdapterString GeForce GT 720M
DisplayDriverCompileType Release
NvDebugApi
WDDM Devices
GPU
Name GeForce GT 720M
Architecture Fermi
Architecture Number 208
Architecture Implementation 7
Architecture Revision 162
Number of GPCs 1
Number of TPCs 2
Number of SMs 2
Warps per SM 48
Lanes per warp 32
Register file size 32768
Max CTAs per SM 8
Max size of shared memory per CTA (bytes) 49152
SM Revision 131073
Number of FB PAs 6
Number of LTs per LTC 2
RmGpuId 1024
RM Devices
CUDA
CUDA Device
Name GeForce GT 720M
Driver WDDM
DeviceIndex 0
GPU Family GF117
RmGpuId 1024
Compute Major 2
Compute Minor 1
MAX_THREADS_PER_BLOCK 1024
MAX_BLOCK_DIM_X 1024
MAX_BLOCK_DIM_Y 1024
MAX_BLOCK_DIM_Z 64
MAX_GRID_DIM_X 65535
MAX_GRID_DIM_Y 65535
MAX_GRID_DIM_Z 65535
MAX_SHARED_MEMORY_PER_BLOCK 49152
TOTAL_CONSTANT_MEMORY 65536
WARP_SIZE 32
MAX_PITCH 2147483647
MAX_REGISTERS_PER_BLOCK 32768
CLOCK_RATE 1550000
TEXTURE_ALIGNMENT 512
GPU_OVERLAP 1
MULTIPROCESSOR_COUNT 2
KERNEL_EXEC_TIMEOUT 0
INTEGRATED 0
CAN_MAP_HOST_MEMORY 1
COMPUTE_MODE 0
MAXIMUM_TEXTURE1D_WIDTH 65536
MAXIMUM_TEXTURE2D_WIDTH 65536
MAXIMUM_TEXTURE2D_HEIGHT 65535
MAXIMUM_TEXTURE3D_WIDTH 2048
MAXIMUM_TEXTURE3D_HEIGHT 2048
MAXIMUM_TEXTURE3D_DEPTH 2048
MAXIMUM_TEXTURE2D_LAYERED_WIDTH 16384
MAXIMUM_TEXTURE2D_LAYERED_HEIGHT 16384
MAXIMUM_TEXTURE2D_LAYERED_LAYERS 2048
SURFACE_ALIGNMENT 512
CONCURRENT_KERNELS 1
ECC_ENABLED 0
PCI_BUS_ID 4
PCI_DEVICE_ID 0
TCC_DRIVER 0
MEMORY_CLOCK_RATE 900000
GLOBAL_MEMORY_BUS_WIDTH 64
L2_CACHE_SIZE 131072
MAX_THREADS_PER_MULTIPROCESSOR 1536
ASYNC_ENGINE_COUNT 1
UNIFIED_ADDRESSING 1
MAXIMUM_TEXTURE1D_LAYERED_WIDTH 16384
MAXIMUM_TEXTURE1D_LAYERED_LAYERS 2048
CAN_TEX2D_GATHER 1
MAXIMUM_TEXTURE2D_GATHER_WIDTH 16384
MAXIMUM_TEXTURE2D_GATHER_HEIGHT 16384
MAXIMUM_TEXTURE3D_WIDTH_ALTERNATE 0
MAXIMUM_TEXTURE3D_HEIGHT_ALTERNATE 0
MAXIMUM_TEXTURE3D_DEPTH_ALTERNATE 0
PCI_DOMAIN_ID 0
TEXTURE_PITCH_ALIGNMENT 32
MAXIMUM_TEXTURECUBEMAP_WIDTH 16384
MAXIMUM_TEXTURECUBEMAP_LAYERED_WIDTH 16384
MAXIMUM_TEXTURECUBEMAP_LAYERED_LAYERS 2046
MAXIMUM_SURFACE1D_WIDTH 65536
MAXIMUM_SURFACE2D_WIDTH 65536
MAXIMUM_SURFACE2D_HEIGHT 32768
MAXIMUM_SURFACE3D_WIDTH 65536
MAXIMUM_SURFACE3D_HEIGHT 32768
MAXIMUM_SURFACE3D_DEPTH 2048
MAXIMUM_SURFACE1D_LAYERED_WIDTH 65536
MAXIMUM_SURFACE1D_LAYERED_LAYERS 2048
MAXIMUM_SURFACE2D_LAYERED_WIDTH 65536
MAXIMUM_SURFACE2D_LAYERED_HEIGHT 32768
MAXIMUM_SURFACE2D_LAYERED_LAYERS 2048
MAXIMUM_SURFACECUBEMAP_WIDTH 32768
MAXIMUM_SURFACECUBEMAP_LAYERED_WIDTH 32768
MAXIMUM_SURFACECUBEMAP_LAYERED_LAYERS 2046
MAXIMUM_TEXTURE1D_LINEAR_WIDTH 134217728
MAXIMUM_TEXTURE2D_LINEAR_WIDTH 65000
MAXIMUM_TEXTURE2D_LINEAR_HEIGHT 65000
MAXIMUM_TEXTURE2D_LINEAR_PITCH 1048544
MAXIMUM_TEXTURE2D_MIPMAPPED_WIDTH 16384
MAXIMUM_TEXTURE2D_MIPMAPPED_HEIGHT 16384
MAXIMUM_TEXTURE1D_MIPMAPPED_WIDTH 16384
STREAM_PRIORITIES_SUPPORTED 0
GLOBAL_L1_CACHE_SUPPORTED 1
LOCAL_L1_CACHE_SUPPORTED 1
MAX_SHARED_MEMORY_PER_MULTIPROCESSOR 49152
MAX_REGISTERS_PER_MULTIPROCESSOR 32768
MANAGED_MEMORY 0
MULTI_GPU_BOARD 0
MULTI_GPU_BOARD_GROUP_ID 0
HOST_NATIVE_ATOMIC_SUPPORTED 0
SINGLE_TO_DOUBLE_PRECISION_PERF_RATIO 12
PAGEABLE_MEMORY_ACCESS 0
CONCURRENT_MANAGED_ACCESS 0
COMPUTE_PREEMPTION_SUPPORTED 0
CAN_USE_HOST_POINTER_FOR_REGISTERED_MEM 0
DISPLAY_NAME GeForce GT 720M
COMPUTE_CAPABILITY_MAJOR 2
COMPUTE_CAPABILITY_MINOR 1
TOTAL_MEMORY 2147483648
RAM_TYPE 7
RAM_LOCATION 1
GPU_PCI_DEVICE_ID 289411294
GPU_PCI_SUB_SYSTEM_ID 939530154
GPU_PCI_REVISION_ID 161
GPU_PCI_EXT_DEVICE_ID 4416
GPU_PCI_EXT_GEN 1
GPU_PCI_EXT_GPU_GEN 1
GPU_PCI_EXT_GPU_LINK_RATE 5000
GPU_PCI_EXT_GPU_LINK_WIDTH 8
GPU_PCI_EXT_DOWNSTREAM_LINK_RATE 5000
GPU_PCI_EXT_DOWNSTREAM_LINK_WIDTH 4
您的 GT 720m 是计算能力为 2.1 的设备(参见 here)。
尝试在同样支持(托管)显示器的 GPU 上调试 CUDA 代码(例如设置断点)需要计算能力为 3.5 或更高的设备,以支持 preemption。
您的设备不符合该要求,因此由于您的 GPU 正在托管您的笔记本电脑显示器,因此不能用于在 CUDA 代码中设置断点。
另请注意,最新版本的 Nsight VSE(此时为 5.2)具有 officially dropped support 用于 Fermi GPU(您的是 Fermi GPU):
Note: Fermi family GPUs, and older families, are no longer supported with Nsight™ Visual Studio Edition 5.2 or better.
也许尝试回退到旧版本的 Nsight。
我最近 运行 遇到了这个问题,安装旧版本很有帮助。
当我启动 cuda 调试时,Nsight return 这个错误:
A CUDA context was created on a GPU that is not currently debuggable. Breakpoints will be disabled.
Adapter: GeForce GT 720M
这是我的系统和CUDA信息。
请注意,已安装最新版本的 CUDA 和 Nsight。
我搜索了这个问题,但找不到我的答案。 非常感谢。
Report Information UnixTime Generated 1490538033 OS Information Computer Name DESKTOP-OLFM6NT NetBIOS Name DESKTOP-OLFM6NT OS Name Windows 10 Pro GetVersionEx dwMajorVersion 10 dwMinorVersion 0 dwBuildNumber 14393 dwPlatformId 2 wServicePackMajor 0 wServicePackMinor 0 wSuiteMask 256 wProductType Workstation GetProductInfo 48 GetNativeSystemInfo wProcessorArchitecture x64 dwPageSize 4096 lpMinimumApplicationAddress 65536 lpMaximumApplicationAddress 140737488289791 dwActiveProcessorMask 15 dwNumberOfProcessors 4 dwAllocationGranularity 65536 wProcessorLevel 6 wProcessorRevision 17665 EnumDisplayDevices Display Device DeviceName \.\DISPLAY1 DeviceString Intel(R) HD Graphics Family StateFlags 5 DeviceID PCI\VEN_8086&DEV_0A16&SUBSYS_397817AA&REV_09 DeviceKey \Registry\Machine\System\CurrentControlSet\Control\Video\{A9611CC2-95E1-4DAE-9937-60210AFEDCE0}[=10=]00 Monitor DeviceName \.\DISPLAY1\Monitor0 DeviceString Generic PnP Monitor StateFlags 3 DeviceID MONITOR\CMN15B6\{4d36e96e-e325-11ce-bfc1-08002be10318}[=10=]03 DeviceKey \Registry\Machine\System\CurrentControlSet\Control\Class\{4d36e96e-e325-11ce-bfc1-08002be10318}[=10=]03 Display Device DeviceName \.\DISPLAY2 DeviceString Intel(R) HD Graphics Family StateFlags 1 DeviceID PCI\VEN_8086&DEV_0A16&SUBSYS_397817AA&REV_09 DeviceKey \Registry\Machine\System\CurrentControlSet\Control\Video\{A9611CC2-95E1-4DAE-9937-60210AFEDCE0}[=10=]01 Monitor DeviceName \.\DISPLAY2\Monitor0 DeviceString Generic PnP Monitor StateFlags 3 DeviceID MONITOR\SAM04FD\{4d36e96e-e325-11ce-bfc1-08002be10318}[=10=]04 DeviceKey \Registry\Machine\System\CurrentControlSet\Control\Class\{4d36e96e-e325-11ce-bfc1-08002be10318}[=10=]04 Display Device DeviceName \.\DISPLAY3 DeviceString Intel(R) HD Graphics Family StateFlags 0 DeviceID PCI\VEN_8086&DEV_0A16&SUBSYS_397817AA&REV_09 DeviceKey \Registry\Machine\System\CurrentControlSet\Control\Video\{A9611CC2-95E1-4DAE-9937-60210AFEDCE0}[=10=]02 GlobalMemoryStatusEx dwMemoryLoad 34 ullTotalPhys 8486227968 ullAvailPhys 5588660224 ullTotalPageFile 13854937088 ullAvailPageFile 10756182016 ullTotalVirtual 140737488224256 ullAvailVirtual 140737442308096 Processor Information 0 Name Intel(R) Core(TM) i7-4500U CPU @ 1.80GHz Clock speed (MHz) 2394 1 Name Intel(R) Core(TM) i7-4500U CPU @ 1.80GHz Clock speed (MHz) 2394 2 Name Intel(R) Core(TM) i7-4500U CPU @ 1.80GHz Clock speed (MHz) 2394 3 Name Intel(R) Core(TM) i7-4500U CPU @ 1.80GHz Clock speed (MHz) 2394 NvAPI IsMSHybridGraphics True DisplayDriverVersion Driver Version 37609 Changelist 0 BuildBranchString r376_06 Default AdapterString GeForce GT 720M DisplayDriverCompileType Release NvDebugApi WDDM Devices GPU Name GeForce GT 720M Architecture Fermi Architecture Number 208 Architecture Implementation 7 Architecture Revision 162 Number of GPCs 1 Number of TPCs 2 Number of SMs 2 Warps per SM 48 Lanes per warp 32 Register file size 32768 Max CTAs per SM 8 Max size of shared memory per CTA (bytes) 49152 SM Revision 131073 Number of FB PAs 6 Number of LTs per LTC 2 RmGpuId 1024 RM Devices CUDA CUDA Device Name GeForce GT 720M Driver WDDM DeviceIndex 0 GPU Family GF117 RmGpuId 1024 Compute Major 2 Compute Minor 1 MAX_THREADS_PER_BLOCK 1024 MAX_BLOCK_DIM_X 1024 MAX_BLOCK_DIM_Y 1024 MAX_BLOCK_DIM_Z 64 MAX_GRID_DIM_X 65535 MAX_GRID_DIM_Y 65535 MAX_GRID_DIM_Z 65535 MAX_SHARED_MEMORY_PER_BLOCK 49152 TOTAL_CONSTANT_MEMORY 65536 WARP_SIZE 32 MAX_PITCH 2147483647 MAX_REGISTERS_PER_BLOCK 32768 CLOCK_RATE 1550000 TEXTURE_ALIGNMENT 512 GPU_OVERLAP 1 MULTIPROCESSOR_COUNT 2 KERNEL_EXEC_TIMEOUT 0 INTEGRATED 0 CAN_MAP_HOST_MEMORY 1 COMPUTE_MODE 0 MAXIMUM_TEXTURE1D_WIDTH 65536 MAXIMUM_TEXTURE2D_WIDTH 65536 MAXIMUM_TEXTURE2D_HEIGHT 65535 MAXIMUM_TEXTURE3D_WIDTH 2048 MAXIMUM_TEXTURE3D_HEIGHT 2048 MAXIMUM_TEXTURE3D_DEPTH 2048 MAXIMUM_TEXTURE2D_LAYERED_WIDTH 16384 MAXIMUM_TEXTURE2D_LAYERED_HEIGHT 16384 MAXIMUM_TEXTURE2D_LAYERED_LAYERS 2048 SURFACE_ALIGNMENT 512 CONCURRENT_KERNELS 1 ECC_ENABLED 0 PCI_BUS_ID 4 PCI_DEVICE_ID 0 TCC_DRIVER 0 MEMORY_CLOCK_RATE 900000 GLOBAL_MEMORY_BUS_WIDTH 64 L2_CACHE_SIZE 131072 MAX_THREADS_PER_MULTIPROCESSOR 1536 ASYNC_ENGINE_COUNT 1 UNIFIED_ADDRESSING 1 MAXIMUM_TEXTURE1D_LAYERED_WIDTH 16384 MAXIMUM_TEXTURE1D_LAYERED_LAYERS 2048 CAN_TEX2D_GATHER 1 MAXIMUM_TEXTURE2D_GATHER_WIDTH 16384 MAXIMUM_TEXTURE2D_GATHER_HEIGHT 16384 MAXIMUM_TEXTURE3D_WIDTH_ALTERNATE 0 MAXIMUM_TEXTURE3D_HEIGHT_ALTERNATE 0 MAXIMUM_TEXTURE3D_DEPTH_ALTERNATE 0 PCI_DOMAIN_ID 0 TEXTURE_PITCH_ALIGNMENT 32 MAXIMUM_TEXTURECUBEMAP_WIDTH 16384 MAXIMUM_TEXTURECUBEMAP_LAYERED_WIDTH 16384 MAXIMUM_TEXTURECUBEMAP_LAYERED_LAYERS 2046 MAXIMUM_SURFACE1D_WIDTH 65536 MAXIMUM_SURFACE2D_WIDTH 65536 MAXIMUM_SURFACE2D_HEIGHT 32768 MAXIMUM_SURFACE3D_WIDTH 65536 MAXIMUM_SURFACE3D_HEIGHT 32768 MAXIMUM_SURFACE3D_DEPTH 2048 MAXIMUM_SURFACE1D_LAYERED_WIDTH 65536 MAXIMUM_SURFACE1D_LAYERED_LAYERS 2048 MAXIMUM_SURFACE2D_LAYERED_WIDTH 65536 MAXIMUM_SURFACE2D_LAYERED_HEIGHT 32768 MAXIMUM_SURFACE2D_LAYERED_LAYERS 2048 MAXIMUM_SURFACECUBEMAP_WIDTH 32768 MAXIMUM_SURFACECUBEMAP_LAYERED_WIDTH 32768 MAXIMUM_SURFACECUBEMAP_LAYERED_LAYERS 2046 MAXIMUM_TEXTURE1D_LINEAR_WIDTH 134217728 MAXIMUM_TEXTURE2D_LINEAR_WIDTH 65000 MAXIMUM_TEXTURE2D_LINEAR_HEIGHT 65000 MAXIMUM_TEXTURE2D_LINEAR_PITCH 1048544 MAXIMUM_TEXTURE2D_MIPMAPPED_WIDTH 16384 MAXIMUM_TEXTURE2D_MIPMAPPED_HEIGHT 16384 MAXIMUM_TEXTURE1D_MIPMAPPED_WIDTH 16384 STREAM_PRIORITIES_SUPPORTED 0 GLOBAL_L1_CACHE_SUPPORTED 1 LOCAL_L1_CACHE_SUPPORTED 1 MAX_SHARED_MEMORY_PER_MULTIPROCESSOR 49152 MAX_REGISTERS_PER_MULTIPROCESSOR 32768 MANAGED_MEMORY 0 MULTI_GPU_BOARD 0 MULTI_GPU_BOARD_GROUP_ID 0 HOST_NATIVE_ATOMIC_SUPPORTED 0 SINGLE_TO_DOUBLE_PRECISION_PERF_RATIO 12 PAGEABLE_MEMORY_ACCESS 0 CONCURRENT_MANAGED_ACCESS 0 COMPUTE_PREEMPTION_SUPPORTED 0 CAN_USE_HOST_POINTER_FOR_REGISTERED_MEM 0 DISPLAY_NAME GeForce GT 720M COMPUTE_CAPABILITY_MAJOR 2 COMPUTE_CAPABILITY_MINOR 1 TOTAL_MEMORY 2147483648 RAM_TYPE 7 RAM_LOCATION 1 GPU_PCI_DEVICE_ID 289411294 GPU_PCI_SUB_SYSTEM_ID 939530154 GPU_PCI_REVISION_ID 161 GPU_PCI_EXT_DEVICE_ID 4416 GPU_PCI_EXT_GEN 1 GPU_PCI_EXT_GPU_GEN 1 GPU_PCI_EXT_GPU_LINK_RATE 5000 GPU_PCI_EXT_GPU_LINK_WIDTH 8 GPU_PCI_EXT_DOWNSTREAM_LINK_RATE 5000 GPU_PCI_EXT_DOWNSTREAM_LINK_WIDTH 4
您的 GT 720m 是计算能力为 2.1 的设备(参见 here)。
尝试在同样支持(托管)显示器的 GPU 上调试 CUDA 代码(例如设置断点)需要计算能力为 3.5 或更高的设备,以支持 preemption。
您的设备不符合该要求,因此由于您的 GPU 正在托管您的笔记本电脑显示器,因此不能用于在 CUDA 代码中设置断点。
另请注意,最新版本的 Nsight VSE(此时为 5.2)具有 officially dropped support 用于 Fermi GPU(您的是 Fermi GPU):
Note: Fermi family GPUs, and older families, are no longer supported with Nsight™ Visual Studio Edition 5.2 or better.
也许尝试回退到旧版本的 Nsight。
我最近 运行 遇到了这个问题,安装旧版本很有帮助。