【发布时间】:2020-09-10 22:13:55
【问题描述】:
今天我开始使用 CUDA 和 GPU 处理。我找到了这个教程: https://www.geeksforgeeks.org/running-python-script-on-gpu/
不幸的是,我第一次尝试运行 gpu 代码失败了:
from numba import jit, cuda
import numpy as np
# to measure exec time
from timeit import default_timer as timer
# normal function to run on cpu
def func(a):
for i in range(10000000):
a[i]+= 1
# function optimized to run on gpu
@jit(target ="cuda")
def func2(a):
for i in range(10000000):
a[i]+= 1
if __name__=="__main__":
n = 10000000
a = np.ones(n, dtype = np.float64)
b = np.ones(n, dtype = np.float32)
start = timer()
func(a)
print("without GPU:", timer()-start)
start = timer()
func2(a)
print("with GPU:", timer()-start)
输出:
/home/amu/anaconda3/bin/python /home/amu/PycharmProjects/gpu_processing_base/gpu_base_1.py
without GPU: 4.89985659904778
Traceback (most recent call last):
File "/home/amu/PycharmProjects/gpu_processing_base/gpu_base_1.py", line 30, in <module>
func2(a)
File "/home/amu/anaconda3/lib/python3.7/site-packages/numba/cuda/dispatcher.py", line 40, in __call__
return self.compiled(*args, **kws)
File "/home/amu/anaconda3/lib/python3.7/site-packages/numba/cuda/compiler.py", line 758, in __call__
kernel = self.specialize(*args)
File "/home/amu/anaconda3/lib/python3.7/site-packages/numba/cuda/compiler.py", line 769, in specialize
kernel = self.compile(argtypes)
File "/home/amu/anaconda3/lib/python3.7/site-packages/numba/cuda/compiler.py", line 785, in compile
**self.targetoptions)
File "/home/amu/anaconda3/lib/python3.7/site-packages/numba/core/compiler_lock.py", line 32, in _acquire_compile_lock
return func(*args, **kwargs)
TypeError: compile_kernel() got an unexpected keyword argument 'boundscheck'
Process finished with exit code 1
我已经在pycharm的anaconda环境中安装了教程中提到的numba和cudatoolkit。
【问题讨论】:
-
您从该教程中复制的代码是错误的并且不起作用。我的建议是找到更好的教程
-
考虑改用 C/C++,按照这里的官方教程:developer.nvidia.com/how-to-cuda-c-cpp
-
总结一下——“优化为在 gpu 上运行的函数”可能应该用
@vectorize装饰器而不是@jit装饰。后者意味着您正在编写一个 CUDA 内核,在这种情况下,函数内的代码和函数调用本身都需要进行重大更改 -
@Hack06:鉴于这基本上是一个 Python 加速练习,这似乎不是特别有用或建设性的建议。
-
问题是用python标记的,代码是python,还有一个关于用numba加速python的教程的链接。它需要变得多明显?