-
Notifications
You must be signed in to change notification settings - Fork 106
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CL_OUT_OF_RESOURCES error on Nvidia GTX 1060 copying position buffer from device #136
Comments
@jhurlbut I didn't run Sibernetic on a Windows machine yet. With Ubuntu, I think I got the same error after nvidia released and shipped CUDA 9 with the graphics driver package. After downgrading to an older version (375.82) it worked again. @a-palyanov @skhayrulin Any comments? |
@lungd you can install clinfo utility (sudo apt-get install clinfo), than after installation newest driver (with which you have a problem) you can run it and it will show you all available OpenCL devices if your NVIDIA card there than the problem is on sibernetic side if it's not than something wrong with new NVIDIA driver, also could you please give information about this driver? |
@skhayrulin @jhurlbut Do you get the same error? |
I tried installing Cuda 7.5 next to the Cuda v9 already installed. The same error occured with Cuda v9. The nvidia driver version is 385.54. Yes I get that same error "ERROR: Could not enqueue read data from buffer error code is error code is -5" which after looking up online is the CUDA error CL_OUT_OF_RESOURCES which I understand is usually an issue of trying to read a buffer of a larger size than was written into the buffer. Is there a version of clinfo for Windows? Google only comes up with linux installs. I will try 375.82 driver version. @lungd do you recall which version of CUDA comes with that driver? Thanks! |
@jhurlbut with 375.82 I get the following output:
So if you can see CUDA 8 it should work. BTW, I didn't install the cuda toolkit only the nvidia driver. |
I am experiencing a similar issue while I've been up for running the ow-0.9.5 code on Ubuntu 22.04. Platform Name NVIDIA CUDA and the following was the error log I've seen: [[ Step 112934 (total steps: unlimited, t in sim: 2.25868s) dt: 2e-05 (in s), time elapsed: 40.42 (in min) ]] Could this possibly be caused because of my hardware? My laptop has Ryzen 8645HS and 16gb RAM. |
@LucianSheen I don't thilk that changing version of drivers will help did you have any chance to check how much resources did sibernetic use before crash? Maybe some sort of memory leak... |
@skhayrulin I ran the simulation while monitoring both memory and GPU. System memory usage has been around 2.3 gb/ 16 gb while GPU memory was 298 Mb / 8188 Mb (sorry I was mistaken, my gpu has 8gb memory not 16) during the whole simulation and never spiked. If the memory isn't the problem here should I install a CUDA toolkit separately when I already installed NVIDIA driver do you think? NVIDIA-smi showed GPU 00000000:01:00.0: Detected Critical Xid Error when the crash happened. AND - I don't experience such issues with worm_crawl_half_resolution, but when I run -f worm_crawling. EDITED: installed CUDA toolkit but got stuck at the very same step of 112934. |
I am able to run the simulation on the CPU and Intel HD GPU but when running on my Nvidia GPU on the same machine I get a CL_OUT_OF_RESOURCES error when the code tries to copy the position buffer from device. This is on Windows 10.
The text was updated successfully, but these errors were encountered: