I would try to run it without bumblebee. I think it will be a little bit faster and you do not need it for cuda.
Dear Moises,
Thank you for your response.
The data has the following parameters:
dim1 256
dim2 256
dim3 27
dim4 26
My gpu is: NVIDIA GeForce 710M
Kázmér