I need help with my CUDA project regarding implementation of AES algorithm. Now I am becoming depressed because of it.
Project is done, it is working properly but not as planned and I don’t know how to fix it. Problem is in time spent for execution. If someone could help I would be very grateful and I am even willing to pay for help because this is very important for me.
Let me explain more detailed.
There are two implementations, one for CPU and one for GPU. Both of them are working correctly but time spent on encryption in GPU implementation is taking too long. And I don’t know where is the problem.
Here is LINK to project
I would like to to know if my implementation is good. Does it meet rules and criteria for good parallel execution? What needs to be changed for faster execution, which memory to use for better performance etc. Because this is my first project in CUDA.
In both implementation there are files 1m.txt 2m.txt … 64m.txt with 1 million, 2 millions … 64 millions characters for encryption/decryption.
File 1m_info , 2m_info are files with informations about number of characters, time spent etc and files encrypted_text/decrypted_text are for verification on encryption/decryption.
Please help me, I am struggling for 3 months with this.
For any more information I am here to provide.