I’m choosing a topic for my thesis. Please let me know how feasible and practical you think it would be to do the following with CUDA on a GPU.

I want to implement 10,000 cores, each of them calculating, in parallel and independently, an equation like this (for example): A = B^2 + 1 / C, where A, B and C are always positive real numbers such as 15.38 or 0.459988.
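
As a rough illustration of the per-core calculation described above, here is a minimal serial sketch in plain C (not CUDA). The function names core_step and compute_all are hypothetical, single-precision floats are an assumption, and the formula is read with ordinary operator precedence, i.e. A = (B^2) + (1 / C), which the post does not state explicitly.

```c
#include <stddef.h>

/* One "core": A = B^2 + 1/C.
   Assumption: normal operator precedence, so 1/C is evaluated first. */
float core_step(float b, float c)
{
    return b * b + 1.0f / c;
}

/* Serial reference over n independent elements.
   No iteration depends on another, so each one could in principle
   be handed to its own GPU thread. */
void compute_all(const float *b, const float *c, float *a, size_t n)
{
    for (size_t i = 0; i < n; ++i)
        a[i] = core_step(b[i], c[i]);
}
```

Because every element is computed without reference to any other, this step is the kind of data-parallel work that maps naturally onto one thread per element.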

When all 10,000 cores finish, they communicate with each other over a simple “torus” network to summarize their results, and then start again.
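
To make the torus step more concrete, here is a minimal serial sketch in plain C (not CUDA). It rests on several assumptions that the post does not specify: the cores are arranged as a 2D grid (for example 100 x 100 for 10,000 cores), each core talks to its four nearest neighbours with wraparound at the edges, and "summarize" means adding the neighbours' results to its own. The names wrap and torus_sum are hypothetical.

```c
/* Map any index onto [0, n), which gives the torus wraparound:
   -1 becomes n-1, and n becomes 0. */
long wrap(long i, long n)
{
    return ((i % n) + n) % n;
}

/* For each cell of a w-by-h grid stored row by row:
   out = own value + up + down + left + right, with wraparound.
   "in" and "out" must be different arrays. */
void torus_sum(const float *in, float *out, long w, long h)
{
    for (long y = 0; y < h; ++y) {
        for (long x = 0; x < w; ++x) {
            float up    = in[wrap(y - 1, h) * w + x];
            float down  = in[wrap(y + 1, h) * w + x];
            float left  = in[y * w + wrap(x - 1, w)];
            float right = in[y * w + wrap(x + 1, w)];
            out[y * w + x] = in[y * w + x] + up + down + left + right;
        }
    }
}
```

Reading from one array and writing to another means no cell sees a neighbour's half-updated value, which is the same separation a parallel version would need between the "all cores finish" point and the exchange.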

Roughly, does this make sense to do in CUDA? Using a GPU and the CUDA development software, how difficult would it be to implement? My programming skills are fair (assembly, C), but I have never worked with CUDA or GPUs, and I’m not sure about the technology’s limitations and other aspects.

I have only a few months to finish a project like this, which is why I’m asking here about its feasibility.

Finally, do you know of another forum for CUDA besides the NVIDIA website?