I removed the bank conflicts in my program and verified this with CUT_BANK_CHECKER, which reports 0 bank conflicts in my kernel code. But I got no performance gain after that!
Has anyone encountered this before, or does anyone have a clue what’s going on?
I would appreciate any help, I’m kind of stuck with this issue …