My project involves a lot of small GF(2^8) field multiplications with a lot of XORing between them. Besides making it easier to implement, I'm wondering whether using lookup tables for the multiplications would help efficiency.
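To make the comparison concrete, here is a minimal CPU-side sketch in C of the two ways I could do a single multiplication: a bitwise shift-and-XOR loop, and a log/exp table lookup. The reduction polynomial 0x11B, the generator 0x03, and all function and table names are assumptions for illustration only, not necessarily what my project uses.

```c
#include <stdint.h>

/* Assumption: reduction polynomial x^8 + x^4 + x^3 + x + 1 (0x11B). */
static uint8_t gf_mul_bitwise(uint8_t a, uint8_t b) {
    uint8_t p = 0;
    for (int i = 0; i < 8; i++) {
        if (b & 1) p ^= a;            /* add a if the low bit of b is set */
        uint8_t hi = a & 0x80;        /* will the shift overflow 8 bits?  */
        a = (uint8_t)(a << 1);        /* multiply a by x                  */
        if (hi) a ^= 0x1B;            /* reduce modulo the polynomial     */
        b >>= 1;
    }
    return p;
}

/* Hypothetical tables: exp is doubled so the index sum needs no modulo. */
static uint8_t gf_exp[512];
static uint8_t gf_log[256];

/* Fill the tables using 0x03, assumed to generate the multiplicative group. */
static void gf_init_tables(void) {
    uint8_t x = 1;
    for (int i = 0; i < 255; i++) {
        gf_exp[i] = x;
        gf_log[x] = (uint8_t)i;
        x = gf_mul_bitwise(x, 0x03);
    }
    for (int i = 255; i < 512; i++) gf_exp[i] = gf_exp[i - 255];
}

/* Table version: two log lookups, one add, one exp lookup. */
static uint8_t gf_mul_table(uint8_t a, uint8_t b) {
    if (a == 0 || b == 0) return 0;   /* log of 0 is undefined */
    return gf_exp[gf_log[a] + gf_log[b]];
}
```

The table version trades the eight-iteration loop for three memory reads, so whether it wins presumably depends on how cheap those reads are on the target hardware.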
Would it be possible to use some OpenGL calls to move these fairly big lookup tables (about nine 8-bit arrays of 256 values each) into a "read only" section? Since every block would be using these tables, could it still be efficiently parallel?
I have a hard time judging when lookup tables are a good idea. Is there any general advice? I didn't find much information about this in the manual.