Designing a parallel process for particle to mesh scheme, need help

How is your use case different from particle-in-cell (PIC) methods? There seems to be plenty of literature available that describes PIC implementations using CUDA.