adaboost on CUDA

Hi all,
Has anyone tried adaboost algorithm for face detection on CUDA? I am currently working on it but I cannot get good performance by using CUDA. If someone has some idea about it I want to know how to handle the shared memory for Adaboost