printf behavior change on driver update

After installing the 285.05.33 driver, I noticed that printf() statements inside global functions have changed their behavior. The output now seems to be buffered until a cudaDeviceSynchronize() call, and the buffer is flushed only after that call completes.
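
To make this concrete, here is a minimal sketch of the kind of code I mean. The kernel name (hello_kernel) and the launch configuration are placeholders made up for illustration, not taken from my real program.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel, for illustration only: each thread prints one line.
__global__ void hello_kernel()
{
    printf("hello from thread %d\n", threadIdx.x);
}

int main()
{
    // Launch configuration is arbitrary (1 block of 4 threads).
    hello_kernel<<<1, 4>>>();

    // Host-side output, flushed explicitly so it is not held back.
    printf("kernel launched\n");
    fflush(stdout);

    // With the 285.05.33 driver, the device-side printf output seems to
    // show up only once this call has completed.
    cudaError_t err = cudaDeviceSynchronize();
    if (err != cudaSuccess) {
        fprintf(stderr, "CUDA error: %s\n", cudaGetErrorString(err));
        return 1;
    }
    return 0;
}
```

Something like this should build with nvcc for a device that supports printf() in kernels (compute capability 2.0 or later, as far as I know).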

Is there some way to get immediate print results (similar to using fflush(stdout))? Simply putting an fflush call inside the global function will not work, of course.