Why does a program run slower on a GTX 680 than on a GTX 570?

Hello, can someone tell me why a program runs slower on a GTX 680 than on a GTX 570? I am running the program with CUDA Toolkit 4.0.

There are a number of possible reasons, one being that Kepler needs more blocks to fully load the device.

Read the Kepler Tuning Guide to learn about others.
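
To put rough numbers on the block-count point, here is a back-of-envelope sketch in Python. It is only an estimate under stated assumptions: the per-multiprocessor limits are the ones listed for compute capability 2.0 (GTX 570) and 3.0 (GTX 680) in the CUDA C Programming Guide, the multiprocessor counts are 15 and 8, and register and shared-memory usage are assumed not to be the limiting factor. The names `GPUS`, `resident_blocks_per_sm` and `blocks_to_fill` are made up for this illustration.

```python
# Rough estimate of how many thread blocks can be resident at once.
# Assumption: only the resident-thread and resident-block limits matter;
# register and shared-memory limits of the real kernel are ignored.

GPUS = {
    # compute capability 2.0: 15 SMs, 1536 threads and 8 blocks per SM
    "GTX 570": {"sms": 15, "max_threads_per_sm": 1536, "max_blocks_per_sm": 8},
    # compute capability 3.0: 8 SMX, 2048 threads and 16 blocks per SMX
    "GTX 680": {"sms": 8, "max_threads_per_sm": 2048, "max_blocks_per_sm": 16},
}


def resident_blocks_per_sm(gpu, threads_per_block):
    """Blocks one multiprocessor can hold at the given block size."""
    by_threads = gpu["max_threads_per_sm"] // threads_per_block
    return min(gpu["max_blocks_per_sm"], by_threads)


def blocks_to_fill(gpu, threads_per_block):
    """Blocks needed so every multiprocessor holds as many as it can."""
    return gpu["sms"] * resident_blocks_per_sm(gpu, threads_per_block)


if __name__ == "__main__":
    for threads_per_block in (64, 128, 256, 512):
        for name, gpu in GPUS.items():
            per_sm = resident_blocks_per_sm(gpu, threads_per_block)
            total = blocks_to_fill(gpu, threads_per_block)
            print(f"{name}, {threads_per_block} threads/block: "
                  f"{per_sm} blocks per multiprocessor, {total} blocks in total")
```

Under these assumptions each multiprocessor on the GTX 680 holds more resident blocks than one on the GTX 570 at every block size tried, while the device-wide total depends on the block size. A grid sized for the GTX 570 with small blocks can therefore leave the GTX 680 partly idle. This only covers the resident-block limits; the other reasons are in the Kepler Tuning Guide.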