TensorRT 3.0: concat result is wrong when batch size > 1

GPU is a P4, batch size = 2, concat layer with axis = 1. The result for the second item in the batch is wrong, like this:
[A][A] + [B][B] -> concat -> [A][B][A][ERROR]
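
For reference, here is a minimal NumPy sketch of the output expected from the concat, which can be compared against the engine output one batch item at a time. It assumes NCHW tensors with axis 1 as the channel axis; the shapes, the fill values, and the helper names `reference_concat` and `first_bad_sample` are hypothetical and not taken from the actual network.

```python
import numpy as np

def reference_concat(a, b):
    """Concatenate two NCHW batches along axis 1 (assumed to be channels)."""
    return np.concatenate([a, b], axis=1)

def first_bad_sample(expected, actual, tol=1e-5):
    """Return the index of the first batch item that differs, or None."""
    for n in range(expected.shape[0]):
        if not np.allclose(expected[n], actual[n], atol=tol):
            return n
    return None

# Hypothetical shapes: batch of 2, one channel per input, 2x2 spatial.
N, C, H, W = 2, 1, 2, 2
a = np.full((N, C, H, W), 1.0, dtype=np.float32)  # stands in for [A]
b = np.full((N, C, H, W), 2.0, dtype=np.float32)  # stands in for [B]

# Flattened, the expected layout is [A][B][A][B].
expected = reference_concat(a, b)

# Simulate the reported symptom: the last block of the second item is wrong.
simulated = expected.copy()
simulated[1, 1] = -1.0
print(first_bad_sample(expected, simulated))
```

In a real check, `simulated` would be replaced by the output buffer copied back from the GPU and reshaped to the same (N, C, H, W) shape.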

We created a new “Deep Learning Training and Inference” section in Devtalk to improve the experience for deep learning, accelerated computing, and HPC users:
https://devtalk.nvidia.com/default/board/301/deep-learning-training-and-inference-/

We are moving active deep learning threads to the new section.

URLs for topics will not change with the re-categorization, so your bookmarks and links will continue to work as before.

-Siddharth

Please file a bug here: https://developer.nvidia.com/nvidia-developer-program
Please include the steps/files used to reproduce the problem along with the output of infer_device.