Jetson TX1 floating-point performance

How many 16-bit and 32-bit floating-point operations can a TX1 perform per second? And which floating-point representation does it use?

About 1 TFLOP/s peak with FP16, according to the many press releases on the TX1.
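For what it's worth, here is a rough back-of-the-envelope sketch of where a figure like that could come from. The numbers are assumptions on my part, not taken from the press releases: 256 CUDA cores, a GPU clock of about 998 MHz, one fused multiply-add (2 FLOPs) per core per cycle in FP32, and a packed two-wide FP16 FMA (4 FLOPs) per core per cycle. The helper `peak_flops` is just a hypothetical name for the multiplication. The last lines show the bit patterns of 1.0, assuming the standard IEEE 754 binary16 and binary32 formats.

```python
import struct


def peak_flops(cuda_cores, clock_hz, flops_per_core_per_cycle):
    """Theoretical peak: cores x clock x FLOPs issued per core per cycle."""
    return cuda_cores * clock_hz * flops_per_core_per_cycle


# Assumed TX1 figures (not from the original post).
CUDA_CORES = 256
CLOCK_HZ = 998e6

# FP32: one FMA per core per cycle counts as 2 FLOPs (assumption).
fp32_peak = peak_flops(CUDA_CORES, CLOCK_HZ, 2)
# FP16: one packed two-wide FMA per core per cycle counts as 4 FLOPs (assumption).
fp16_peak = peak_flops(CUDA_CORES, CLOCK_HZ, 4)

print(f"FP32 peak: {fp32_peak / 1e9:.0f} GFLOP/s")
print(f"FP16 peak: {fp16_peak / 1e12:.2f} TFLOP/s")

# IEEE 754 encodings of 1.0: binary16 is 1 sign, 5 exponent, 10 mantissa bits;
# binary32 is 1 sign, 8 exponent, 23 mantissa bits.
half_bits = struct.pack(">e", 1.0).hex()
single_bits = struct.pack(">f", 1.0).hex()
print("binary16 1.0 =", half_bits)
print("binary32 1.0 =", single_bits)
```

These are theoretical peaks only. Under these assumptions FP32 comes out at half the FP16 rate, and real kernels will land below both, since the FP16 number depends on keeping the packed two-wide path busy.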