cudnn_cbr_sample.cpp (24.2 KB)
I finally got this pattern to run end to end. The changes worth highlighting:
- The cross-correlation algorithm should be used;
- The first global engine requires NHWC tensor layout;
- CUDNN_ATTR_EXECUTION_PLAN_HANDLE is a required attribute in the plan descriptor.
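For the NHWC point, a minimal sketch of how the strides can be derived, assuming the usual cuDNN convention that tensor dimensions are still listed in N, C, H, W order and the layout is expressed only through the strides. The helper name `nhwc_strides` is hypothetical and is not taken from the attached sample:

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Hypothetical helper (not from the attached sample).
// Input: dimensions in N, C, H, W order.
// Output: strides, in the same N, C, H, W order, that describe a
// channels-last (NHWC) memory layout for those dimensions.
std::array<int64_t, 4> nhwc_strides(const std::array<int64_t, 4>& dims_nchw) {
    for (int64_t d : dims_nchw) {
        assert(d > 0 && "all dimensions must be positive");
    }
    const int64_t C = dims_nchw[1];
    const int64_t H = dims_nchw[2];
    const int64_t W = dims_nchw[3];
    // In NHWC memory order C is the fastest-moving index, then W, then H, then N.
    return { H * W * C,  // stride for N
             1,          // stride for C
             W * C,      // stride for H
             C };        // stride for W
}
```

For a 1x32x4x4 tensor this gives strides {512, 1, 128, 32}. With the backend API these would presumably go into CUDNN_ATTR_TENSOR_STRIDES while CUDNN_ATTR_TENSOR_DIMENSIONS keeps the N, C, H, W order; check that against the attached sample.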
@ryantong FYI.