Create TensorRT net error using DataType::kHALF

fujiaweigege · December 26, 2017, 6:32am

I have built a SSD net using TensorRT 3.0 in TX2 with some plugin layers such as reshape, permute and so on. When I use DataType::kHALF to create the TensorRT net, it comes the error as follows:

ERROR: Internal error: could not find any implementation for node fc6 + relu6, try increasing the workspace size with IBuilder::setMaxWorkspaceSize()
ERROR: cudnnBuilder2.cpp (452) - OutOfMemory Error in buildSingleLayer
sample_SSD: SSD.cpp:108: void caffeToGIEModel(const string&, const string&, const std::vector<std::__cxx11::basic_string<char> >&, unsigned int, nvcaffeparser1::IPluginFactory*, nvinfer1::IHostMemory**): Assertion `engine' failed.
Aborted (core dumped)

I create the TensorRT net as follows:

void caffeToGIEModel(const std::string& deployFile,					// name for caffe prototxt
					 const std::string& modelFile,					// name for model
					 const std::vector<std::string>& outputs,		// network outputs
					 unsigned int maxBatchSize,						// batch size - NB must be at least as large as the batch we want to run with)
					 nvcaffeparser1::IPluginFactory* pluginFactory,	// factory for plugin layers
					 IHostMemory **gieModelStream)					// output stream for the GIE model
{
	// create the builder
	IBuilder* builder = createInferBuilder(gLogger);

	// parse the caffe model to populate the network, then set the outputs
	INetworkDefinition* network = builder->createNetwork();
	ICaffeParser* parser = createCaffeParser();
	parser->setPluginFactory(pluginFactory);

	bool fp16 = builder->platformHasFastFp16();
	
	std::cout << "Begin parsing model..." << std::endl;
	const IBlobNameToTensor* blobNameToTensor = parser->parse(locateFile(deployFile).c_str(),
						locateFile(modelFile).c_str(),
						*network,
				                fp16 ? nvinfer1::DataType::kHALF : nvinfer1::DataType::kFLOAT);
	std::cout << "End parsing model..." << std::endl;
	// specify which tensors are outputs
	for (auto& s : outputs)
		network->markOutput(*blobNameToTensor->find(s.c_str()));

	// Build the engine
	builder->setMaxBatchSize(maxBatchSize);
	builder->setMaxWorkspaceSize(10 << 20);	// we need about 6MB of scratch space for the plugin layer for batch size 5
	builder->setHalf2Mode(fp16);
	ICudaEngine* engine = builder->buildCudaEngine(*network);
	assert(engine);	
	std::cout << "End building engine..." << std::endl;

	// we don't need the network any more, and we can destroy the parser
	network->destroy();
	parser->destroy();

	// serialize the engine, then close everything down
	(*gieModelStream) = engine->serialize();

	engine->destroy();
	builder->destroy();
	shutdownProtobufLibrary();
}

I set the setMaxWorkspaceSize lager such as “16<<20” or even lager, it also comes the same error.
When I set fp16=false, it runs successfully.
Could someone give me some suggestions? Thank you in advance!

AastaLLL · December 26, 2017, 9:19am

Hi,

This is a DeepStream for Tesla board. For TX2 issue, please file topic here:

For this issue, could you set the setMaxBatchSize() smaller and give it a try.
This may be a known issue but requiring further confirming.

Thanks.

tianfangzhang · December 28, 2017, 1:43am

Hello! are some of you able to share the code for this? I have never done CUDA or TensorRT before, so it would be really helpful.

AastaLLL · January 2, 2018, 6:44am

Hi, tianfangzhang

We provide lots of sample for CUDA/TensorRT/DeepStream.
Please check the following path for the samples you want:

CUDA:

/usr/local/cuda-9.0/bin/cuda-install-samples-9.0.sh .
cd NVIDIA_CUDA-9.0_Samples
make

TensorRT:

cp -r /usr/src/tensorrt/ .
cd tensorrt/samples/
make

DeepStream:

cd deepstream/samples/decPerf
./run.sh

Thanks and Happy New Year : )

Topic		Replies	Views
Memory Issue with Half2Mode in TensorRT 3 Jetson TX2	16	1994	October 18, 2021
TensorRT fails to build engine for network constructed using C++ API when setHalf2Mode(true) GPU-Accelerated Libraries	7	2620	March 16, 2018
Data type for TensorRT engine created from UFF model with DataType.HALF TensorRT	2	1326	May 2, 2018
TensorRT Half2 Accuracy Issue Jetson TX1	5	956	October 18, 2021
TensorRT 2.1 OutOfMemory Error in buildSingleLayer Jetson TX2	11	2827	October 18, 2021
Tensorflow 1.7 with TensorRT fails Jetson TX2	13	3994	October 18, 2021
Error loading custom model using imagenet-console from jetson-inference Jetson TX1	5	1055	October 18, 2021
Segmentation fault (core dumped) while doing Tensorrt optimization of lenet Jetson TX2	6	6357	October 18, 2021
could not find any implementation for node 2-layer MLP, try increasing the workspace size with IBuilder::setMaxWorkspaceSize() TensorRT	4	3833	October 12, 2021
[TensorRT] ERROR: Internal error: could not find any implementation for node (Unnamed Layer* 25) [Deconvolution], try increasing the workspace size with IBuilder::setMaxWorkspaceSize() TensorRT	2	4601	October 12, 2021

Create TensorRT net error using DataType::kHALF

Related topics