How can I access pixels in parallel with CUDA?

Hello, developers!
I am a beginner at CUDA programming, but I am trying to teach myself.
I have found that my knowledge of CUDA and graphics is still limited.
Right now I want to implement GPU image subtraction with OpenCV, using CUDA 10.0 and Visual Studio 2017, to show that parallel processing on the GPU is much faster than sequential processing on the CPU.

BUT I am really struggling with accessing the images in parallel.
I cannot figure out how to give each GPU thread its own pixel from the two images.
PLEASE check my code and tell me what is wrong, or what I should fix.
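
From the CUDA examples I have read so far, I think the kernel is supposed to work on a flat array of pixel bytes rather than on a cv::Mat, with each thread picking one (x, y) pixel from its block and thread indices. Below is my rough sketch of such a kernel (the name subtractKernel and the assumption of continuous 3-channel 8-bit images are just my guesses), but I cannot connect it to my actual code further down:

__global__ void subtractKernel(const unsigned char *a, const unsigned char *b,
                               unsigned char *out, int rows, int cols, int channels) {
	int x = threadIdx.x + blockIdx.x * blockDim.x;	// column of this thread's pixel
	int y = threadIdx.y + blockIdx.y * blockDim.y;	// row of this thread's pixel
	if (x >= cols || y >= rows) return;		// skip threads that fall outside the image
	int offset = (y * cols + x) * channels;		// flat index of the first byte of the pixel
	for (int c = 0; c < channels; ++c) {
		int diff = a[offset + c] - b[offset + c];
		out[offset + c] = diff < 0 ? 0 : diff;	// clamp at 0, like cv::subtract does for 8-bit images
	}
}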

// basic header
#include <iostream>
#include <cstdlib>
#include <ctime>

// Cuda header
#include <cuda_runtime.h>
#include <device_launch_parameters.h>

// OpenCV header
#include <opencv2/opencv.hpp>
#include <opencv2/core/cuda.hpp>
#include <opencv2/highgui/highgui.hpp>
#include <opencv2/imgproc/imgproc.hpp>
#include <opencv2/core/cuda/common.hpp>

using namespace std;
using namespace cv;

__global__ void GPUimageSubtract(Mat *img1, Mat *img2, int row, int col, int row2, int col2) {
	int x = threadIdx.x + blockIdx.x*blockDim.x;
	int y = threadIdx.y + blockIdx.y*blockDim.y;
	int offset = x + y * blockDim.x * gridDim.x;	// <-- what does this offset mean?
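	// (my guess: x is the column, y is the row, and offset flattens (x, y) into a single
	//  1D pixel index, assuming the launched grid is exactly as wide as the image -- is that right?)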

	/*
		I don't know what to put in this part.
		Please give me a hint so I can understand it.
	*/

}

int main(){
	
	Mat image1 = imread("sky.jpg");
	Mat image2 = imread("sky1.jpg");
	Mat CPU_res, GPU_res;
	if (image1.empty() || image2.empty()) {
		cout << "Cannot open the image file." << endl;
		return 0;
	}

	clock_t begin = clock();		// Time Start

	cv::subtract(image1, image2, CPU_res);	// CPU operation

	clock_t end = clock();			// Time End
	double esec = double(end - begin) / CLOCKS_PER_SEC;

	Mat *dev_image1, *dev_image2;
	if (cudaMalloc((void**)&dev_image1, sizeof(Mat)) != cudaSuccess) {
		cout << "Error with dev_image1" << endl;
		return 0;
	}
	if (cudaMalloc((void**)&dev_image2, sizeof(Mat)) != cudaSuccess) {
		cout << "Error with dev_image2" << endl;
		return 0;
	}

	cudaMemcpy(dev_image1, &image1, sizeof(image1), cudaMemcpyHostToDevice);
	cudaMemcpy(dev_image2, &image2, sizeof(image2), cudaMemcpyHostToDevice);
	
	cudaEvent_t start, stop;
	float esec2;
	cudaEventCreate(&start);
	cudaEventCreate(&stop);
	cudaEventRecord(start, 0);		// cuda time start

	GPUimageSubtract <<< /* how many blocks? */, /* how many threads per block? */ >>> (dev_image1, dev_image2, image1.rows, image1.cols, image2.rows, image2.cols);
	// How can I work out the right grid and block dimensions here? (My guess is below the full listing.)

	cudaEventRecord(stop, 0);		
	cudaEventSynchronize(stop);		// cuda time end
	cudaEventElapsedTime(&esec2, start, stop);

	cudaMemcpy(&GPU_res, dev_image1, sizeof(Mat), cudaMemcpyDeviceToHost);

	cout << fixed;
	cout.precision(10);
	cout << "CPU = " << esec << endl;
	cout << "GPU = " << esec2 << endl;

	imshow("CPU_res", CPU_res);
	imshow("GPU_res", GPU_res);
	waitKey(0);

	cudaFree(dev_image1);
	cudaFree(dev_image2);

	return 0;
}
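
And here is my rough guess at the host side, based on tutorials I have found: allocate plain device buffers, copy the raw pixel data (image1.data) instead of the Mat objects, use a 16x16 block and round the grid up so every pixel gets a thread, then copy the result back into a Mat of the same size. subtractKernel is my sketch from the top of the post, and I am assuming both images are continuous CV_8UC3 Mats with identical dimensions. Is this the right direction, and how should I merge it into my code above?

	// assuming image1 and image2 are both continuous CV_8UC3 and the same size
	size_t bytes = (size_t)image1.rows * image1.cols * image1.channels();
	unsigned char *d_a, *d_b, *d_out;
	cudaMalloc((void**)&d_a, bytes);
	cudaMalloc((void**)&d_b, bytes);
	cudaMalloc((void**)&d_out, bytes);

	cudaMemcpy(d_a, image1.data, bytes, cudaMemcpyHostToDevice);	// copy the pixel data, not the Mat header
	cudaMemcpy(d_b, image2.data, bytes, cudaMemcpyHostToDevice);

	dim3 block(16, 16);						// 256 threads per block
	dim3 grid((image1.cols + block.x - 1) / block.x,		// round up so the grid covers every pixel
	          (image1.rows + block.y - 1) / block.y);
	subtractKernel<<<grid, block>>>(d_a, d_b, d_out, image1.rows, image1.cols, image1.channels());

	Mat gpu_result(image1.rows, image1.cols, image1.type());
	cudaMemcpy(gpu_result.data, d_out, bytes, cudaMemcpyDeviceToHost);	// copy the result back into a Mat

	cudaFree(d_a); cudaFree(d_b); cudaFree(d_out);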

I know my code above is really rough and naive, but it is the best I can do right now.
BUT I really want to understand the why and the how!

Thank you!

program3.cu (2.31 KB)
sky.jpg