Incorrect offset application in NPP ColorTwist conversion

matteo.est · November 4, 2024, 6:02pm

I’m trying to apply a custom conversion matrix to convert NV12 frames to RGB using nppiNV12ToRGB_8u_ColorTwist32f_P2C3R_Ctx with NPP 12.3.

The documentation mentions that:

This is how the matrix works for the YUV420/YUV/422/NV12->RGB INVERSE
transform (note- do the offsets first):

  src[0]' = src[0] + aTwist[0][3]
  src[1]' = src[1] + aTwist[1][3]
  src[2]' = src[2] + aTwist[2][3]

And then the remaining 3x3 twist matrix is applied using those modified values:

dst[0] = aTwist[0][0] * src[0]' + aTwist[0][1] * src[1]' + aTwist[0][2] * src[2]'
dst[1] = aTwist[1][0] * src[0]' + aTwist[1][1] * src[1]' + aTwist[1][2] * src[2]'
dst[2] = aTwist[2][0] * src[0]' + aTwist[2][1] * src[1]' + aTwist[2][2] * src[2]'

However, it seems that the offsets are actually applied after the twist matrix, in the same way they are applied in the reverse transformation:

dst[0] = aTwist[0][0] * src[0] + aTwist[0][1] * src[1] + aTwist[0][2] * src[2] + aTwist[0][3]
dst[1] = aTwist[1][0] * src[0] + aTwist[1][1] * src[1] + aTwist[1][2] * src[2] + aTwist[1][3]
dst[2] = aTwist[2][0] * src[0] + aTwist[2][1] * src[1] + aTwist[2][2] * src[2] + aTwist[2][3]
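To make the difference between the two orderings concrete, here is a small CPU-side sketch of both formulas. It is not NPP code: the function names `twist_offsets_first` and `twist_offsets_last` are made up for illustration, and the rounding and saturation to 8 bits are assumptions that may not match what NPP does internally.

```c
/* Hypothetical CPU reference for one pixel; not part of NPP. */

/* Assumed conversion to 8 bits: clamp to [0, 255], round to nearest. */
static unsigned char saturate_8u(float v) {
    if (v < 0.0f) return 0;
    if (v > 255.0f) return 255;
    return (unsigned char) (v + 0.5f);
}

/* Documented inverse form: add column 3 to the source first,
   then apply the 3x3 part to the shifted values. */
static void twist_offsets_first(float t[3][4], const unsigned char src[3],
                                unsigned char dst[3]) {
    float s[3];
    for (int i = 0; i < 3; i++)
        s[i] = (float) src[i] + t[i][3];
    for (int i = 0; i < 3; i++)
        dst[i] = saturate_8u(t[i][0] * s[0] + t[i][1] * s[1] + t[i][2] * s[2]);
}

/* Observed form: apply the 3x3 part to the raw source,
   then add column 3 to the result. */
static void twist_offsets_last(float t[3][4], const unsigned char src[3],
                               unsigned char dst[3]) {
    for (int i = 0; i < 3; i++)
        dst[i] = saturate_8u(t[i][0] * (float) src[0] + t[i][1] * (float) src[1]
                             + t[i][2] * (float) src[2] + t[i][3]);
}
```

With an all-zero 3x3 part and an offset of 100 in the first row, `twist_offsets_first` gives 0 0 0 for any input, while `twist_offsets_last` gives 100 0 0, so a matrix of that shape is enough to tell the two behaviours apart.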

This is a minimal example to reproduce the issue:

#include <stdio.h>
#include <stdlib.h>
#include <cuda_runtime.h>
#include <npp.h>

#define SIZE 2

int main(int argc, char** argv) {

    Npp8u* yuv[2];
    int yuv_size[2] = {SIZE * SIZE, (SIZE / 2) * SIZE};
    int yuv_stride[2] = {SIZE, SIZE};
    cudaMalloc((void**) &yuv[0], yuv_size[0] * sizeof(Npp8u));
    cudaMalloc((void**) &yuv[1], yuv_size[1] * sizeof(Npp8u));

    cudaMemset(yuv[0], 128, yuv_size[0] * sizeof(Npp8u));
    cudaMemset(yuv[1], 128, yuv_size[1] * sizeof(Npp8u));

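    Npp8u* rgb;
    int rgb_size = SIZE * SIZE * 3;
    // Destination step is given in bytes. A packed RGB row is SIZE * 3 bytes,
    // so with a step of SIZE the output rows overlap in memory.
    int rgb_stride = SIZE;
    cudaMalloc((void**) &rgb, rgb_size * sizeof(Npp8u));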

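    // All-zero 3x3 part with a single offset of 100 on the first channel:
    // offsets applied first would give 0, offsets applied last would give 100.
    Npp32f twist[3][4] = {
        {0,0,0,100},
        {0,0,0,0},
        {0,0,0,0},
    };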

    NppStreamContext nppStreamCtx;
    nppGetStreamContext(&nppStreamCtx);
    NppiSize roi = {SIZE, SIZE};
    nppiNV12ToRGB_8u_ColorTwist32f_P2C3R_Ctx(yuv, yuv_stride, rgb, rgb_stride, roi, twist, nppStreamCtx);

    Npp8u* rgb_host = (Npp8u*) malloc (rgb_size);
    cudaMemcpy(rgb_host, rgb, rgb_size * sizeof(Npp8u), cudaMemcpyDeviceToHost);

    for (int i = 0; i < SIZE * SIZE; i++) {
        printf("%d) %d %d %d\n", i, rgb_host[i * 3], rgb_host[i * 3 + 1], rgb_host[i * 3 + 2]);
    }
    
    free(rgb_host);
    cudaFree(yuv[0]);
    cudaFree(yuv[1]);
    cudaFree(rgb);

    return 0;
}

If the offsets were applied before the twist matrix, every value in the resulting RGB buffer would be zero, but instead I get:

 0) 100 0 100
 1) 0 0 100
 2) 0 0 0
 3) 0 0 0

So it looks like the offsets are applied after the multiplications. This makes it impossible to use this function as documented to convert from NV12.
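If the behaviour really is as observed, one possible workaround is to fold the offsets into the last column by hand: applying the offsets first with offsets b is the same as applying them last with offsets M·b, where M is the 3x3 part. The sketch below only checks that algebra on the CPU. It has not been verified against NPP itself, the helper names are hypothetical, and a library version that follows the documented order would need the original, unfolded matrix.

```c
/* Hypothetical helpers for illustration; not part of NPP. */

/* Rewrite an "offsets first" twist so that a routine which adds
   column 3 after the 3x3 multiply produces the same result:
   post[i][3] = sum over j of pre[i][j] * pre[j][3]. */
static void fold_offsets(float pre[3][4], float post[3][4]) {
    for (int i = 0; i < 3; i++) {
        float folded = 0.0f;
        for (int j = 0; j < 3; j++) {
            post[i][j] = pre[i][j];          /* 3x3 part is unchanged */
            folded += pre[i][j] * pre[j][3]; /* row i of M times offsets */
        }
        post[i][3] = folded;
    }
}

/* Documented order: dst = M * (src + offsets). */
static void apply_offsets_first(float t[3][4], const float src[3], float dst[3]) {
    float s[3];
    for (int i = 0; i < 3; i++)
        s[i] = src[i] + t[i][3];
    for (int i = 0; i < 3; i++)
        dst[i] = t[i][0] * s[0] + t[i][1] * s[1] + t[i][2] * s[2];
}

/* Observed order: dst = M * src + offsets. */
static void apply_offsets_last(float t[3][4], const float src[3], float dst[3]) {
    for (int i = 0; i < 3; i++)
        dst[i] = t[i][0] * src[0] + t[i][1] * src[1] + t[i][2] * src[2] + t[i][3];
}
```

Calling `apply_offsets_last` with the matrix produced by `fold_offsets` should match `apply_offsets_first` with the original matrix, up to floating-point rounding and before any clamping to 8 bits.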

jon9 · November 7, 2025, 3:19pm

Yep, have just discovered the same problem.

jon9 · November 7, 2025, 7:22pm

Looks like it’s fixed in NPP v13.0.
