In our system (Linux), many process use GPU(1-3 Tesla) with Mathinelaerning-classify. And we care real-time. I want to design a mechanism that schedule the priority of process’s GPU request(compute job,exmp: surounding-env detect, cv-detection,segmention process,classify process,etc). But after soming reading and coding, I realize this solution might something wrong.
any suggestion? ;(