Teaching an AI to Detect Key Actors in Multi-person Videos

Originally published at: Teaching an AI to Detect Key Actors in Multi-person Videos | NVIDIA Technical Blog

Researchers from Google and Stanford have taught their computer vision model to detect the most important person in a multi-person video scene – for example, who the shooter is in a basketball game which typically contains dozens or hundreds of people in a scene. Using 20 Tesla K40 GPUs and the cuDNN-accelerated Tensorflow deep learning…