I have a library of 10 million photos from sporting events that we cover and I want to train an AI when to trigger (take the photo) based on the content of video coming in from a webcam. This is sport specific. I’m an experienced web developer, primarily JS and PHP.
The solution must be offline because the venues are often in locations with low or slow internet.
Based on my research, it would be best if I had unsupervised training (feed all of the images in) and then supervised training (show it images that are positive (take the shot), and images that are negative (don’t take the shot).
Then, I need some hardware and a laptop to read the input and trigger the API calls to the camera to take the shot.
So:
- Train a model using the 10 million photos (I’m not sure what to do there, there’s so many options, but I need it to be offline.
- Purchase hardware for on-site processing. (What hardware do I need? Will a laptop work?)
- Build the API integration with Sony’s camera AI (I can handle this already no problem).
So for 1 + 2, what’s the best way for me to proceed?