Morganh. thanks for replying, last night, testing different changes on the spec file reduced the batch size to 32. “Suddenly” it worked. So indeed that’s the solution to this problem. Now that you suggest that as the first action to take. Is there a reason why X amount of images requires Y batch-size?
Is there a dependency of batch size on the accuracy of the model? How should I choose the appropriate batch size for my training?
As a common practice, a small batch size or single GPU is preferred for a small dataset; while a large batch size or multiple GPUs is preferred for a large dataset.