I agree, it's oversampling, undersampling or cost function. Augmenting an image with lots of labels from different classes would be difficult. Choosing cost function like you said would be simpler. Great to confirm that one on your answer! Thanks Piotr.