On line 223 of train.py, the input and target should be out[0] and label[0], respectively, if we want to train on the data after multi-scale fusion.
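
To make the suggestion concrete, here is a rough sketch of what I mean. I am not quoting the actual train.py here: `mse`, `fused_loss`, and the toy values are hypothetical stand-ins, and I am assuming that the entries after index 0 are the per-scale outputs. The only part taken from the code is that index 0 of `out` and `label` is the pair after multi-scale fusion.

```python
# Hypothetical sketch: the names and data are illustrative, not from train.py.

def mse(pred, target):
    """Toy mean-squared-error loss over two equal-length lists of floats."""
    assert len(pred) == len(target)
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)


def fused_loss(out, label, criterion=mse):
    """Compute the loss on the fused pair only, i.e. index 0 of both lists."""
    return criterion(out[0], label[0])


# Assumed layout: index 0 is the fused output, later indices are per-scale outputs.
out = [[0.5, 1.0], [0.0, 0.0], [9.0, 9.0]]
label = [[1.0, 1.0], [1.0, 1.0], [1.0, 1.0]]

# Only out[0] and label[0] contribute; the other entries are ignored.
loss = fused_loss(out, label)
```

With the real loss function in place of `mse`, the call would simply take `out[0]` as input and `label[0]` as target.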