Few-shot Adaptive Faster R-CNN — Supplementary Material

Tao Wang,Xiaopeng Zhang,Li Yuan,Jiashi Feng
2019-01-01
Abstract:We run the experiments with Nvidia GeForce GTX TITAN X GPUs. We use PyTorch to implement the proposed model. All the network models, including the domain discriminators, are optimized with common SGD optimizer with momentum. The learning rate is set as 0.0001 for both the feature generator and domain discriminators. Each mini-batch contains 1 target domain image and 3 source domain images. The 4 images are sampled randomly. Features from the target domain image and one source domain image are paired to generate the source-target feature samples. Features from the other two source domain images are paired to form the source-source feature samples. The image-level domain discriminators have input batch size of 400, 200, 100 respectively for small, medium and large scale image-level adaptation module. A half of the input batch is composed of source-target pairs and the other half contains source-source pairs. The instance-level domain discriminator’s input batch size is not fixed and is based on the object instance presented in the batch sample. For the Cityscapes to Foggy Cityscapes scenario, to implement the instance-level adaptation module, we split the dataset according to object class. We feed image batch containing objects of one class for each update, the class of each update is randomly selected. When source model feature regularization is applied, one of the the three source images discussed above is fed through both source trained model and feature generator to calculate the regularization term.
What problem does this paper attempt to address?