Very Deep Convolutional Networks for Large-Scale Image Recognition
Citations
755 citations
Cites methods from "Very Deep Convolutional Networks fo..."
...Yang et al. [57] (SEU-NJU team, 4th place in LAP challenge) use face and landmark detection for face alignment and the VGG-16 architecture [50] for modeling....
[...]
...Thereby each filter in VGG-16 captures simpler geometrical structures but in comparison allows more complex reasoning through its increased depth....
[...]
...For our convolutional neural networks (CNNs) we use the deep VGG-16 architecture [48]....
[...]
...Our method uses a CNN with the VGG-16 (Simonyan and Zisserman 2014) architecture [cf....
[...]
...3.4 Output layer and expected value The pre-trained CNN (with VGG-16 architecture) for the ImageNet classification task has an output layer of 1000 softmax-normalized neurons, one for each of the object classes....
[...]
753 citations
Cites background from "Very Deep Convolutional Networks fo..."
...There is a common pattern in the design of most SOD networks [18, 27, 41, 6], that is, they focus on making good use of deep features extracted by existing backbones, such as Alexnet [17], VGG [35], ResNet [12], ResNeXt [44], DenseNet [15], etc....
[...]
...In modern CNN designs, such as VGG, ResNet, DenseNet and so on, small convolutional filters with size of 1×1 or 3×3 are the most frequently used components for feature extraction....
[...]
...Practically, we adapt the backbones (VGG-16 and ResNet50) by adding an extra stage after their last convolutional stages to achieve the same receptive fields with our original U2-Net architecture design....
[...]
...To validate the backbone free design, we conduct ablation studies on replacing the encoder part of our full size U2-Net with different backbones: VGG16 and ResNet50....
[...]
...Different from the previous salient object detection models which use backbones (e.g. VGG, ResNet, etc.) as their encoders, our newly proposed U2-Net architecture is backbone free....
[...]
752 citations
749 citations
Cites background or methods from "Very Deep Convolutional Networks fo..."
...than Fast R-CNN (1830ms) with the same VGG [26] backbone, and processing rate was 5fps vs....
[...]
...As well, total running time of Faster R-CNN (198ms) is nearly 10 times lower than Fast R-CNN (1830ms) with the same VGG [24] backbone, and processing rate is 5fps vs. 0.5fps. D. Mask R-CNN Mask R-CNN [9] is an extending work to Faster R-CNN mainly for instance segmentation task....
[...]
...Experiments showed that SSD512 had a competitive result both mAP and speed with VGG-16 [24] backbone....
[...]
...Experiments showed that SSD512 had a competitive result on both mAP and speed with VGG-16 [26] backbone....
[...]
...For M2Det is an one-stage detector, it achieves AP of 41.0 at speed of 11.8 FPS with single-scale inference strategy and AP of 44.2 with multi-scale inference strategy utilizing VGG-16 on COCO test-dev set....
[...]
747 citations
References
49,639 citations
21,729 citations
9,803 citations
9,775 citations
6,061 citations