SSD 盗梦空间 v2. VGG16 特征提取器是否被 Inception v2 取代？

Question

原文中SSD paper they used a VGG16 network to the the feature extraction. I am using the SSD Inception v2 model from the TensorFlow model zoo and I do not know what the difference in architecture is. This stack overflow post建议对于SSD MobileNet等其他模型，将VGG16特征提取器替换为MobileNet特征提取器。

我认为这与 SSD Inception 的情况相同，但 this paper has me confused. From here it seems that the Inception is added to the SSD part of the model and the VGG16 feature extractor remains in the beginning of the architecture.

SSD Inception v2模型的架构是什么？

Answer 1

在tensorflow目标检测api中，ssd_inception_v2模型使用inception_v2作为特征提取器，即第一个图中的vgg16部分（图（a )) 替换为 inception_v2.

在ssd模型中，由feature extractor（即vgg16、inception_v2、mobilenet）提取的特征层将被进一步处理以产生不同分辨率的额外特征层。上图（a）中，有6个输出特征层，前两个（19x19）直接取自feature extractor。其他 4 层（10x10、5x5、3x3、1x1）是如何生成的？

它们是由额外的卷积运算生成的（这些卷积运算有点像使用非常浅的特征提取器，不是吗？）。实施细节 here 随良好文档一起提供。在文档中它说

Note that the current implementation only supports generating new layers 
using convolutions of stride 2 (resulting in a spatial resolution reduction 
by a factor of 2)

这就是额外的特征图减少2倍的方式，如果你阅读函数multi_resolution_feature_maps，你会发现使用了slim.conv2d操作，这表明获得了这些额外的层带有额外的卷积层（每个只有一层！）。

现在我们可以解释您链接的论文中的改进之处。 他们提议用初始块替换额外的特征层。没有 inception_v2 模型，只有一个初始块。 论文报道了通过使用初始块提高分类准确率。

现在问题应该清楚了，vgg16、inceptioin_v2或mobilenet的ssd模型都可以，但是inception这篇论文只提到了一个初始块，而不是初始网络。

SSD 盗梦空间 v2. VGG16 特征提取器是否被 Inception v2 取代？

SSD Inception v2. Is the VGG16 feature extractor replaced by the Inception v2?

machine-learning

object-detection

deep-learning

tensorflow

object-detection-api