谁能解释一下 Feature Pyramid Networks for Object Detection 论文中的 "non-sliding window" 语句？

Can anyone explain about the "non-sliding window" statement in Feature Pyramid Networks for Object Detection paper?

用于目标检测的特征金字塔网络采用 RPN 技术创建检测器，并使用滑动 window 技术进行分类。 5.2 节怎么会有 "non-sliding window" 的语句？

论文中的扩展语句： 5.2。使用 Fast/Faster R-CNN 进行目标检测接下来我们研究基于区域（非滑动 window）检测器的 FPN。

在我的理解中，FPN在检测任务中使用滑动window。这在中也提到 https://medium.com/@jonathan_hui/understanding-feature-pyramid-networks-for-object-detection-fpn-45b227b9106c 语句是

"FPN extracts feature maps and later feeds into a detector, says RPN, for object detection. RPN applies a sliding window over the feature maps to make predictions on the objectness (has an object or not) and the object boundary box at each location."

先谢谢你。

用于对象检测的特征金字塔网络 (FPN) 不是 RPN。

FPN 只是一种更好的特征提取方法。它结合了几个阶段的特征，为对象检测管道的其余部分提供了更好的特征（特别是因为它结合了第一阶段的特征，为检测 small/medium 大小的对象提供了更好的特征）。

正如原始论文所述：“我们的目标是利用 ConvNet 的金字塔特征层次结构，它具有从低到高级别的语义，以及构建一个始终具有高级语义的特征金字塔。由此产生的特征金字塔网络是 general purpose 并且在本文中我们专注于滑动 window proposers（Region Proposal Network，简称 RPN）[29] 和基于区域的检测器 (Fast R-CNN)"

所以他们用它来检查 "Two stage" 对象检测管道。第一阶段是 RPN，这是他们在 5.1 节中检查的内容，然后他们在 5.2 节中检查分类阶段。

Fast R-CNN Faster R-CNN 等是基于区域的对象检测器，而不是滑动 window 检测器。他们从 RPN 中获得一组固定的区域来进行分类，仅此而已。

您可以在 https://medium.com/@jonathan_hui/what-do-we-learn-from-region-based-object-detectors-faster-r-cnn-r-fcn-fpn-7e354377a7c9 看到关于差异的很好解释。

谁能解释一下 Feature Pyramid Networks for Object Detection 论文中的 "non-sliding window" 语句？

Can anyone explain about the "non-sliding window" statement in Feature Pyramid Networks for Object Detection paper?

computer-vision

deep-learning

pytorch

retinanet