Group Sampling for Scale Invariant Face Detection

Detectors based on deep learning tend to detect multi-scale objects on a single input image for efficiency. Recent works, such as FPN and SSD, generally use feature maps from multiple layers with different spatial resolutions to detect objects at different scales, e.g., high-resolution feature maps...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on pattern analysis and machine intelligence. - 1979. - 44(2022), 2 vom: 15. Feb., Seite 985-1001
1. Verfasser:	Ming, Xiang (VerfasserIn)
Weitere Verfasser:	Wei, Fangyun, Zhang, Ting, Chen, Dong, Zheng, Nanning, Wen, Fang
Format:	Online-Aufsatz
Sprache:	English
Veröffentlicht:	2022
Zugriff auf das übergeordnete Werk:	IEEE transactions on pattern analysis and machine intelligence
Schlagworte:	Journal Article


LEADER	01000naa a22002652 4500
001	NLM313260753
003	DE-627
005	20231225150210.0
007	cr uuu---uuuuu
008	231225s2022 xx \|\|\|\|\|o 00\| \|\|eng c
024	7		\|a 10.1109/TPAMI.2020.3012414 \|2 doi
028	5	2	\|a pubmed24n1044.xml
035			\|a (DE-627)NLM313260753
035			\|a (NLM)32750835
040			\|a DE-627 \|b ger \|c DE-627 \|e rakwb
041			\|a eng
100	1		\|a Ming, Xiang \|e verfasserin \|4 aut
245	1	0	\|a Group Sampling for Scale Invariant Face Detection
264		1	\|c 2022
336			\|a Text \|b txt \|2 rdacontent
337			\|a ƒaComputermedien \|b c \|2 rdamedia
338			\|a ƒa Online-Ressource \|b cr \|2 rdacarrier
500			\|a Date Completed 28.03.2022
500			\|a Date Revised 01.04.2022
500			\|a published: Print-Electronic
500			\|a Citation Status MEDLINE
520			\|a Detectors based on deep learning tend to detect multi-scale objects on a single input image for efficiency. Recent works, such as FPN and SSD, generally use feature maps from multiple layers with different spatial resolutions to detect objects at different scales, e.g., high-resolution feature maps for small objects. However, we find that objects at all scales can also be well detected with features from a single layer of the network. In this paper, we carefully examine the factors affecting detection performance across a large range of scales, and conclude that the balance of training samples, including both positive and negative ones, at different scales is the key. We propose a group sampling method which divides the anchors into several groups according to the scale, and ensure that the number of samples for each group is the same during training. Our approach using only one single layer of FPN as features is able to advance the state-of-the-arts. Comprehensive analysis and extensive experiments have been conducted to show the effectiveness of the proposed method. Moreover, we show that our approach is favorably applicable to other tasks, such as object detection on COCO dataset, and to other detection pipelines, such as YOLOv3, SSD and R-FCN. Our approach, evaluated on face detection benchmarks including FDDB and WIDER FACE datasets, achieves state-of-the-art results without bells and whistles
650		4	\|a Journal Article
700	1		\|a Wei, Fangyun \|e verfasserin \|4 aut
700	1		\|a Zhang, Ting \|e verfasserin \|4 aut
700	1		\|a Chen, Dong \|e verfasserin \|4 aut
700	1		\|a Zheng, Nanning \|e verfasserin \|4 aut
700	1		\|a Wen, Fang \|e verfasserin \|4 aut
773	0	8	\|i Enthalten in \|t IEEE transactions on pattern analysis and machine intelligence \|d 1979 \|g 44(2022), 2 vom: 15. Feb., Seite 985-1001 \|w (DE-627)NLM098212257 \|x 1939-3539 \|7 nnns
773	1	8	\|g volume:44 \|g year:2022 \|g number:2 \|g day:15 \|g month:02 \|g pages:985-1001
856	4	0	\|u http://dx.doi.org/10.1109/TPAMI.2020.3012414 \|3 Volltext
912			\|a GBV_USEFLAG_A
912			\|a SYSFLAG_A
912			\|a GBV_NLM
912			\|a GBV_ILN_350
951			\|a AR
952			\|d 44 \|j 2022 \|e 2 \|b 15 \|c 02 \|h 985-1001