PPDM++ : Parallel Point Detection and Matching for Fast and Accurate HOI Detection
Human-Object Interaction (HOI) detection aims to understand human activities by detecting interaction triplets. Previous HOI detection methods adopt a two-stage instance-driven paradigm. Unfortunately, many non-interactive human-object pairs generated by the first stage are the main obstacle impedin...
| Veröffentlicht in: | IEEE transactions on pattern analysis and machine intelligence. - 1979. - 46(2024), 10 vom: 10. Okt., Seite 6826-6841 | 
|---|---|
| 1. Verfasser: | |
| Weitere Verfasser: | , , , , , | 
| Format: | Online-Aufsatz | 
| Sprache: | English | 
| Veröffentlicht: | 
            
            2024
         | 
| Zugriff auf das übergeordnete Werk: | IEEE transactions on pattern analysis and machine intelligence | 
| Schlagworte: | Journal Article | 
| Zusammenfassung: | Human-Object Interaction (HOI) detection aims to understand human activities by detecting interaction triplets. Previous HOI detection methods adopt a two-stage instance-driven paradigm. Unfortunately, many non-interactive human-object pairs generated by the first stage are the main obstacle impeding HOI detectors from high efficiency and promising performance. To remedy this, we propose a novel top-down interaction-driven paradigm, detecting interactions first and bridging interactive human-object pairs through interactions. We formulate HOI as a point triplet human point, interaction point, object point and design a Parallel Point Detection and Matching (PPDM) framework. We further take advantage of two-stage methods and propose a novel framework, PPDM++, that detects the interactive human-object pairs by PPDM, then extracts region features for each pair to predict actions. The core of PPDM/PPDM++ is to convert the instance-driven bottom-up paradigm to an interaction-driven top-down paradigm, thus avoiding additional computation costs from traversing a tremendous number of non-interactive pairs. Benefiting from the advanced paradigm, PPDM/PPDM++ has achieved significant performance gains with high efficiency. PPDM-DLA-34 has achieved 19.94 mAP with 42 FPS as the first real-time HOI detector, and PPDM++-SwinB achieves 30.1 mAP with 17 FPS on HICO-DET dataset. We also built an application-oriented database named HOI-A, a supplement to the existing datasets | 
|---|---|
| Beschreibung: | Date Revised 06.09.2024 published: Print-Electronic Citation Status PubMed-not-MEDLINE  | 
| ISSN: | 1939-3539 | 
| DOI: | 10.1109/TPAMI.2024.3386891 |