ViTPose++ : Vision Transformer for Generic Body Pose Estimation
In this paper, we show the surprisingly good properties of plain vision transformers for body pose estimation from various aspects, namely simplicity in model structure, scalability in model size, flexibility in training paradigm, and transferability of knowledge between models, through a simple bas...
Publié dans: | IEEE transactions on pattern analysis and machine intelligence. - 1979. - 46(2024), 2 vom: 01. Feb., Seite 1212-1230 |
---|---|
Auteur principal: | |
Autres auteurs: | , , |
Format: | Article en ligne |
Langue: | English |
Publié: |
2024
|
Accès à la collection: | IEEE transactions on pattern analysis and machine intelligence |
Sujets: | Journal Article |
Accès en ligne |
Volltext |