Position Puzzle Network and Augmentation: localizing human keypoints beyond the bounding box

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 57
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorPark, Soonchanko
dc.contributor.authorPark, Jinahko
dc.date.accessioned2023-11-02T03:00:24Z-
dc.date.available2023-11-02T03:00:24Z-
dc.date.created2023-11-01-
dc.date.created2023-11-01-
dc.date.created2023-11-01-
dc.date.created2023-11-01-
dc.date.issued2023-11-
dc.identifier.citationMACHINE VISION AND APPLICATIONS, v.34, no.6-
dc.identifier.issn0932-8092-
dc.identifier.urihttp://hdl.handle.net/10203/314097-
dc.description.abstractWhen estimating human pose with a partial image of a person, we, humans, do not confine the spatial range of our estimation to the given image and can readily localize keypoints outside of the image by referring to visual clues such as the body size. However, computational methods for human pose estimation do not consider those keypoints outside and focus only on the bounded area of a given image. In this paper, we propose a neural network and a data augmentation method to extend the range of human pose estimation beyond the bounding box. While our Position Puzzle Network expands the spatial range of keypoint localization by refining the position and the size of the target’s bounding box, Position Puzzle Augmentation enables the keypoint detector to estimate keypoints not only within, but also beyond the input image. We show that the proposed method enhances the baseline keypoint detectors by 39.5% and 30.5% on average in mAP and mAR, respectively, by enabling the localization of keypoints out of the bounding box using a cropped image dataset prepared for proper evaluation. Additionally, we verify that the proposed method does not degrade the performance under the original benchmarks and instead, improves the performance by alleviating false-positive errors.-
dc.languageEnglish-
dc.publisherSPRINGER-
dc.titlePosition Puzzle Network and Augmentation: localizing human keypoints beyond the bounding box-
dc.typeArticle-
dc.identifier.wosid001091876500001-
dc.identifier.scopusid2-s2.0-85175019646-
dc.type.rimsART-
dc.citation.volume34-
dc.citation.issue6-
dc.citation.publicationnameMACHINE VISION AND APPLICATIONS-
dc.identifier.doi10.1007/s00138-023-01471-6-
dc.contributor.localauthorPark, Jinah-
dc.description.isOpenAccessN-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorHuman keypoint detection-
dc.subject.keywordAuthorHuman pose estimation-
dc.subject.keywordAuthorData augmentation-
Appears in Collection
CS-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0