TY - JOUR
T1 - Monocular Human Depth Estimation Via Pose Estimation
AU - Jun, Jinyoung
AU - Lee, Jae Han
AU - Lee, Chul
AU - Kim, Chang Su
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2021
Y1 - 2021
N2 - We propose a novel monocular depth estimator, which improves the prediction accuracy on human regions by utilizing pose information. The proposed algorithm consists of two networks - PoseNet and DepthNet - to estimate keypoint heatmaps and a depth map, respectively. We incorporate the pose information from PoseNet to improve the depth estimation performance of DepthNet. Specifically, we develop the feature blending block, which fuses the features from PoseNet and DepthNet and feeds them into the next layer of DepthNet, to make the networks learn to predict the depths of human regions more accurately. Furthermore, we develop a novel joint training scheme using partially labeled datasets, which balances multiple loss functions effectively by adjusting weights. Experimental results demonstrate that the proposed algorithm can improve depth estimation performance significantly, especially around human regions. For example, the proposed algorithm improves the depth estimation performance on the human regions of ResNet-50 by 2.8% and 7.0% in terms of $\delta _{1}$ and RMSE, respectively, on the proposed HD + P dataset.
AB - We propose a novel monocular depth estimator, which improves the prediction accuracy on human regions by utilizing pose information. The proposed algorithm consists of two networks - PoseNet and DepthNet - to estimate keypoint heatmaps and a depth map, respectively. We incorporate the pose information from PoseNet to improve the depth estimation performance of DepthNet. Specifically, we develop the feature blending block, which fuses the features from PoseNet and DepthNet and feeds them into the next layer of DepthNet, to make the networks learn to predict the depths of human regions more accurately. Furthermore, we develop a novel joint training scheme using partially labeled datasets, which balances multiple loss functions effectively by adjusting weights. Experimental results demonstrate that the proposed algorithm can improve depth estimation performance significantly, especially around human regions. For example, the proposed algorithm improves the depth estimation performance on the human regions of ResNet-50 by 2.8% and 7.0% in terms of $\delta _{1}$ and RMSE, respectively, on the proposed HD + P dataset.
KW - Monocular depth estimation
KW - human depth estimation
KW - human pose estimation
KW - loss rebalancing strategy
UR - http://www.scopus.com/inward/record.url?scp=85119720337&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2021.3126629
DO - 10.1109/ACCESS.2021.3126629
M3 - Article
AN - SCOPUS:85119720337
SN - 2169-3536
VL - 9
SP - 151444
EP - 151457
JO - IEEE Access
JF - IEEE Access
ER -