TY - JOUR
T1 - 3D Shape Estimation of Multiview RGB Images from Deep Convolutional Network
AU - Han, Byung Kil
AU - Park, Jongwoo
AU - Seo, Hyunuk
AU - Song, Sung Hyuk
N1 - Publisher Copyright:
© ICROS 2022.
PY - 2022
Y1 - 2022
N2 - In this paper, we describe point cloud generation method from multiple RGB images based on convolutional neural network. The proposed method is motivated from human’s 3d understanding from orthographic sketches, which infer spatial relationship from front, top, side view of the 3d object. The network of the proposed method employed generative model, to make it predict point clouds of atypical objects. Auto encoder network is utilized to encode three RGB images into latent vectors, and generate point clouds. Loss functions are defined which measure reconstruction performance and uniformity of the point clouds to make the network generate point clouds similar to the original and distribute uniformly along entire region. From this structure, we expected the network to learn spatial relation of the original 3d model from multiple RGB images. The result shows that the proposed method can predict overall shape of the objects, and it is hard to express detailed geometry. As a further work, network structure improvement to generate detailed shape of the object, predict occluded region will be performed.
AB - In this paper, we describe point cloud generation method from multiple RGB images based on convolutional neural network. The proposed method is motivated from human’s 3d understanding from orthographic sketches, which infer spatial relationship from front, top, side view of the 3d object. The network of the proposed method employed generative model, to make it predict point clouds of atypical objects. Auto encoder network is utilized to encode three RGB images into latent vectors, and generate point clouds. Loss functions are defined which measure reconstruction performance and uniformity of the point clouds to make the network generate point clouds similar to the original and distribute uniformly along entire region. From this structure, we expected the network to learn spatial relation of the original 3d model from multiple RGB images. The result shows that the proposed method can predict overall shape of the objects, and it is hard to express detailed geometry. As a further work, network structure improvement to generate detailed shape of the object, predict occluded region will be performed.
KW - Deep learning
KW - Multiview 3d shape estimation
KW - Point cloud
UR - http://www.scopus.com/inward/record.url?scp=85134368463&partnerID=8YFLogxK
U2 - 10.5302/J.ICROS.2022.22.0075
DO - 10.5302/J.ICROS.2022.22.0075
M3 - Article
AN - SCOPUS:85134368463
SN - 1976-5622
VL - 28
SP - 671
EP - 677
JO - Journal of Institute of Control, Robotics and Systems
JF - Journal of Institute of Control, Robotics and Systems
IS - 7
ER -