TY - GEN
T1 - Object-aware hybrid map for indoor robot visual semantic navigation
AU - Wang, Li
AU - Li, Ruifeng
AU - Sun, Jingwen
AU - Zhao, Lijun
AU - Shi, Hezi
AU - Seah, Hock Soon
AU - Tandianus, Budianto
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/12
Y1 - 2019/12
N2 - In order to achieve an intuitive interaction and visual semantic navigation for the indoor robot, we propose a novel object-aware hybrid map. The existing map is usually a metric map, lacking semantics for interaction. We combine objects in the indoor environment with the metric map to constitute a hybrid map. The map consists of a 3D object semantic map and a 2D occupancy grid map, which transfers human commands to the grid map through object semantics, thereby enabling autonomous navigation for the robot. We utilize ORB-SLAM2 for continuous pose estimation and 3D mapping. 2D object detection in key-frames is conducted based on YOLO v3. The object point clouds in multiple perspectives are merged and a 3D bounding box of the object is estimated. These objects construct a 3D semantic map. Furthermore, we project a 3D point cloud map into a 2D plane in order to get an occupancy grid map. Finally, these two maps are combined forming an object-aware hybrid map. We conduct experiments in real environments in order to verify the feasibility and robustness of the hybrid map for robot semantic navigation.
AB - In order to achieve an intuitive interaction and visual semantic navigation for the indoor robot, we propose a novel object-aware hybrid map. The existing map is usually a metric map, lacking semantics for interaction. We combine objects in the indoor environment with the metric map to constitute a hybrid map. The map consists of a 3D object semantic map and a 2D occupancy grid map, which transfers human commands to the grid map through object semantics, thereby enabling autonomous navigation for the robot. We utilize ORB-SLAM2 for continuous pose estimation and 3D mapping. 2D object detection in key-frames is conducted based on YOLO v3. The object point clouds in multiple perspectives are merged and a 3D bounding box of the object is estimated. These objects construct a 3D semantic map. Furthermore, we project a 3D point cloud map into a 2D plane in order to get an occupancy grid map. Finally, these two maps are combined forming an object-aware hybrid map. We conduct experiments in real environments in order to verify the feasibility and robustness of the hybrid map for robot semantic navigation.
KW - 3D object detection
KW - Hybrid map
KW - Robot semantic navigation
UR - https://www.scopus.com/pages/publications/85079050768
U2 - 10.1109/ROBIO49542.2019.8961495
DO - 10.1109/ROBIO49542.2019.8961495
M3 - 会议稿件
AN - SCOPUS:85079050768
T3 - IEEE International Conference on Robotics and Biomimetics, ROBIO 2019
SP - 1166
EP - 1172
BT - IEEE International Conference on Robotics and Biomimetics, ROBIO 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2019 IEEE International Conference on Robotics and Biomimetics, ROBIO 2019
Y2 - 6 December 2019 through 8 December 2019
ER -