TY - GEN
T1 - Voice-Controlled Navigation for TurtleBot3 using ROS Noetic and Speech Recognition
AU - Kasno, Mohammad Afif
AU - Yatim, Norhidayah Mohamad
AU - Kang, Tae Won
AU - Hee, Jo Min
AU - Nam, Jun Woo
AU - Jung, Jin Woo
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - This study presents a unique approach for operating the TurtleBot3 Waffle-Pi robot with voice instructions using the ROS Noetic framework. The system uses the SpeechRecognition library to translate voice inputs into movement commands, with the goal of improving user engagement and providing an intuitive control mechanism that can be applied to a variety of domains including assistive robotics. The system naturally interfaces with ROS Noetic, which uses Python for scripting and control logic. The method entails real-time recording audio with a microphone, filtering it to remove noise, then interpreting commands with Google's speech recognition API. For the TurtleBot3, commands like 'forward,' 'back,' 'left,' 'right,' and 'stop' are assigned to matching movements. Noise filtering and command disambiguation problems are solved by modifying the recognizer's energy threshold and using a dynamic energy threshold. A complete test was carried out in controlled situations to determine command recognition accuracy, response time, and navigation precision. The results showed good reliability and effectiveness, with accurate command recognition and minimum reaction delay.
AB - This study presents a unique approach for operating the TurtleBot3 Waffle-Pi robot with voice instructions using the ROS Noetic framework. The system uses the SpeechRecognition library to translate voice inputs into movement commands, with the goal of improving user engagement and providing an intuitive control mechanism that can be applied to a variety of domains including assistive robotics. The system naturally interfaces with ROS Noetic, which uses Python for scripting and control logic. The method entails real-time recording audio with a microphone, filtering it to remove noise, then interpreting commands with Google's speech recognition API. For the TurtleBot3, commands like 'forward,' 'back,' 'left,' 'right,' and 'stop' are assigned to matching movements. Noise filtering and command disambiguation problems are solved by modifying the recognizer's energy threshold and using a dynamic energy threshold. A complete test was carried out in controlled situations to determine command recognition accuracy, response time, and navigation precision. The results showed good reliability and effectiveness, with accurate command recognition and minimum reaction delay.
KW - Mobile Robotics
KW - ROS Noetic
KW - Speech Recognition
KW - TurtleBot3
KW - Voice Control
UR - http://www.scopus.com/inward/record.url?scp=85217673881&partnerID=8YFLogxK
U2 - 10.1109/ICTC62082.2024.10826802
DO - 10.1109/ICTC62082.2024.10826802
M3 - Conference contribution
AN - SCOPUS:85217673881
T3 - International Conference on ICT Convergence
SP - 1970
EP - 1971
BT - ICTC 2024 - 15th International Conference on ICT Convergence
PB - IEEE Computer Society
T2 - 15th International Conference on Information and Communication Technology Convergence, ICTC 2024
Y2 - 16 October 2024 through 18 October 2024
ER -