Multi-simulated deep reinforcement learning based conversational robot arm/ Kyle Mhiron P. Bumagat, Arianne Joy D. Evangelista, John Louis D. Lagramada, Jhon Mark G. Remollo, Nicamae G. Tamundong, and Althea R. Villanueva.--

By:

Bumagat, Kyle Mhiron P [author]

Contributor(s):

Material type: Text

TextPublication details: Manila: Technological University of the Philippines, 2025.Description: x, 90pages: 29cmContent type:

Media type:

Carrier type:

Subject(s):

LOC classification:

BTH TK 870 B86 2025

Dissertation note: College Of Engineering.-- Bachelor of science in electronics engineering: Technological University of the Philippines, 2025. Summary: Adaptive control that dynamically responds to perturbations is a difficult problem to solve using classical techniques in robot manipulation. This raises a critical issue in creating a fully collaborative robotic system. In this study, deep reinforcement learning was used to train a robot arm in reaching a specific 3D point in space using 4096 parallel environments. The rewards were shaped to minimize action rate and velocity under a curriculum with domain randomization. This study simulated, gain-tuned, and trained the agents using the Isaac Lab framework. Within 6.24 minutes, a performant model arrived, essentially compressing months of training to minutes. Proxima policy optimization (PPO) was used with a multi-layer perceptron (MLP) neural network backbone. A modular ROS 2 package was also presented for bridging natural language understanding to physical systems. The package employs both cloud and edge computing of state-of-the-art speech- to-text, text-to-speech, and large language models. The developed system was evaluated for its educational impact using a structured learning module. Results showed significant improvements between the pre-test and post-test, with a t-value of -19.329 and p < .001. The mean scores for the evaluated factors: Attention, Relevance, Confidence, and Satisfaction ranged from approximately 3.9 to 4.3, with Cronbach’s alpha values indicating high reliability. The Structural Equation Model (SEM) indicated a good fit, SRMR = 0.071. Future enhancements should focus on increasing sample size, adding beginner-friendly resources, and expanding advanced ROS 2 topics. Further development of the conversational robot arm with interactive features can also improve student engagement and learning.

Tags from this library: No tags from this library for this title. Log in to add tags.

Average rating: 0.0 (0 votes)

Holdings
Item type	Current library	Shelving location	Call number	Copy number	Status	Date due	Barcode
Bachelor's Thesis COE	TUP Manila Library	Thesis Section-2nd floor	BTH TK 870 B86 2025 (Browse shelf(Opens below))	c.1	Not for loan		BTH0006395

Bachelor's thesis

College Of Engineering.--
Bachelor of science in electronics engineering: Technological University of the Philippines,
2025.

Includes bibliographic references and index.

Adaptive control that dynamically responds to perturbations is a difficult problem
to solve using classical techniques in robot manipulation. This raises a critical issue in
creating a fully collaborative robotic system. In this study, deep reinforcement learning
was used to train a robot arm in reaching a specific 3D point in space using 4096 parallel
environments. The rewards were shaped to minimize action rate and velocity under a
curriculum with domain randomization. This study simulated, gain-tuned, and trained the
agents using the Isaac Lab framework. Within 6.24 minutes, a performant model arrived,
essentially compressing months of training to minutes. Proxima policy optimization (PPO)
was used with a multi-layer perceptron (MLP) neural network backbone. A modular ROS
2 package was also presented for bridging natural language understanding to physical

systems. The package employs both cloud and edge computing of state-of-the-art speech-
to-text, text-to-speech, and large language models. The developed system was evaluated

for its educational impact using a structured learning module. Results showed significant
improvements between the pre-test and post-test, with a t-value of -19.329 and p < .001.
The mean scores for the evaluated factors: Attention, Relevance, Confidence, and
Satisfaction ranged from approximately 3.9 to 4.3, with Cronbach’s alpha values indicating
high reliability. The Structural Equation Model (SEM) indicated a good fit, SRMR = 0.071.
Future enhancements should focus on increasing sample size, adding beginner-friendly
resources, and expanding advanced ROS 2 topics. Further development of the
conversational robot arm with interactive features can also improve student engagement
and learning.

There are no comments on this title.

to post a comment.

Click on an image to view it in the image viewer