Engineering Design Automation via Imitation Learning and Reinforcement Learning

Date
2025-01-31
Abstract

Reinforcement Learning (RL) has achieved notable success in robotics and gaming, yet its application to automating engineering design faces significant challenges, including slow training times and poor generalization. Traditional RL methods must explore millions of design states, which is computationally expensive, especially when complex physics models are involved. In contrast, behavioral cloning, which trains an agent to mimic human designers from their recorded decision data, offers a more resource-efficient alternative. This thesis investigates the use of both RL and imitation learning to automate engineering design, using aircraft design as a surrogate task. We evaluate the performance of a behavioral cloning agent trained on human design decision data, employing recurrent neural network architectures: GRU, LSTM, and simple RNN. We define a metric, the Q-score, which quantifies design quality on a scale from 0 to 1, with higher values indicating better designs. Our findings show that the GRU architecture outperforms both LSTM and simple RNN in accuracy, achieving a Q-score of 0.8 after training on a relatively small dataset. The GRU model strikes a balance between accuracy, simplicity, and computational efficiency, making it particularly well suited to surrogate design tasks such as aircraft design optimization. Additionally, we assess the performance of RL agents, specifically Proximal Policy Optimization (PPO) and Advantage Actor-Critic (A2C), on the same design task. Both RL approaches achieved higher Q-scores (up to 0.99) but incurred significant computational cost and required extensive training time. In contrast, behavioral cloning provided a faster, more computationally efficient approach, though its performance was constrained by the availability of labeled human decision data.
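To make the behavioral-cloning setup concrete, the sketch below shows a minimal GRU cell (forward pass only) mapping a sequence of design-state vectors to an action distribution at each step. This is an illustrative NumPy sketch, not the thesis's implementation; all sizes, weight initializations, and the `predict_actions` readout are hypothetical.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class GRUCell:
    """Single GRU cell, forward pass only (illustrative, not the thesis's model)."""
    def __init__(self, input_size, hidden_size, seed=0):
        rng = np.random.default_rng(seed)
        def w(*shape):
            return rng.standard_normal(shape) * 0.1
        # Update gate z, reset gate r, candidate state h_tilde.
        self.Wz, self.Uz, self.bz = w(hidden_size, input_size), w(hidden_size, hidden_size), np.zeros(hidden_size)
        self.Wr, self.Ur, self.br = w(hidden_size, input_size), w(hidden_size, hidden_size), np.zeros(hidden_size)
        self.Wh, self.Uh, self.bh = w(hidden_size, input_size), w(hidden_size, hidden_size), np.zeros(hidden_size)
        self.hidden_size = hidden_size

    def step(self, x, h):
        z = sigmoid(self.Wz @ x + self.Uz @ h + self.bz)   # how much to update the state
        r = sigmoid(self.Wr @ x + self.Ur @ h + self.br)   # how much of the past to reset
        h_tilde = np.tanh(self.Wh @ x + self.Uh @ (r * h) + self.bh)
        return (1.0 - z) * h + z * h_tilde                  # blend old state and candidate

def predict_actions(cell, states, W_out):
    """Run design-state vectors through the GRU and return a softmax
    distribution over discrete design actions at each step."""
    h = np.zeros(cell.hidden_size)
    probs = []
    for x in states:
        h = cell.step(x, h)
        logits = W_out @ h
        e = np.exp(logits - logits.max())  # numerically stable softmax
        probs.append(e / e.sum())
    return np.array(probs)
```

In a behavioral-cloning loop, these per-step action distributions would be trained with cross-entropy against the human designer's recorded decisions, which is what ties the model's output to the imitation objective.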
The results suggest that while RL methods excel in exploration and optimization, imitation learning offers a faster and more resource-efficient solution, albeit with reduced exploration and adaptability.
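Both PPO and A2C rest on advantage estimates: how much better a chosen design action performed than the critic's value estimate predicted. The sketch below illustrates the discounted return, the resulting advantage, and PPO's clipped surrogate term; the discount factor `gamma` and clip range `eps` are hypothetical defaults, and this is a minimal sketch rather than the thesis's training code.

```python
def discounted_returns(rewards, gamma=0.99):
    """Return-to-go G_t = r_t + gamma * G_{t+1} for each timestep."""
    returns = [0.0] * len(rewards)
    running = 0.0
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns

def advantages(rewards, values, gamma=0.99):
    """A_t = G_t - V(s_t): how much better the action did than the critic expected."""
    return [g - v for g, v in zip(discounted_returns(rewards, gamma), values)]

def ppo_clip_term(ratio, adv, eps=0.2):
    """PPO's clipped surrogate for one sample:
    min(ratio * A, clip(ratio, 1 - eps, 1 + eps) * A),
    where ratio is the new/old policy probability ratio."""
    clipped = max(1.0 - eps, min(1.0 + eps, ratio))
    return min(ratio * adv, clipped * adv)
```

A2C maximizes `ratio * adv` directly (with `ratio` implicit in the log-probability gradient), while PPO's clipping keeps each policy update close to the data-collecting policy, which is part of why it tolerates the long training runs the abstract describes.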

Citation
Bozorgmehry Boozarjomehry, G. (2025). Engineering design automation via imitation learning and reinforcement learning (Master's thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca.