A Deep Reinforcement Learning Based Motion Cueing Algorithm for Vehicle Driving Simulation

Scheidel, Hendrik; Asadi, Houshyar; Bellmann, Tobias; Chen, I-Ming; Mohamed, Shady; Nahavandi, Saeid · 2024 · IEEE Transactions on Vehicular Technology

DOI: 10.1109/tvt.2024.3375941

archive: archived pipeline: cataloged verified

Get this paper ↗ (DOI — opens at the source; we link to it, we don't host it)

Summary

This paper addresses the challenge of designing Motion Cueing Algorithms (MCAs) for vehicle driving simulation platforms (MSPs). Existing MCA approaches, such as classical washout filters or Model Predictive Control (MPC), often suffer from suboptimal performance due to linearization and filtering artifacts, or they fail to meet real-time computational requirements. The authors propose a novel solution using Deep Reinforcement Learning (DRL) to autonomously learn an optimal control strategy. Unlike traditional methods where human engineers specify control principles, this approach allows an AI agent to learn the optimal motion through trial-and-error interaction with a simulated MSP, aiming to reproduce the driver’s motion perception accurately while respecting workspace limits. The methodology employs the Proximal Policy Optimization (PPO) algorithm, a policy gradient method implemented via the Stable-Baselines3 library in Python. The problem is formulated as a Markov Decision Process (MDP) where the agent interacts with an environment comprising a simulated MSP and a vehicle dynamics model. The agent’s actions consist of the rate of change of translational acceleration and angular velocity, rather than direct position commands, to ensure stability during stochastic exploration. The state vector includes simulator kinematics and target vehicle signals. A complex reward function guides the learning process, penalizing deviations in specific forces and angular velocities, directional fidelity errors, and workspace violations, while encouraging tilt coordination. The training data consists of pre-recorded lateral maneuvers, such as lane changes, generated using a dynamics model of the DLR ROboMObil. The study demonstrates that the DRL-based MCA successfully learns a control strategy that improves immersion quality compared to established methods. The trained Artificial Neural Network (ANN) effectively maps states to actions, allowing for real-time execution with minimal computational resources, thereby overcoming the latency issues associated with MPC-based approaches. The results indicate that the RL algorithm accurately reproduces perceived motion signals determined by a vestibular system model and utilizes the MSP’s workspace more economically. By avoiding the phase shifts and inefficiencies inherent in filter-based methods, the ANN-based MCA provides a flexible and computationally efficient solution. The significance of this work lies in its introduction of the first fully ANN-based MCA realized through deep reinforcement learning. This approach eliminates the need for explicit human specification of control logic, allowing the AI to discover creative solutions for complex, non-linear workspaces, such as those found in robotic motion platforms. The findings suggest that DRL can replace computationally heavy optimization processes with lightweight inference, enabling high-fidelity motion simulation in real-time applications. This advancement has implications for improving the realism and efficiency of simulators in aerospace, automotive, and training industries.

Key finding

A deep reinforcement learning-based motion cueing algorithm improves the quality of motion simulation immersion and resource efficiency compared to established methods while meeting real-time computational requirements.

Methodology

simulation_modeling

Provenance

The full processing record for this entry. Every stage of this paper's journey through the pipeline is logged — what ran, with which tool and model, how many attempts it took, and when it last completed. Discovered via author_sweep_intake on 2026-05-28 (2 acquisition events logged).

Stage	Outcome	Tool	Model	Prompt	Attempts	Completed
discover	success	author_sweep	—	—	3	2026-05-29
archive	success	canonical_url	—	—	5	2026-06-06
extract	success	cached	—	—	3	2026-06-10
clean	success	clean	—	—	1	2026-06-04
chunk	success	chunk	—	—	1	2026-06-04
embed	success	embed	Qwen/Qwen3-Embedding-8B	—	1	2026-06-04
enrich	success	—	—	—	1	2026-05-28
promote	success	—	—	—	1	2026-06-04
summarize	success	llm	qwen3.6-27b-prismaquant	summ-v5	2	2026-06-10
tag	success	vector_similarity	—	—	15	2026-06-11
verify	success	—	—	—	2	2026-06-10

Summary generated by qwen3.6-27b-prismaquant on 2026-06-10; verification: verified.

Topics

Ranked by relevance to this paper. Hover a topic for its definition.

Information type

What kind of knowledge this paper contributes, grouped by family — independent of topic (what it is about) and method (how it was studied).

Methodological Resource: tool software
Theoretical Contribution: computational model