Learning Latent Traits for Simulated Cooperative Driving Tasks

DeCastro, Jonathan; Gopinath, Deepak; Rosman, Guy; Sumner, Emily; Hakimi, Shabnam; Stent, Simon · 2022 · arXiv (Cornell University)

DOI: 10.48550/arxiv.2207.09619

archive: archived pipeline: cataloged verified

Get this paper ↗ (DOI — opens at the source; we link to it, we don't host it)

Summary

This paper addresses the challenge of creating adaptive human-machine teaming strategies for cooperative driving, specifically focusing on how AI assistants can tailor their interventions to individual driver traits. Current driver-assistance systems are largely non-adaptive, offering help irrespective of a driver’s specific needs, which can reduce trust and performance. The authors propose a framework that learns a compact latent representation of human drivers based on behavioral patterns and preferences, allowing the system to deploy personalized interaction policies. The motivation is to move beyond reactive systems by embedding drivers in a latent space of behavior types, enabling the AI to distinguish between traits such as distractibility and cautiousness to provide appropriate assistance. To achieve this, the authors introduce a lightweight simulation environment called HMIway-env, built upon highway-env, which models distracted driving behaviors including distractibility, cautiousness, and receptivity to alerts. The framework employs a trait- and preference-aware Adversarial Inverse Reinforcement Learning (IRL) approach. This method uses a generative adversarial network structure where a discriminator learns a reward function and a context encoder (using an LSTM) to capture driver traits from trajectory data, while a generator learns a policy conditioned on these latent traits. The training process utilizes average pooling to aggregate behavior history for individual drivers and applies specific loss functions to encourage mutual information between trajectories and latent variables, as well as contrastive separation of different driver types. The study simulates four distinct driver archetypes—Lisa, Marge, Bart, and Homer—varying in confidence, attentiveness, and alert receptivity. The results demonstrate that the framework can effectively discriminate between different driver types and learn appropriate intervention policies. By aggregating observations over multiple driving samples, the model successfully identifies latent structures corresponding to cognitive factors like distraction and caution. The simulation validates that the learned policies can distinguish between drivers who ignore assistance (e.g., Lisa, Bart) and those who are receptive (e.g., Marge, Homer), as well as those prone to distraction. The approach proves capable of generating effective interaction strategies tailored to these specific behavioral profiles within the simulated highway merge scenarios. The significance of this work lies in its potential to enhance human-AI cooperation in safety-critical domains like driving. By leveraging latent representations of driver traits, the system can personalize assistance, potentially improving task performance and trust. The proposed framework offers a data-driven method for uncovering individual preferences from observational data, addressing the limitation of large-scale driving datasets that often lack direct measures of driver traits. This approach lays the groundwork for more nuanced, human-aware autonomous systems that can adapt their support to the unique characteristics of each user.

Key finding

The proposed framework successfully learns latent representations of driver traits from simulated data, enabling the discrimination of driver types and the deployment of more effective, personalized intervention policies.

Methodology

simulation_modeling

Provenance

The full processing record for this entry. Every stage of this paper's journey through the pipeline is logged — what ran, with which tool and model, how many attempts it took, and when it last completed. Discovered via author_sweep_intake on 2026-05-28.

StageOutcomeToolModelPromptAttemptsCompleted
discover success author_sweep 2 2026-05-28
archive success canonical_url 1 2026-06-04
extract success cached 3 2026-06-10
clean success clean 1 2026-06-04
chunk success chunk 1 2026-06-04
embed success embed Qwen/Qwen3-Embedding-8B 1 2026-06-04
enrich success 1 2026-05-28
promote success 1 2026-06-04
summarize success llm qwen3.6-27b-prismaquant summ-v5 2 2026-06-10
tag success vector_similarity 15 2026-06-11
verify success 2 2026-06-10

Summary generated by qwen3.6-27b-prismaquant on 2026-06-10; verification: verified.

Topics

Ranked by relevance to this paper. Hover a topic for its definition.

Information type

What kind of knowledge this paper contributes, grouped by family — independent of topic (what it is about) and method (how it was studied).