Deep InSight: A Driver-State Estimation Platform for Processing Naturalistic Driving Data

Sharma, Anuj; Sarkar, Soumik; Hegde, Chinmay; Ozcan, Koray; Velipasalar, Senem; Rizzo, Matthew; Merickel, Jennifer; Adu-Gyamfi, Yaw · 2024 · ROSA P / United States. Department of Transportation. Federal Highway Administration. Office of Research, Development, and Technology

archive: archived pipeline: cataloged verified

Get this paper ↗ (full text — opens at the source; we link to it, we don't host it)

Summary

This paper presents Deep InSight, a cloud-based artificial intelligence platform designed to automate the estimation of driver states from naturalistic driving study (NDS) data. The research addresses the significant challenges associated with analyzing large-scale NDS datasets, which are critical for understanding crash risks but require time-consuming and expensive manual annotation. By developing a robust system for data storage, automated annotation, and model deployment, the project aims to enhance the efficiency and scalability of transportation safety research. The platform was developed by a multi-university team and utilizes high-performance computing to manage raw, processed, and annotated data securely. The researchers trained and tested various machine learning models using the Mind and Brain Health Labs dataset, which includes over 500,000 miles of driving data from 143 participants, and a newly created synthetic distracted driving dataset. The methodology involved developing recurrent neural network models for head pose estimation, vision-language models for behavior analysis, and segmentation algorithms for maneuver detection. Additionally, the team explored generative adversarial networks to enhance low-quality video footage. Key findings demonstrate that the platform’s automated models significantly outperform traditional baselines. For head pose estimation, a bidirectional gated recurrent unit model achieved an 11 percent accuracy improvement over baseline methods, effectively handling challenging angles where drivers look down or to the side. For distracted driving detection, the DriveCLIP framework, which leverages contrastive language-image pretraining, outperformed traditional deep learning models across multiple datasets, particularly in zero-shot transfer scenarios. Driver maneuver detection models, utilizing an energy-maximization algorithm for event segmentation, achieved accuracies up to 98.99 percent. However, experiments with video quality enhancement revealed that standard generative adversarial networks struggled to improve detection performance in low-light conditions, often introducing artifacts or noise. The Deep InSight platform is currently assessed at Technology Readiness Level 4, indicating validation in a laboratory environment. It serves as a centralized repository for models and datasets, facilitating collaboration and benchmarking among research teams. The study concludes that automating annotation and leveraging advanced AI models can substantially reduce the labor required for NDS analysis. Future work includes expanding operational requirements, testing the platform with diverse university datasets, and developing a new Naturalistic Health and Mobility Data platform to further integrate health and mobility observations.

Key finding

The Deep InSight platform successfully automated driver-state estimation, with its bidirectional gated recurrent unit model outperforming baselines by 11 percent in head pose accuracy and its CLIP-based framework achieving state-of-the-art performance in zero-shot distracted driving detection.

Methodology

lab_experiment

Sample size: 243

Provenance

The full processing record for this entry. Every stage of this paper's journey through the pipeline is logged — what ran, with which tool and model, how many attempts it took, and when it last completed. Discovered via bulk_ingest_rosap on 2026-05-23 (6 acquisition events logged).

Stage	Outcome	Tool	Model	Prompt	Attempts	Completed
discover	success	rosap	—	—	2	2026-05-23
archive	success	—	—	—	1	2026-05-23
extract	success	cached	—	—	2	2026-06-10
clean	success	—	—	—	1	2026-06-01
chunk	success	—	—	—	1	2026-06-01
embed	success	—	—	—	1	2026-06-02
enrich	success	—	—	—	1	2026-05-23
promote	success	—	—	—	1	2026-05-23
summarize	success	llm	qwen3.6-27b-prismaquant	summ-v5	3	2026-06-10
tag	success	vector_similarity	—	—	19	2026-06-11
verify	success	—	—	—	2	2026-06-10

Summary generated by qwen3.6-27b-prismaquant on 2026-06-10; verification: verified.

Topics

Ranked by relevance to this paper. Hover a topic for its definition.

Information type

What kind of knowledge this paper contributes, grouped by family — independent of topic (what it is about) and method (how it was studied).

Methodological Resource: tool software, dataset resource
Theoretical Contribution: computational model