The Smartphone and the Driver’s Cognitive Workload: A Comparison of Apple, Google, and Microsoft’s Intelligent Personal Assistants

AAA Foundation for Traffic Safety · 2015 · AAA Foundation for Traffic Safety

archive: archived pipeline: cataloged verified

Get this paper ↗ (full text — opens at the source; we link to it, we don't host it)

Summary

This study investigates the cognitive workload imposed on drivers by voice-based interactions with three intelligent personal assistants: Apple’s Siri, Google’s Google Now, and Microsoft’s Cortana. Motivated by the rising prevalence of driver distraction and the development of NHTSA guidelines for in-vehicle electronic interfaces, the research aims to quantify how these smartphone systems affect driving performance compared to single-task driving and a high-workload baseline (the Operation Span task). The study seeks to determine if voice-based technology, often perceived as safer than manual interaction, still poses significant cognitive demands and whether differences exist between the platforms. The researchers conducted two on-road experiments using an instrumented vehicle on suburban roadways in Salt Lake City. Experiment 1 involved 31 participants performing number dialing, contact calling, and music selection tasks. Experiment 2 involved 34 participants sending voice-dictated text messages. Cognitive workload was measured using a Detection Response Task (DRT) to assess reaction time and hit rate, subjective ratings via the NASA TLX survey, and video analysis of vehicle speed, task completion time, and system errors. Participants used identical headphones and phone mounts to control for hardware variables. The design included a single-task baseline, the three smartphone conditions, and the OSPAN task as a high-workload anchor. Results indicated that cognitive workload was significantly higher during smartphone interactions than during single-task driving. Google’s system imposed lower cognitive demands than Apple’s Siri and Microsoft’s Cortana, which did not differ significantly from each other. Specifically, DRT reaction times and hit rates were better for Google than for Apple or Microsoft during off-task periods. During active interaction ("on-task"), the cognitive workload for Apple and Microsoft was comparable to the high-demand OSPAN task, whereas Google remained significantly less demanding. Video analysis revealed that the performance differences were associated with the number of system errors, task completion times, and subjective ratings of intuitiveness and complexity; Microsoft was rated as the least intuitive and most complex. Additionally, the study found residual cognitive costs that persisted after task completion, taking significant time to dissipate. The findings suggest that caution is warranted when using smartphone voice-based technology while driving, as these interactions impose high levels of cognitive workload that can impair driving performance. The study highlights that not all voice interfaces are equal; system design, error rates, and intuitiveness significantly influence driver workload. The results imply that while voice commands may reduce visual-manual distraction, they do not eliminate cognitive distraction, and some systems may be as mentally demanding as complex memory tasks. This has implications for the development of safety guidelines and the design of in-vehicle interfaces, emphasizing the need for systems that minimize cognitive load and error rates.

Key finding

During on-road voice interactions, Google Now imposed lower cognitive workload than Apple Siri and Microsoft Cortana, while on-task performance for all three systems often matched the demanding OSPAN baseline (Category 5 on the cognitive distraction scale).

Methodology

on_road

Sample size: N=31 (Experiment 1) and N=34 (Experiment 2) on-road driving studies

Provenance

The full processing record for this entry. Every stage of this paper's journey through the pipeline is logged — what ran, with which tool and model, how many attempts it took, and when it last completed. Discovered via bulk_ingest_aaa_foundation on 2026-05-23 (5 acquisition events logged).

Stage	Outcome	Tool	Model	Prompt	Attempts	Completed
discover	success	aaa_foundation	—	—	2	2026-05-23
archive	success	—	—	—	1	2026-05-23
extract	success	cached	—	—	2	2026-06-10
clean	success	—	—	—	1	2026-06-01
chunk	success	—	—	—	1	2026-06-01
embed	success	—	—	—	1	2026-06-02
enrich	success	—	—	—	1	2026-05-23
promote	success	—	—	—	1	2026-05-23
summarize	success	llm	qwen3.6-27b-prismaquant	summ-v5	2	2026-06-10
tag	success	vector_similarity	—	—	19	2026-06-11
verify	success	—	—	—	2	2026-06-10

Summary generated by qwen3.6-27b-prismaquant on 2026-06-10; verification: verified.

Topics

Ranked by relevance to this paper. Hover a topic for its definition.

Information type

What kind of knowledge this paper contributes, grouped by family — independent of topic (what it is about) and method (how it was studied).

Empirical Findings: behavioral performance data, self report data
Theoretical Contribution: theory or model